<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>Ecclesiastical Proust Archive</title>
	<atom:link href="http://www.proustarchive.org/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://www.proustarchive.org</link>
	<description>A Textual / Visual Resource</description>
	<pubDate>Mon, 09 May 2011 17:32:52 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
	<language>en</language>
			<item>
		<title>À l&#8217;Ombre des Mains en Latex Violet</title>
		<link>http://www.proustarchive.org/?p=62</link>
		<comments>http://www.proustarchive.org/?p=62#comments</comments>
		<pubDate>Mon, 09 May 2011 17:32:52 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[archives]]></category>

		<category><![CDATA[digital]]></category>

		<category><![CDATA[editing]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=62</guid>
		<description><![CDATA[
]]></description>
			<content:encoded><![CDATA[<p><img src="http://farm3.static.flickr.com/2358/5704239934_0dea76ea78_z.jpg" alt="" width="453" height="640" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=62</wfw:commentRss>
		</item>
		<item>
		<title>Proust</title>
		<link>http://www.proustarchive.org/?p=61</link>
		<comments>http://www.proustarchive.org/?p=61#comments</comments>
		<pubDate>Wed, 30 Mar 2011 01:27:12 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[networked media]]></category>

		<category><![CDATA[other websites]]></category>

		<category><![CDATA[social software]]></category>

		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=61</guid>
		<description><![CDATA[Just found out about this website called Proust that facilitates asking life-defining questions of your loved ones and having deep conversations with them. It works like a questionnaire. Not really sure how this is different from Story Corps, or why you wouldn&#8217;t want to just ask your loved ones in person.

Introduction To Proust.com from Proust [...]]]></description>
			<content:encoded><![CDATA[<p>Just found out about this website called <a href="http://www.proust.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/www.proust.com');" target="_blank">Proust</a> that facilitates asking life-defining questions of your loved ones and having deep conversations with them. It works like a questionnaire. Not really sure how this is different from Story Corps, or why you wouldn&#8217;t want to just ask your loved ones in person.</p>
<p><iframe src="http://player.vimeo.com/video/15227196?title=0&amp;byline=0&amp;portrait=0" width="400" height="225" frameborder="0"></iframe>
<p><a href="http://vimeo.com/15227196" onclick="javascript:pageTracker._trackPageview('/outbound/article/vimeo.com');">Introduction To Proust.com</a> from <a href="http://vimeo.com/proust" onclick="javascript:pageTracker._trackPageview('/outbound/article/vimeo.com');">Proust</a> on <a href="http://vimeo.com" onclick="javascript:pageTracker._trackPageview('/outbound/article/vimeo.com');">Vimeo</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=61</wfw:commentRss>
		</item>
		<item>
		<title>Foray Into Topic Modeling</title>
		<link>http://www.proustarchive.org/?p=60</link>
		<comments>http://www.proustarchive.org/?p=60#comments</comments>
		<pubDate>Thu, 17 Mar 2011 15:45:22 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[data mining]]></category>

		<category><![CDATA[narrative]]></category>

		<category><![CDATA[taxonomy]]></category>

		<category><![CDATA[text mining]]></category>

		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=60</guid>
		<description><![CDATA[Topic modeling suggests new avenues for Proust studies. Applications like Mallet and PhiloMine compute the statistical relationships among tokens (as single, double, or triple word phrases) appearing within specified spans of text such as paragraphs or groups of, say, fifty words. Since the Recherche embodies more than one million words, topic modeling can be used [...]]]></description>
			<content:encoded><![CDATA[<p>Topic modeling suggests new avenues for Proust studies. Applications like Mallet and PhiloMine compute the statistical relationships among tokens (as single, double, or triple word phrases) appearing within specified spans of text such as paragraphs or groups of, say, fifty words. Since the <em>Recherche</em> embodies more than one million words, topic modeling can be used to highlight features of the text that are not perceptible during the act of serial reading. I ran the first tome, which contains <em>Du côté de chez Swann</em> part I, “Combray,” and part II, “Un Amour de Swann,” through Mallet to generate token clusters for ten topics, which reveals some interesting patterns. The command line output shows the top nineteen recurring words that are statistically significant within the top ten recurring patterns (topics) in the text.</p>
<ol>
<li>chose moment pouvait jamais puis rien esprit pourtant visage savait voulait dire savoir mal trouvait première devait autres instant</li>
<li>dit bien dire air jamais beaucoup tête toujours princesse ami docteur reste choses sais enfin regard répondit jeune entendu</li>
<li>vie amour plaisir souvent celle ainsi gilberte pu pensée besoin donnait tant sorte milieu cause femmes étais connaître joie</li>
<li>après temps jusqu heure pendant allait presque chambre longtemps près seul passer heures penser jour tard souvenir chercher toute</li>
<li>combray côté déjà rue soleil semblait fleurs saint bois place eau ciel petits vers jardin matin champs dessus autour</li>
<li>faisait toutes petite peine seule beau toute sourire donner phrase quelques trouver parfois contraire nature suite musique croire corps</li>
<li>swann odette chez verdurin monde disait gens femme forcheville homme soir effet amis connaissait demander personne cœur cottard</li>
<li>voir faire aller autres jours jour toujours maison venait venir désir grande contre dès autant paris rien lequel bien</li>
<li>grand tante mère père françoise faire bien fille disait parents maman voix partie personne bonne petit mort famille laisser</li>
<li>devant guermantes yeux nom air petit surtout or doute mieux église image fit vue dame tant aussitôt figure lesquelles</li>
</ol>
<p>Some of the results are unsurprising, such as topic 7, which clearly derives from the many evening scenes at the Verdurins (<em>soir</em>, <em>chez</em>, <em>maison</em>) where Swann courted Odette among their coterie (<em>forcheville</em>, <em>cottard</em>), often becoming jealously heartbroken (<em>cœur</em>, <em>désir</em>) with wondering whether she was seeing other admirers on the sly (<em>demander</em>, <em>conaissait</em>, <em>amis</em>). Other topics reveal interesting patterns that fit with scenes across the entire narrative, such as number 10. It emphasizes the use and observation of the eyes (<em>yeux</em>, <em>vue</em>) in connection with the Duc and Duchesse de Guermantes, whose mysterious airs and glances are described in several Combray church passages, as well as their association with art and symbolism of France (<em>image</em>, <em>figure</em>). But what also emerges is the consistency of the preposition before (<em>devant</em>), emphasizing the narrator&#8217;s location not only in front of their paintings and of their glances, but also in front of a church in connection to a woman (<em>dame</em>), a recurrence that we can tease out by reading the database passages from the English translation.</p>
<p>Using a PHP script and MySQL database (graciously provided by <a href="https://dhs.stanford.edu/author/emeeks/" onclick="javascript:pageTracker._trackPageview('/outbound/article/dhs.stanford.edu');" target="_blank">Elijah Meeks</a>), we can extract the tokens, word counts, and their connections from the Mallet topic model files into a graph file that generates edges and nodes, allowing us to view the ten topics as a network model in Gephi.</p>
<p><img src="http://farm6.static.flickr.com/5180/5535063164_f57b07d05a_z.jpg" alt="" width="550" height="484" /></p>
<p>This entirely computer-generated model of associative networks in tome 1 of the <em>Recherche</em> is markedly different from the static model created by my particular reading of the church motif above, though it shares some consistencies and interesting disparities.</p>
<p>For instance, when we drill down and filter to look more closely at the terms that join the different topics, we see that the word for nothing (<em>rien</em>) is the one that most frequently connects topics 6 and 9, which respectively center on themes of beautiful bodily gestures in music and family/home relationships, while time (<em>temps</em>) joins topic 6 with 3, which is focused on positive terms for love of Gilberte.</p>
<p><img src="http://farm6.static.flickr.com/5140/5534484067_d9123f65c5_b.jpg" alt="" width="550" height="724" /></p>
<p>According to the statistical features of the text, then, the first two parts of <em>Du côté de chez Swann</em> associate the expression of romantic love primarily with time, while the memory of familial love is associated primarily with absence. This perhaps comes as no shock to most readers of Proust, but if we compare this model with a search for the term nothing in the church motif database, we receive a number of passages associated predominantly with romantic love. These two fields of data, then, suggest a reading of the church motif as concerned with concepts of absence in romantic love, somewhat against the grain of the rest of the novel. There is not enough space here to deal with the problematics of translation/tutor text comparisons or the relation of computation algorithms to critical interpretation. But it is clear that domain expertise is just as necessary with digital scholarship as it is in print, as shown by the (illuminating) disparities between a human-reading and machine-reading of the text.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=60</wfw:commentRss>
		</item>
		<item>
		<title>French Stop Words / Mots d&#8217;arrêt français</title>
		<link>http://www.proustarchive.org/?p=59</link>
		<comments>http://www.proustarchive.org/?p=59#comments</comments>
		<pubDate>Wed, 09 Mar 2011 19:40:15 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[data mining]]></category>

		<category><![CDATA[digital humanities]]></category>

		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=59</guid>
		<description><![CDATA[I&#8217;ve searched for French stop word lists for use in text mining and synthesized my findings here. It may not be definitive, but could be useful for those looking for stop lists.
J&#8217;ai effectué une recherche pour des listes de mots d&#8217;arrêt français pour utilisation dans l&#8217;exploration de texte et synthétisé mes conclusions ici. Il ne [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve searched for French stop word lists for use in text mining and synthesized my findings here. It may not be definitive, but could be useful for those looking for stop lists.</p>
<p>J&#8217;ai effectué une recherche pour des listes de mots d&#8217;arrêt français pour utilisation dans l&#8217;exploration de texte et synthétisé mes conclusions ici. Il ne peut être définitive, mais pourrait être utile pour ceux qui recherchent des listes d&#8217;arrêt.</p>
<p>à<br />
ah<br />
ai<br />
aie<br />
aient<br />
aies<br />
ait<br />
alors<br />
as<br />
au<br />
aucuns<br />
aurai<br />
auraient<br />
aurais<br />
aurait<br />
auras<br />
auriez<br />
aurions<br />
aussi<br />
autre<br />
aux<br />
avaient<br />
avais<br />
avait<br />
avant<br />
avec<br />
avez<br />
aviez<br />
avions<br />
avoir<br />
avons<br />
ayant<br />
ayez<br />
ayons<br />
bon<br />
car<br />
ce<br />
ceci<br />
cela<br />
celà<br />
celles<br />
celui<br />
ces<br />
cet<br />
cette<br />
ceux<br />
chaque<br />
ci<br />
comme<br />
comment<br />
dans<br />
de<br />
des<br />
du<br />
dedans<br />
dehors<br />
depuis<br />
deux<br />
devrait<br />
doit<br />
donc<br />
dont<br />
dos<br />
droite<br />
début<br />
elle<br />
elles<br />
en<br />
encore<br />
es<br />
essai<br />
est<br />
et<br />
eu<br />
eue<br />
eues<br />
eusse<br />
eusses<br />
eûmes<br />
eurent<br />
eus<br />
eussions<br />
eussiez<br />
eut<br />
eût<br />
eûtes<br />
eux<br />
fait<br />
faites<br />
fois<br />
font<br />
force<br />
fûmes<br />
furent<br />
fus<br />
fusse<br />
fussent<br />
fusses<br />
fussions<br />
fussiez<br />
fut<br />
fût<br />
fûtes<br />
haut<br />
hors<br />
ici<br />
il<br />
ils<br />
je<br />
jusqu’à<br />
juste<br />
la<br />
laquelle<br />
lequelle<br />
le<br />
les<br />
leur<br />
leurs<br />
lui<br />
là<br />
ma<br />
maintenant<br />
mais<br />
me<br />
mes<br />
mine<br />
moi<br />
moins<br />
mon<br />
mot<br />
même<br />
ne<br />
ni<br />
nommés<br />
non<br />
nos<br />
notre<br />
nous<br />
nouveaux<br />
on<br />
ont<br />
ou<br />
où<br />
par<br />
parce<br />
parole<br />
pas<br />
personnes<br />
peut<br />
peut-être<br />
peu<br />
pièce<br />
plupart<br />
plus<br />
pour<br />
pourquoi<br />
quand<br />
qu<br />
que<br />
quel<br />
quelle<br />
quelles<br />
quelque<br />
quels<br />
qui<br />
sa<br />
sans<br />
se<br />
sera<br />
serai<br />
seraient<br />
serais<br />
serait<br />
seras<br />
serez<br />
seriez<br />
serions<br />
serons<br />
seront<br />
ses<br />
seulement<br />
si<br />
sien<br />
soi<br />
soient<br />
sois<br />
soit<br />
somme<br />
sommes<br />
son<br />
sont<br />
sous<br />
soyez<br />
soyons<br />
suis<br />
sujet<br />
sur<br />
ta<br />
tandis<br />
te<br />
tellement<br />
tels<br />
tes<br />
toi<br />
ton<br />
tous<br />
tout<br />
trop<br />
très<br />
tu<br />
un<br />
une<br />
valeur<br />
voie<br />
voient<br />
vont<br />
vos<br />
votre<br />
vous<br />
vu<br />
ça<br />
étaient<br />
était<br />
étant<br />
état<br />
étions<br />
été<br />
étée<br />
étées<br />
étés<br />
êtes<br />
être</p>
<p>mme<br />
mlle</p>
<p>a<br />
b<br />
c<br />
d<br />
e<br />
f<br />
g<br />
h<br />
i<br />
j<br />
k<br />
l<br />
m<br />
n<br />
o<br />
p<br />
q<br />
r<br />
s<br />
t<br />
u<br />
v<br />
w<br />
x<br />
y<br />
z<br />
1<br />
2<br />
3<br />
4<br />
5<br />
6<br />
7<br />
8<br />
9<br />
10<br />
~<br />
`<br />
!<br />
@<br />
#<br />
$<br />
%<br />
^<br />
&amp;<br />
*<br />
(<br />
)<br />
_<br />
-<br />
=<br />
+<br />
{<br />
}<br />
[<br />
]<br />
&#8220;<br />
&#8216;<br />
;<br />
:<br />
,<br />
.<br />
/<br />
&lt;<br />
&gt;<br />
?<br />
«<br />
»</p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=59</wfw:commentRss>
		</item>
		<item>
		<title>Gorgeous Network Visualizations with Gephi</title>
		<link>http://www.proustarchive.org/?p=58</link>
		<comments>http://www.proustarchive.org/?p=58#comments</comments>
		<pubDate>Fri, 03 Dec 2010 04:11:17 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=58</guid>
		<description><![CDATA[I just started playing around with Gephi, which produces really amazing interactive graphs for intuitive data analysis. Thought I would post one here. Not sure how to read or use it yet, but it shows the entire church motif in the Recherche.

]]></description>
			<content:encoded><![CDATA[<p>I just started playing around with <a href="http://gephi.org/" onclick="javascript:pageTracker._trackPageview('/outbound/article/gephi.org');" target="_blank">Gephi</a>, which produces really amazing interactive graphs for intuitive data analysis. Thought I would post one here. Not sure how to read or use it yet, but it shows the entire church motif in the <em>Recherche</em>.</p>
<p><a href="http://farm6.static.flickr.com/5165/5228184976_9569a53d14_b.jpg" onclick="javascript:pageTracker._trackPageview('/outbound/article/farm6.static.flickr.com');" target="_blank"><img src="http://farm6.static.flickr.com/5165/5228184976_9569a53d14.jpg" alt="" width="500" height="454" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=58</wfw:commentRss>
		</item>
		<item>
		<title>A Little Close Reading with Network Analysis Software</title>
		<link>http://www.proustarchive.org/?p=57</link>
		<comments>http://www.proustarchive.org/?p=57#comments</comments>
		<pubDate>Tue, 30 Nov 2010 17:12:04 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[archives]]></category>

		<category><![CDATA[churches / cathedrals]]></category>

		<category><![CDATA[digital]]></category>

		<category><![CDATA[digital humanities]]></category>

		<category><![CDATA[narrative]]></category>

		<category><![CDATA[networked media]]></category>

		<category><![CDATA[novel]]></category>

		<category><![CDATA[proust]]></category>

		<category><![CDATA[textual criticism]]></category>

		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=57</guid>
		<description><![CDATA[I thought I would do some closer &#8220;distant-reading&#8221; of the Recherche. When using ORA to look at a metanetwork, the visualization can be manipulated in real time to highlight the links among nodes and their related concepts or passages.  What this means for the study of Proust is that we can think of the novel [...]]]></description>
			<content:encoded><![CDATA[<p>I thought I would do some closer &#8220;distant-reading&#8221; of the <em>Recherche</em>. When using ORA to look at a metanetwork, the visualization can be manipulated in real time to highlight the links among nodes and their related concepts or passages.  What this means for the study of Proust is that we can think of the novel (and the novel genre) as a network of nodes consisting of concepts, characters, narrative elements, and any other unit of meaning that might enhance exploration of its text.</p>
<p>For instance, isolating the network constellated around the note “Finding common element of formative impressions” shows that the narrator&#8217;s activity of reflecting upon his formative impressions is primarily connected to the associations of Memory, Music, and Literature.<br />
<a href="http://farm5.static.flickr.com/4114/5220817407_2ccd8f1afd_b.jpg" onclick="javascript:pageTracker._trackPageview('/outbound/article/farm5.static.flickr.com');" target="_blank"><img src="http://farm5.static.flickr.com/4114/5220817407_2ccd8f1afd.jpg" alt="" /></a><br />
The Memory association in turn connects with “Memory at Grandmother&#8217;s deathbed”; Music connects with “End of Mass at Combray church, return home” and “Vinteul&#8217;s sonata played at Swann&#8217;s”; and Literature connects with “Charlus berating Marcel,” “Epiphany at Guermantes&#8217; Party,” and “Describing morning routine back in Paris, after second Balbec visit.” When the nodes above are moused over to show the passages they represent, we see that most of them, from various sequences involving musical performance or literary discussion from all over the novel, refer to the twin steeples at Martinville that formed the subject of the narrator&#8217;s first piece of writing as a youth (I.253-257). Thus, the steeples form an orientation point for that part of the narrator&#8217;s writerly vocation that pertains to analysis of impressions, filtered primarily through memories of music and literature. These connections are not apparent by searching the database at the Ecclesiastical Proust Archive because it does not provide the simultaneous view of layered networks afforded by ORA. It seems on the surface, then, that this particular network within the Recherche forms a theory of impressionism based on the structural commonalities between music and literature.</p>
<p>The network for the Time association similarly connects various types of recollection to provide insight into the narrator&#8217;s artistic development.<br />
<a href="http://farm5.static.flickr.com/4139/5221415688_d9ba780cd6_b.jpg" onclick="javascript:pageTracker._trackPageview('/outbound/article/farm5.static.flickr.com');" target="_blank"><img src="http://farm5.static.flickr.com/4139/5221415688_d9ba780cd6.jpg" alt="" width="500" height="261" /></a><br />
We find Time at the center, ringed by “Contemplation sparked by conversation with M. de Cambremer, at Guermantes party,” “Imagining Florence and Venice (before visit),” “Contemplating experience of Vinteuil&#8217;s sonata while jealous of Mlle Vinteuil and Albertine,” “Contemplating women and past,” “Observations at Guermantes party,” and “First visit to Balbec.” The last in turn connects with Narthex and Carqueville. In other words, the primary function of Time as a backwards-looking concept is associated with jealousy over women, while the forward-looking passages imagine the reddish domes of Florence and the frescos of Venice. This suggests a deepening of the structure that became apparent in the database searches, where the thought of meeting a future lover in early passages, though not explicitly concerned with the nature of time, took place on the porch of a gothic cathedral. These nodes presented by ORA show that the church passages that consciously deal with the nature of time happen after the narrator as had experience being in love with women. And correspondingly, the architectural element of this ring is the narthex, which is the entrance area just indoors or on the threshold to the porch. The narthex was not considered part of the church proper, but was placed close enough so that those not worthy of entry, such as the unbaptized or unconfessed, could still receive instruction from services. Hence, the experience of love has brought the narrator past the porch but, because he is lost through jealousy, he still remains an outsider.</p>
<p>The Truth network presents a very clear view of the novel&#8217;s main thematic chains and character developments.<br />
<a href="http://farm5.static.flickr.com/4088/5220817467_35eea0d6aa_b.jpg" onclick="javascript:pageTracker._trackPageview('/outbound/article/farm5.static.flickr.com');" target="_blank"><img src="http://farm5.static.flickr.com/4088/5220817467_35eea0d6aa.jpg" alt="" width="500" height="320" /></a><br />
With Truth at the center of the middle network, the first ring comprises “Riding in Dr. Percepied&#8217;s carriage” (the moment at which he observed in motion the twin steeples of Martinville), “Reflections on getting the truth about Albertine from Andrée” (in which he had final confirmation of his lack of knowledge about Albertine&#8217;s lesbianism, the root of his obsessive jealousy), “Reading Bergotte” (the writer who most influenced his literary sensibility, and who figures so prominently in his appreciation for churches), and “Reflection on Charlus&#8217; perversion” (the unmasking of homosexuality as a major recurring element of the novel&#8217;s concern with epistemology). What we also see in the picture above are two micro networks that are not directly connected, yet were placed close to the Truth network because of their conceptual affinity. If we take all three networks into consideration, the second ring around the Truth association comprises Motion, Laws, the Archaic, Beauty, and Knowledge, which further connect with three passages about household habits and the Great War. Taken together, Truth in Proust&#8217;s novel can ultimately be understood as a rather stable essence based on the epistemic laws of motion and observation, as well as the aesthetic laws of beauty as evident in old objects. These, too, are a function of Time. While this visualization might not provide much insight that is new in Proust studies, the interface at least allows the reader instantly to access the passages that contribute to a given part of the network.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=57</wfw:commentRss>
		</item>
		<item>
		<title>Tome I Word Cloud</title>
		<link>http://www.proustarchive.org/?p=56</link>
		<comments>http://www.proustarchive.org/?p=56#comments</comments>
		<pubDate>Wed, 17 Nov 2010 04:47:26 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=56</guid>
		<description><![CDATA[Generated by Wordle. Wordle recognized the French and stripped out the common words, but many of them, like comme, quand, et, si, etc., still crept in. Interesting, though, that in this visualization of absolute word frequency, the words Swann and Odette are weighted as heavy or heavier than many prepositions and conjunctions. Given that this [...]]]></description>
			<content:encoded><![CDATA[<p>Generated by Wordle. Wordle recognized the French and stripped out the common words, but many of them, like comme, quand, et, si, etc., still crept in. Interesting, though, that in this visualization of absolute word frequency, the words Swann and Odette are weighted as heavy or heavier than many prepositions and conjunctions. Given that this tome covers the &#8220;Combray&#8221; and &#8220;Swann in Love&#8221; sections, it accounts for the narrator&#8217;s obsession with the pair in his early childhood, and likewise Swann&#8217;s obsession with Odette in the years before the narrator&#8217;s birth. I would have expected words like église or fenêtre or mère to weigh heavier. Interesting too that the other meaningful words that make it into this cloud are Verdurin (emphasizing the salon and the coterie culture), yeux (where the narrator reads the souls of others), Françoise (who is mentioned &#8212; and valued? &#8212; more than his mother), and tante (Léonie, the relative in residence at Combray).</p>
<p><img src="http://farm2.static.flickr.com/1022/5183856732_e2ff0dac47.jpg" alt="" width="500" height="318" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=56</wfw:commentRss>
		</item>
		<item>
		<title>Upcoming Lecture &#8212; Digital Methods for Literary Criticism: Proust, Illustration, and the Archive</title>
		<link>http://www.proustarchive.org/?p=55</link>
		<comments>http://www.proustarchive.org/?p=55#comments</comments>
		<pubDate>Sun, 07 Nov 2010 03:40:08 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[events]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=55</guid>
		<description><![CDATA[I&#8217;m giving a lecture on some of my recent digital research on Proust. The talk will cover methods in text annotation and visualization, with a view toward their theoretical implications for literary criticism. Along the way it will describe some of my experiments with text mining and social network analysis for generating and representing associative [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m giving a lecture on some of my recent digital research on Proust. The talk will cover methods in text annotation and visualization, with a view toward their theoretical implications for literary criticism. Along the way it will describe some of my experiments with text mining and social network analysis for generating and representing associative paths.</p>
<ul>
<li>Wednesday November 17, 5:00-7:00 pm</li>
<li>Lucy Ellis Lounge, 1st floor Foreign Language Building</li>
<li>University of Illinois, Urbana-Champaign</li>
</ul>
<p><img class="alignnone" src="http://farm2.static.flickr.com/1088/5153296762_1fe3ee8c73_z.jpg" alt="Proust Flyer" width="479" height="640" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=55</wfw:commentRss>
		</item>
		<item>
		<title>Proust LOLCat</title>
		<link>http://www.proustarchive.org/?p=54</link>
		<comments>http://www.proustarchive.org/?p=54#comments</comments>
		<pubDate>Tue, 26 Oct 2010 03:15:12 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[churches / cathedrals]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=54</guid>
		<description><![CDATA[Just made a LOLCat to explain Proust and churches.

]]></description>
			<content:encoded><![CDATA[<p>Just made a LOLCat to explain Proust and churches.</p>
<p><img class="alignleft" src="http://images.cheezburger.com/completestore/2010/10/25/9f6f8353-3cc7-49de-b869-f02f5bdae0d2.jpg" alt="" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=54</wfw:commentRss>
		</item>
		<item>
		<title>Social Network Analysis of the Recherche (and The Novel)</title>
		<link>http://www.proustarchive.org/?p=53</link>
		<comments>http://www.proustarchive.org/?p=53#comments</comments>
		<pubDate>Thu, 21 Oct 2010 20:22:14 +0000</pubDate>
		<dc:creator>Jeff Drouin</dc:creator>
		
		<category><![CDATA[archives]]></category>

		<category><![CDATA[books]]></category>

		<category><![CDATA[churches / cathedrals]]></category>

		<category><![CDATA[collaboration]]></category>

		<category><![CDATA[community]]></category>

		<category><![CDATA[developing the archive]]></category>

		<category><![CDATA[digital]]></category>

		<category><![CDATA[digital humanities]]></category>

		<category><![CDATA[disciplinarity]]></category>

		<category><![CDATA[editing]]></category>

		<category><![CDATA[literary scholarship]]></category>

		<category><![CDATA[multimedia]]></category>

		<category><![CDATA[narrative]]></category>

		<category><![CDATA[networked media]]></category>

		<category><![CDATA[novel]]></category>

		<category><![CDATA[reading]]></category>

		<category><![CDATA[social software]]></category>

		<category><![CDATA[taxonomy]]></category>

		<category><![CDATA[textual criticism]]></category>

		<category><![CDATA[textual theory]]></category>

		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://www.proustarchive.org/?p=53</guid>
		<description><![CDATA[We can run Proust through social network analysis (SNA) software in order to generate network models among its information nodes. The visualization to the left shows the association (basically, an uncategorized tag I use to label church-related passages) of Venice as it is networked among passages (referenced by their ID numbers and pagination codes) and [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://a.yfrog.com/img535/2847/ltq.png" onclick="javascript:pageTracker._trackPageview('/outbound/article/a.yfrog.com');" target="new"><img style="margin-right: 8px" src="http://a.yfrog.com/img535/2847/ltq.png?w=300" alt="Venice" width="300" height="187" align="left" /></a>We can run Proust through social network analysis (SNA) software in order to generate network models among its information nodes. The visualization to the left shows the association (basically, an uncategorized tag I use to label church-related passages) of Venice as it is networked among passages (referenced by their ID numbers and pagination codes) and notes on narrative context. When manipulated in real time, the visualization highlights the links to other nodes and their related concepts or passages.  What this means for the study of Proust is that we can think of the novel as a network of nodes consisting of concepts, characters, narrative elements, and any other unit of meaning that might enhance exploration of its text.</p>
<p><a href="http://a.yfrog.com/img220/6246/43v.png" onclick="javascript:pageTracker._trackPageview('/outbound/article/a.yfrog.com');" target="new"><img style="margin-left: 8px" src="http://a.yfrog.com/img220/6246/43v.png?w=300" alt="Narrative Contexts" width="300" height="187" align="right" /></a>We can even include various texts and external information for a genetic or contextual study of the novel. In a hypothetical archive containing digitized <em>avant-texte</em> and published variants of the <em>Recherche</em>, we could potentially see &#8212; in strikingly visual terms &#8212; the correspondence of, say, the impact of WWI on the development of different sections. This could provide new insights into Proust&#8217;s writing process as this work continually ballooned and changed during and after the war. What kinds of associations got the most development during and immediately after the war, and in what points of the narrative did they occur? Which churches received the most attention and where are they located in both fictional and real space? There is a wealth of traditional scholarship addressing genetic and contextual issues like these. However, SNA presents an opportunity to view all of the information nodes simultaneously, a much more powerful (and accurate) tool for the study of a book than other print books are. In that way, we would see and move around in the <em>Recherche</em> as a writerly text that responds to its own inner needs in reference to the war.</p>
<p>Tools like SNA also present new pastures for narratology. If the text is marked up appropriately, all instances of a particular narrative device or structure could be instantly recalled by a researcher and viewed in relation to any parameters desired. It would be an even more comprehensive supplement than Barthes and Genette.</p>
<p>All of this goes to say that a more rigorous taxonomy of the <em>Recherche</em> would be necessary for a meaningful SNA application. At present, the interpretive apparatus of the Ecclesiastical Proust Archive consists solely of associations, which are uncategorized tags denoting concepts, themes, important details, architectural elements of the churches described, and so on. It would be far more meaningful to tag <em>separately</em> the characters, churches, architectural elements, themes, concepts, and plot elements that make up the rich density of this novel, as well as the images and other media added here to illustrate it, so that they become individual nodes within the information network. Then a far more rigorous and powerful visualization of the novel would be possible, and new discoveries will almost certainly be made.</p>
<p>But so far, these notions pertain only to my particular study of the church motif. A far richer application would be made if we took an entire electronic text (or, better, all of the variants and translations) and allowed researchers to mark them up and add media by way of illustration. In that way, the Proust archive would become a collaborative, electronic research and editing environment that takes shape from individuals&#8217; own scholarly pursuits</p>
<p><a href="http://a.yfrog.com/img823/738/pxx.png" onclick="javascript:pageTracker._trackPageview('/outbound/article/a.yfrog.com');" target="new"><img style="margin-left: 8px" src="http://a.yfrog.com/img823/738/pxx.png?w=300" alt="Associations" width="300" height="187" align="right" /></a><a href="http://en.wikipedia.org/wiki/Social_network" onclick="javascript:pageTracker._trackPageview('/outbound/article/en.wikipedia.org');" target="_blank">Social network analysis</a> (SNA) software combines a variety of methods commonly used in digital humanities research, such as text mining, visualization, and modeling. SNA software can pour over the data and metadata in the archive&#8217;s XML files and generate a network of nodes. It could be trained to recognize and normalize names, or even pseudonyms, and the metadata, provided by readers, would tell it whether a given passage contained the idea of Venice, or the subject/object distinction, or jealousy (or all three). If it had a qualitative analysis component it could even recognize concepts. And of course there would need to be the capability for scholars to add and tag information about the documents in the archive. This is a daunting task, but eminently possible with the aid of text and data mining software.</p>
<p>There are pitfalls, of course. The accuracy of any analytic tool depends on the quality of the data it operates upon. We must always be aware that tools like these, powerful and impressive though they are, always represent a <em>state</em> of the information realm. This is no different from traditional, print-based scholarship, but it bears consideration given the sometimes exaggerated hype of digital humanities at the time of this writing.</p>
<p>So, now that AccessTEI has provided us an XML file with structural TEI markup, I&#8217;ll be looking for ways to mine it with text analysis and SNA software. Stay tuned for more updates.</p>
<p>As an aside, SNA has tremendous possibilities for the study of modernist magazine culture, which is an <em>actual</em>, publishing network. See my post from earlier today at the <a href="http://magmods.wordpress.com/2010/10/21/social-network-analysis-of-magazine-culture/" onclick="javascript:pageTracker._trackPageview('/outbound/article/magmods.wordpress.com');" target="_blank">Magazine Modernisms blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.proustarchive.org/?feed=rss2&amp;p=53</wfw:commentRss>
		</item>
	</channel>
</rss>

