<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Mathemagenic &#187; blog research tools</title>
	<atom:link href="http://blog.mathemagenic.com/tags/blog-research-tools/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.mathemagenic.com</link>
	<description>Lilia Efimova on personal productivity in knowledge-intensive environments, weblog research, knowledge management, PhD, serendipity and lack of work-life balance...</description>
	<lastBuildDate>Fri, 30 Dec 2011 22:25:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.5</generator>
		<item>
		<title>Comparing weblog text to the PhD dissertation via tagclouds</title>
		<link>http://blog.mathemagenic.com/2008/07/07/comparing-weblog-text-to-phd-dissertation/</link>
		<comments>http://blog.mathemagenic.com/2008/07/07/comparing-weblog-text-to-phd-dissertation/#comments</comments>
		<pubDate>Mon, 07 Jul 2008 13:03:10 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 3. Ideas]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[blog writing]]></category>
		<category><![CDATA[citedCh3]]></category>
		<category><![CDATA[writing]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/?p=1516</guid>
		<description><![CDATA[About a year ago I looked for Tools to find similarity between two texts (weblog and papers) &#8211; I wanted to find a relatively objective way to judge how much of my weblog writing ends up in the dissertation. Between other things I experimented with generating and comparing tagclouds from texts that were supposed to [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>About a year ago I looked for <a title="Permanent Link: Tools to find similarity between two texts (weblog and papers)" rel="bookmark" href="../../2007/06/12/tools-to-find-similarity-between-two-texts-weblog-and-papers/">Tools to find similarity between two texts (weblog and papers)</a> &#8211; I wanted to find a relatively objective way to judge how much of my weblog writing ends up in the dissertation.</p>
<p>Between other things I experimented with generating and comparing tagclouds from texts that were supposed to correspond to each other. I tried several tools, but ended up with <a href="http://www.tagcrowd.com">tagCrowd</a> since it allowed using generic and custom-made lists of stop words.</p>
<p>As an experiment I used text of five dissertation chapters (draft versions as of April 17, 2008) and the text of blog posts coded as corresponding to those chapters to generate a visualisation of most frequent words in each case. After removing stop words (general English plus those from my own list that I was stupid enough not to save) 65 most frequent words are visualised.</p>
<p>For example, two tagclouds below are those from the <a title="Category Chapter 6. Microsoft" href="http://blog.mathemagenic.com/categories/phd/chapter6/">blogposts related to the Microsoft study</a> and the draft chapter with the results of it.<br />
<a title="Tagcrowd: blogposts related to chapter 6 (Microsoft) by Lilia Efimova, on Flickr" href="http://www.flickr.com/photos/mathemagenic/2430350495/"><img src="http://farm3.static.flickr.com/2419/2430350495_9d0e953150_m.jpg" alt="Tagcrowd: blogposts related to chapter 6 (Microsoft)" width="240" height="180" /></a><a title="Tagcrowd: current draft chapter 6 (Microsoft) by Lilia Efimova, on Flickr" href="http://www.flickr.com/photos/mathemagenic/2430350539/"><img src="http://farm3.static.flickr.com/2188/2430350539_7b78d02143_m.jpg" alt="Tagcrowd: current draft chapter 6 (Microsoft)" width="240" height="193" /></a></p>
<p>In total I had 5 pairs of visualisations. I then mixed them and asked five people familiar with my research (supervisors and collaborators) and eight students (taking a class with <a href="http://anjo.blogs.com/">Anjo</a>) to find matching pairs. The results are below.</p>
<table style="text-align: left;" border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="226" valign="top"></td>
<td style="text-align: center;" width="76" valign="top">Total pairs</td>
<td style="text-align: center;" width="104" valign="top">Correctly matched pairs</td>
<td style="text-align: center;" width="113" valign="top">Correctly matched pairs, %</td>
</tr>
<tr>
<td width="226" valign="top">Chapter 1. Introduction</td>
<td style="text-align: center;" width="76" valign="top">13</td>
<td style="text-align: center;" width="104" valign="top">10</td>
<td style="text-align: center;" width="113" valign="top">77%</td>
</tr>
<tr>
<td width="226" valign="top">Chapter 2. Methodology</td>
<td style="text-align: center;" width="76" valign="top">13</td>
<td style="text-align: center;" width="104" valign="top">11</td>
<td style="text-align: center;" width="113" valign="top">85%</td>
</tr>
<tr>
<td width="226" valign="top">Chapter 3. Ideas</td>
<td style="text-align: center;" width="76" valign="top">13</td>
<td style="text-align: center;" width="104" valign="top">6</td>
<td style="text-align: center;" width="113" valign="top">46%</td>
</tr>
<tr>
<td width="226" valign="top">Chapter 4. Conversations</td>
<td style="text-align: center;" width="76" valign="top">13</td>
<td style="text-align: center;" width="104" valign="top">10</td>
<td style="text-align: center;" width="113" valign="top">77%</td>
</tr>
<tr>
<td width="226" valign="top">Chapter 5. Microsoft</td>
<td style="text-align: center;" width="76" valign="top">13</td>
<td style="text-align: center;" width="104" valign="top">9</td>
<td style="text-align: center;" width="113" valign="top">69%</td>
</tr>
<tr>
<td width="226" valign="top"><strong>Total</strong></td>
<td style="text-align: center;" width="76" valign="top"><strong>65</strong></td>
<td style="text-align: center;" width="104" valign="top"><strong>46</strong></td>
<td style="text-align: center;" width="113" valign="top"><strong>71%</strong></td>
</tr>
<tr>
<td style="text-align: right;" width="226" valign="top">by people familiar with the research</td>
<td style="text-align: center;" width="76" valign="top">25</td>
<td style="text-align: center;" width="104" valign="top">20</td>
<td style="text-align: center;" width="113" valign="top">80%</td>
</tr>
<tr>
<td style="text-align: right;" width="226" valign="top">by people not familiar with the research</td>
<td style="text-align: center;" width="76" valign="top">40</td>
<td style="text-align: center;" width="104" valign="top">26</td>
<td style="text-align: center;" width="113" valign="top">65%</td>
</tr>
</tbody>
</table>
<p>Some comments:</p>
<ul>
<li>I guess there is a connection between PhD chapters and blogposts :)</li>
<li>The high score for the methodology chapter is explained by its qualitative difference from the rest of the dissertation.</li>
<li>The low score for this chapter is explained by the fact that the coding of weblog entries in relation to chapters was done prior to writing it. As a results it included many &#8220;might be relevant&#8221; posts, while for other chapters the focus was more clear. In addition, the draft version of the chapter used to generate the visualisation was the first draft, while in other cases those were revised several times.</li>
</ul>
<p><a href="http://www.flickr.com/photos/mathemagenic/2441928422/" title="Tagcrowds: current state of the dissertation by Lilia Efimova, on Flickr"><img src="http://farm4.static.flickr.com/3117/2441928422_0bb10fb4e1_m.jpg" width="240" align=right height="180" alt="Tagcrowds: current state of the dissertation" /></a>It was nice to see that although many of the visualisations looked similar (with <em>blogging</em> and <em>weblog</em> being big ;) it was actually possible to match the pairs. But the nicest thing was simply making all those pictures, laying them on the floor and thinking that I actually had some version of 5 chapters out of the 7 :)</p>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/blog-writing/" title="blog writing" rel="tag">blog writing</a>, <a href="http://blog.mathemagenic.com/tags/citedch3/" title="citedCh3" rel="tag">citedCh3</a>, <a href="http://blog.mathemagenic.com/tags/writing/" title="writing" rel="tag">writing</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2005/08/20/studying-weblogs-at-microsoft-connecting-the-dots/" title="Studying weblogs at Microsoft: connecting the dots (August 20, 2005)">Studying weblogs at Microsoft: connecting the dots</a> </li>
	<li><a href="http://blog.mathemagenic.com/2004/03/04/weblog-conversations-are-flows-in-a-river-delta/" title="Weblog conversations are flows in a river delta (March 4, 2004)">Weblog conversations are flows in a river delta</a> </li>
	<li><a href="http://blog.mathemagenic.com/2007/06/25/affectionate-writing-reduces-cholesterol/" title="Affectionate writing reduces cholesterol (June 25, 2007)">Affectionate writing reduces cholesterol</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2008/07/07/comparing-weblog-text-to-phd-dissertation/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Getting more by reading less blogs: some thoughts on &#8216;Cost-Effective Outbreak Detection in Networks&#8217;</title>
		<link>http://blog.mathemagenic.com/2007/11/14/getting-more-by-reading-less-blogs/</link>
		<comments>http://blog.mathemagenic.com/2007/11/14/getting-more-by-reading-less-blogs/#comments</comments>
		<pubDate>Wed, 14 Nov 2007 13:43:46 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 3. Ideas]]></category>
		<category><![CDATA[Chapter 4. Conversations]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[Iceberg: selected]]></category>
		<category><![CDATA[Meta-blogging]]></category>
		<category><![CDATA[Weblog research]]></category>
		<category><![CDATA[blog reading]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[BlogTrace]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2007/11/14.html#a1953</guid>
		<description><![CDATA[Matthew Hurst on the most important blogs for efficient readers: A group of researchers at CMU have been considering a notion of blog importance based on how likely a set of blogs is to ensure that you will be informed of topics bursting in the blogosphere. By analogy, they consider a graph of water pipelines. [...]]]></description>
			<content:encoded><![CDATA[<p></p><p><a href="http://datamining.typepad.com/data_mining/2007/10/the-most-import.html">Matthew Hurst on the most important blogs for efficient readers</a>:</p>
<blockquote class="cite"><p>A group of researchers at CMU have been considering a notion of blog importance based on how likely a set of blogs is to ensure that you will be informed of topics bursting in the blogosphere. By analogy, they consider a graph of water pipelines. Their paper &#8211; <a href="http://www.cs.cmu.edu/%7Ejure/pubs/detect-kdd07.pdf">Cost-Effective Outbreak Detection in Networks</a> Leskovec, Krause, Guestrin, Faloutsos, VanBriesen, Glance &#8211; poses the problem:</p>
<blockquote><p>Given a water distribution network, where should we place sensors to quickly detect contaminants? Or, which blogs should we read to avoid missing important stories? These seemingly different problems share common structure: Outbreak detection can be modeled as selecting nodes (sensor locations, blogs) in a network, in order to detect the spreading of a virus or information as quickly as possible.</p></blockquote>
<p>As a result of this work, the authors have published some blog lists which answer a fundamentally important question in terms of weblog reading habits: Which weblogs should I read to be most up to date? The lists answering this question &#8211; generated by the approach described in their paper &#8211; come in a number of varieties to be found on <a href="http://www.cs.cmu.edu/%7Ejure/blogs/">the project&#8217;s page</a>.</p></blockquote>
<p dir="ltr">I scanned (skipped most of the math :) through the <a href="http://www.cs.cmu.edu/%7Ejure/pubs/detect-techRept.pdf">extended version of the paper</a> and this is something I would love to see applied to niche blogging networks. For example, starting from a subset of weblogs that mention topic X or, better, those that participate in a discussion (cascade) that mentiones topic X.</p>
<p dir="ltr">A few points relevant from the practical perspective &#8211; having a tool that helps a blogreader to make a selection of blogs to read (my expectations in that respect are pretty high given that <a href="http://www.linkedin.com/in/nglance">Natalie Glance</a> is working for Google now :)</p>
<p dir="ltr">1. <strong>&#8220;Costs&#8221; of reading</strong>. The authors played with optimising the number of blogs and number of posts one reads. Assuming that reading less blog posts is more cost-effective, the algorithm shows that &#8220;the popular blogs might not be the most effective way to catch relevant information cascades&#8221; (p.23). Instead, it makes more sense to read &#8220;good summarizer blogs that may not be very popular, but which, by using few posts, catch most of the important stories propagating over the blogosphere&#8221; (p.15).</p>
<p dir="ltr">2. <strong>Predicting the future</strong>. From a reader perspective one would like to have a recommendation of blogs that will cover most interesting stories in the future. From what I understood the algorithm does not work that well for making those predictions. The authors optimised the performance by including only big blogs (= at least one post per day), but I wonder if there are some other alternatives.</p>
<p>Anyway, I guess I should go back to my PhD writing and wait patiently till people who read the paper without skipping the math do something with it. So far I&#8217;m happy that the paper promises lots of interesting developments and that it also makes me feeling less guilty with our alternative approach to vaccination by suggesting that &#8220;uniform ummunization strategy corresponds to randomly placing sensors in a water network&#8221; (p.22), which in not optimal :)))</p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2007/11/14.html#a1953">http://blog.mathemagenic.com/2007/11/14.html#a1953</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1953&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2007%2F11%2F14.html%23a1953">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-reading/" title="blog reading" rel="tag">blog reading</a>, <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/blogtrace/" title="BlogTrace" rel="tag">BlogTrace</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2005/01/29/blogtrace/" title="BlogTrace (January 29, 2005)">BlogTrace</a> </li>
	<li><a href="http://blog.mathemagenic.com/2003/11/20/phd-dissertation-on-implementing-blogs-in-manufacturing-environment/" title="PhD dissertation on implementing blogs in manufacturing environment (November 20, 2003)">PhD dissertation on implementing blogs in manufacturing environment</a> </li>
	<li><a href="http://blog.mathemagenic.com/2004/01/25/orkut-and-a-bit-of-thinking-on-on-line-networking-tools/" title="Orkut and a bit of thinking on on-line networking tools (January 25, 2004)">Orkut and a bit of thinking on on-line networking tools</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2007/11/14/getting-more-by-reading-less-blogs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tools to find similarity between two texts (weblog and papers)</title>
		<link>http://blog.mathemagenic.com/2007/06/12/tools-to-find-similarity-between-two-texts-weblog-and-papers/</link>
		<comments>http://blog.mathemagenic.com/2007/06/12/tools-to-find-similarity-between-two-texts-weblog-and-papers/#comments</comments>
		<pubDate>Tue, 12 Jun 2007 16:00:20 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 3. Ideas]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[Iceberg: selected]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[blog writing]]></category>
		<category><![CDATA[knowledge mapping]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2007/06/12.html#a1909</guid>
		<description><![CDATA[I&#8217;m playing with an idea of comparing (parts of) my weblog with some of my published papers (and with the dissertation as a whole when I&#8217;m done). So far I&#8217;m interested in two things: how much of the text is reused how conceptually close two texts (weblog and a paper) are Thought of a couple [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>I&#8217;m playing with an idea of comparing (parts of) my weblog with some of my published papers (and with the dissertation as a whole <a href="http://blog.mathemagenic.com/2004/02/13.html#a1082">when I&#8217;m done</a>). So far I&#8217;m interested in two things:</p>
<ul>
<li>how much of the text is reused</li>
<li>how conceptually close two texts (weblog and a paper) are</li>
</ul>
<p>Thought of a couple of ways to do so:</p>
<ul>
<li>One way would be to use all kinds of weblog analysis tool from <a href="http://anjo.blogs.com/">Anjo</a>. One of the difficulties there would be to figure out how to find similarities between weblog text, which is relatively self-contained microcontent pieces, and linear &#8220;build upon previousely said&#8221; academic papers.</li>
<li>Another option would be to use some <a href="http://www.plagiarism.org/">plagiarism</a> detection tools. Only wonder if you can configure those to compare target paper with a specific weblog, rather than with &#8220;everything published&#8221;.</li>
</ul>
<p>Any ideas?</p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2007/06/12.html#a1909">http://blog.mathemagenic.com/2007/06/12.html#a1909</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1909&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2007%2F06%2F12.html%23a1909">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/blog-writing/" title="blog writing" rel="tag">blog writing</a>, <a href="http://blog.mathemagenic.com/tags/knowledge-mapping/" title="knowledge mapping" rel="tag">knowledge mapping</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2004/04/27/weblog-research-ethics/" title="Weblog research ethics (April 27, 2004)">Weblog research ethics</a> </li>
	<li><a href="http://blog.mathemagenic.com/2004/03/16/common-visual-design-elements-of-weblogs/" title="Common visual design elements of weblogs (March 16, 2004)">Common visual design elements of weblogs</a> </li>
	<li><a href="http://blog.mathemagenic.com/2004/07/05/blogtalk-20-mark/" title="BlogTalk 2.0: Mark (July 5, 2004)">BlogTalk 2.0: Mark</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2007/06/12/tools-to-find-similarity-between-two-texts-weblog-and-papers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Papers of WWW2006 workshop on the weblogging ecosystem</title>
		<link>http://blog.mathemagenic.com/2006/06/15/papers-of-www2006-workshop-on-the-weblogging-ecosystem/</link>
		<comments>http://blog.mathemagenic.com/2006/06/15/papers-of-www2006-workshop-on-the-weblogging-ecosystem/#comments</comments>
		<pubDate>Thu, 15 Jun 2006 08:18:19 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Weblog research]]></category>
		<category><![CDATA[blog ecosystem]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[papers]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2006/06/15.html#a1777</guid>
		<description><![CDATA[Papers from 3rd Annual Workshop on the Weblogging Ecosystem (see also papers from 2004 and 2005 workshops). *Leave a Reply: An Analysis of Weblog Comments, Gilad Mishne and Natalie Glance The Ties that Blog: Examining the Relationship Between Social Ties and Continued Participation in the Wallop Weblogging System, Thomas Lento, Howard T. Welser, Lei Gu [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Papers from <a href="http://www.blogpulse.com/www2006-workshop/program.html">3rd Annual Workshop on the Weblogging Ecosystem</a> (see also papers from <a href="http://blog.mathemagenic.com/2004/05/11.html#a1201">2004</a> and <a href="http://blog.mathemagenic.com/2005/08/15.html#a1632">2005</a> workshops). </p>
<ul>
<li>*<a href="http://www.blogpulse.com/www2006-workshop/papers/wwe2006-blogcomments.pdf">Leave a Reply: An Analysis of Weblog Comments</a>, Gilad Mishne and Natalie Glance
</li>
<li><a href="http://www.blogpulse.com/www2006-workshop/papers/Lento-Welser-Gu-Smith-TiesThatBlog.pdf">The Ties that Blog: Examining the Relationship Between Social Ties and Continued Participation in the Wallop Weblogging System</a>, Thomas Lento, Howard T. Welser, Lei Gu and Marc Smith
</li>
<li>*<a href="http://www.blogpulse.com/www2006-workshop/papers/blogs-during-london-attacks.pdf">Blogs During the London Attacks: Top Information Sources and Topics</a>, Mike Thelwall
</li>
<li><a href="http://www.blogpulse.com/www2006-workshop/papers/persian-weblogs.pdf">Experiments on Persian Weblogs</a>, Kyumars Sheykh Esmaili, Mohsen Jamali, Mahmood Neshati, Hassan Abolhassani and Yasaman Soltan-Zadeh
</li>
<li>**<a href="http://www.blogpulse.com/www2006-workshop/papers/wwe2006-discovery-lin-final.pdf">Discovery of Blog Communities Based on Mutual Awareness</a>, Yu-Ru Lin, Hari Sundaram, Yun Chi, Jun Tatemura and Belle Tseng
</li>
<li><a href="http://www.blogpulse.com/www2006-workshop/papers/splogosphere.pdf">Characterizing the Splogosphere</a>, Pranam Kolari, Akshay Java and Tim Finin
</li>
<li><a href="http://www.blogpulse.com/www2006-workshop/papers/detecting-blog-spam.pdf">Detecting Blog Spams using the Vocabulary Size of All Substrings in Their Copies</a>, Kazuyuki Narisawa, Yasuhiro Yamada, Daisuke Ikeda and Masayuki Takeda
</li>
<li><a href="http://www.blogpulse.com/www2006-workshop/papers/collaborative-blogspam-filtering.pdf">Collaborative Blog Spam Filtering Using Adaptive Percolation Search</a>, Seungyeop Han, Yong-yeol Ahn, Sue Moon and Hawoong Jeong
</li>
<li>*<a href="http://www.blogpulse.com/www2006-workshop/papers/wwe2006-decomposing-moods.pdf">Decomposing Bloggers&#8217; Moods: Towards a Time Series Analysis of Moods in the Blogosphere</a>, Krisztian Balog and Maarten de Rijke
</li>
<li>**<a href="http://www.blogpulse.com/www2006-workshop/papers/wwe2006-oka.pdf">Extracting Topics From Weblogs Through Frequency Segments</a>, Mizuki Oka, Hirotake Abe and Kazuhiko Kato
</li>
<li><a href="http://www.blogpulse.com/www2006-workshop/papers/wwe2006-fujimura.pdf">BLOGRANGER &#8211; A Multi-Faceted Blog Search Engine</a>, Ko Fujimura, Hiroyuki Toda, Takafumi Inoue, Nobuaki Hiroshima, Ryoji Kataoka and Masayuki Sugizaki
</li>
<li>*<a href="http://www.blogpulse.com/www2006-workshop/papers/wwe2006-blog-ohkura.pdf">Browsing System for Weblog Articles based on Automated Folksonomy</a>, Tsutomu Ohkura, Yoji Kiyota and Hiroshi Nakagawa </li>
</ul>
<p>I seriousely considered going, but it would cut a week from my honeymoon&#8230; At least now there is nice collection for reading.</p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2006/06/15.html#a1777">http://blog.mathemagenic.com/2006/06/15.html#a1777</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1777&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2006%2F06%2F15.html%23a1777">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-ecosystem/" title="blog ecosystem" rel="tag">blog ecosystem</a>, <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/papers/" title="papers" rel="tag">papers</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2006/04/12/jan-schmidt-on-blogging-practices/" title="Jan Schmidt on blogging practices (April 12, 2006)">Jan Schmidt on blogging practices</a> </li>
	<li><a href="http://blog.mathemagenic.com/2006/06/15/papers-from-aaai-2006-symposia-on-computational-approaches-to-analyzing-weblogs/" title="Papers from AAAI 2006 Symposia on Computational Approaches to Analyzing Weblogs (June 15, 2006)">Papers from AAAI 2006 Symposia on Computational Approaches to Analyzing Weblogs</a> </li>
	<li><a href="http://blog.mathemagenic.com/2004/07/05/blogtalk-20-panel-2-blogging-beyond-webpublishing/" title="BlogTalk 2.0: Panel 2 &#8211; blogging beyond webpublishing (July 5, 2004)">BlogTalk 2.0: Panel 2 &#8211; blogging beyond webpublishing</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2006/06/15/papers-of-www2006-workshop-on-the-weblogging-ecosystem/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Feed your blog to tOKo and see what comes out</title>
		<link>http://blog.mathemagenic.com/2006/04/11/feed-your-blog-to-toko-and-see-what-comes-out/</link>
		<comments>http://blog.mathemagenic.com/2006/04/11/feed-your-blog-to-toko-and-see-what-comes-out/#comments</comments>
		<pubDate>Tue, 11 Apr 2006 11:44:27 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 4. Conversations]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[blog communities]]></category>
		<category><![CDATA[blog conversations]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[BlogTrace]]></category>
		<category><![CDATA[knowledge mapping]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2006/04/11.html#a1761</guid>
		<description><![CDATA[Anjo is moving further in developing a blog-friendly version of tOKo (related to all our earlier work on weblog communities, conversations and topics): A little bit of progress on the open source version of tOKo (and the like), and in particular making it suitable for bloggers. The first problem is turning a (your?) blog into [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Anjo is moving further in developing a <a href="http://anjo.blogs.com/metis/2006/04/toko_does_movab.html">blog-friendly version of tOKo</a> (related to all our earlier work on weblog communities, conversations and topics):<br />
<blockquote class=cite>A little bit of progress on <a href="http://anjo.blogs.com/metis/2006/03/open_source_tok.html">the open source version of tOKo</a> (and the like), and in particular making it suitable for bloggers.</p></blockquote>
<p>The first problem is turning a (your?) blog into a corpus. tOKo is pretty flexible as to what a corpus looks like, but the process must be automated. <a href="http://blog.jackvinson.com/">Jack Vinson</a> and <a href="http://www.zylstra.org/blog/">Ton Zijlstra</a> provided great help by converting their blogs to a Movable Type export file and making the result available. Therefore, tOKo now contains a &#8220;Create corpus from Movable Type&#8221; function. The nice thing is that several blogging platforms provide Movable Type (MT) export. For example, in <a href="http://www.typepad.com/">TypePad</a> (which I use) a MT file can be generated from the web interface. Moreover, an MT file contains all information, including comments and trackbacks. </p>
<p>I&#8217;m getting into research fun anticipation &#8211; getting hold of comments next to post text would be such a great thing for the analysis :) </p>
<p>And, if want to help to develop the tool you can contribute your blog archives in Movable Type format (<a title="Eric Pierce: WPexport 0.2" href="http://epierce.blog.usf.edu/?p=15">WPexport</a> could be handy for WordPress users). This especially makes sense if you feel belonging to <a href="http://blog.mathemagenic.com/2005/09/29.html#a1680">KM bloggers community</a> (<a href="http://blog.mathemagenic.com/2005/10/07.html#a1686">paper</a>) &#8211; or, as Anjo <a href="http://anjo.blogs.com/metis/2006/04/toko_does_movab.html">puts it</a>:<br />
<blockquote class=cite>If you have linked to Jack, Ton, <a href="http://blog.mathemagenic.com/">Lilia</a> or myself in the past, this would be particularly interesting (also if you can only export to Movable Type). The only disadvantage of making your weblog available is that I might ask you to alpha-test tOKo :-). </p></blockquote>
<p>My email address is: anjo science uva nl (one at, two dots). </p>
<p>You get a bit more insight about this work from <a href="http://www.zylstra.org/blog/archives/2006/04/toko_eats_movab.html">Ton&#8217;s impressions on the work in progress</a> and Anjo&#8217;s visualisations (<a href="http://anjo.blogs.com/metis/2005/11/a_model_for_web.html">1</a>, <a href="http://anjo.blogs.com/metis/2006/03/topics_in_mathe_1.html">2</a>, <a href="http://anjo.blogs.com/metis/2006/03/open_source_tok.html">3</a>, <a href="http://anjo.blogs.com/metis/2006/04/toko_does_movab.html">4</a>).</p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2006/04/11.html#a1761">http://blog.mathemagenic.com/2006/04/11.html#a1761</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1761&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2006%2F04%2F11.html%23a1761">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-communities/" title="blog communities" rel="tag">blog communities</a>, <a href="http://blog.mathemagenic.com/tags/blog-conversations/" title="blog conversations" rel="tag">blog conversations</a>, <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/blogtrace/" title="BlogTrace" rel="tag">BlogTrace</a>, <a href="http://blog.mathemagenic.com/tags/knowledge-mapping/" title="knowledge mapping" rel="tag">knowledge mapping</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2007/08/12/weblog-conversations-revisited-an-introduction/" title="Weblog conversations revisited: an introduction (August 12, 2007)">Weblog conversations revisited: an introduction</a> </li>
	<li><a href="http://blog.mathemagenic.com/2006/11/08/understanding-weblog-communities-through-digital-traces-a-framework-a-tool-and-an-example/" title="Understanding weblog communities through digital traces: a framework, a tool and an example (November 8, 2006)">Understanding weblog communities through digital traces: a framework, a tool and an example</a> </li>
	<li><a href="http://blog.mathemagenic.com/2002/08/07/blogs-in-reseach/" title="Blogs in reseach (August 7, 2002)">Blogs in reseach</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2006/04/11/feed-your-blog-to-toko-and-see-what-comes-out/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Shout if you want to be heard or Technorati blog finder</title>
		<link>http://blog.mathemagenic.com/2005/09/04/shout-if-you-want-to-be-heard-or-technorati-blog-finder/</link>
		<comments>http://blog.mathemagenic.com/2005/09/04/shout-if-you-want-to-be-heard-or-technorati-blog-finder/#comments</comments>
		<pubDate>Sun, 04 Sep 2005 03:48:30 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 3. Ideas]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[Meta-blogging]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[citedCh3]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2005/09/04.html#a1654</guid>
		<description><![CDATA[For all the unhappy ones: Technorati performance and scalability improvement progress and Technorati blog finder (via David Sifry). Things to know: 1. You have to categorise your weblog manually: By default, the blogs are presented in order of authority, which means highly-linked blogs appear first. So each of these Blog Finder pages is like a [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>For all the unhappy ones: <a href="http://www.sifry.com/alerts/archives/000338.html">Technorati performance and scalability improvement progress</a> and <a href="http://technorati.com/blogs/">Technorati blog finder</a> (via <a href="http://www.sifry.com/alerts/archives/000339.html">David Sifry</a>).</p>
<p>Things to know:</p>
<p>1. You have to <a href="http://www.technorati.com/weblog/2005/09/44.html">categorise your weblog manually</a>:</p>
<blockquote class="cite"><p>By default, the blogs are presented in order of authority, which means highly-linked blogs appear first. So each of these Blog Finder pages is like a mini <a href="http://technorati.com/pop/blogs/">Top 100</a> for any topic you can imagine. You can also sort each tag by how recently the blogs have updated, or alphabetical by title.</p>
<p>And for all you bloggers out there, this is a great opportunity for your blog to get found. If you&#8217;re already a Technorati member with a claimed blog, all you have to do is visit your <a href="http://technorati.com/account/#blogs">Configure Blog page</a> to choose which tags you want to use. You can add up to 20 tags per blog.</p></blockquote>
<p>2. It&#8217;s <a href="http://technorati.com/help/blogfinder.html">prepopulated based on existing tags</a>:</p>
<blockquote class="cite"><p>We kicked off the Blog Finder by auto-classifying blogs based on the tags they use in posts most often. But you can list your blog under any tag you like, up to 20.</p></blockquote>
<p>Of course I went to check for my weblog and didn&#8217;t find it under <acronym title="knowledge management">KM</acronym>, &#8220;knowledge management&#8221; and &#8220;learning&#8221;. Not surprising since I don&#8217;t really use Technorati tags (necessary mark-up is not produced by LiveTopics and I&#8217;m too lazy to add tags manually next to adding topics).</p>
<p>Clearly that those users who don&#8217;t know or don&#8217;t care about tagging especially for Technorati are out of the system (which reinforces &#8220;shout if you want to be heard&#8221; behavior with all its implications).</p>
<p>Thoughts:</p>
<ul>
<li>May be some kind of extrapolation could work &#8211; if top blogs on a topic link to the specific blog frequently it could be included into the topic list.</li>
<li>Wonder how tagging at post level (curent auto-classification) would intergrate with blog-level tagging that is asked for.</li>
<li>If auto-classification stays (which makes sense) and continues influencing one&#8217;s inclusion into the lists &#8211; how this would influence post-level tagging (e.g. adding unnecessary tags).</li>
<li>First though of spammers, but then realised that it&#8217;s more or less covered by sorting based on incoming links (of course, untill someone heavily linked in one domain starts adding tags for another domain that has nothing to do with the blog focus).</li>
</ul>
<p>And, an example of overcoming being lazy and conforming to &#8220;shout if you want to be heard&#8221; practice <a rel="tag" href="http://technorati.com/tag/Technorati+Blog+Finder">Technorati+Blog+Finder</a> :)</p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2005/09/04.html#a1654">http://blog.mathemagenic.com/2005/09/04.html#a1654</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1654&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2005%2F09%2F04.html%23a1654">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/citedch3/" title="citedCh3" rel="tag">citedCh3</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2004/04/03/oklc04-my-presentation/" title="OKLC04: my presentation (April 3, 2004)">OKLC04: my presentation</a> </li>
	<li><a href="http://blog.mathemagenic.com/2006/06/15/international-conference-on-weblogs-and-social-media/" title="International Conference on Weblogs and Social Media (June 15, 2006)">International Conference on Weblogs and Social Media</a> </li>
	<li><a href="http://blog.mathemagenic.com/2003/12/05/the-power-of-loose-ends-3-or-the-weakness-of-weblogs-when-it-comes-to-joint-actions/" title="The power of loose ends (3) or the weakness of weblogs when it comes to joint actions (December 5, 2003)">The power of loose ends (3) or the weakness of weblogs when it comes to joint actions</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2005/09/04/shout-if-you-want-to-be-heard-or-technorati-blog-finder/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Social computing symposium: BlogTrace demo</title>
		<link>http://blog.mathemagenic.com/2005/04/26/social-computing-symposium-blogtrace-demo/</link>
		<comments>http://blog.mathemagenic.com/2005/04/26/social-computing-symposium-blogtrace-demo/#comments</comments>
		<pubDate>Tue, 26 Apr 2005 19:23:09 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 4. Conversations]]></category>
		<category><![CDATA[Chapter 5. Relations]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Iceberg: selected]]></category>
		<category><![CDATA[PhD news]]></category>
		<category><![CDATA[blog research]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[SCS]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2005/04/26.html#a1566</guid>
		<description><![CDATA[I&#8217;m presenting today our work on BlogTrace (Anjo Anjewierden does most of the hard work, but there are others as well &#8211; see below). Some links that could be interesting: Mapping knowledge flows in weblogs (with Anjo, Rogier Brussee &#38; Robert de Hoog) Detecting knowledge flows in weblogs (final version) &#8211; a paper with an [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>I&#8217;m presenting today our work on <a href="http://anjo.blogs.com/metis/2005/01/blogtrace.html">BlogTrace</a> (<a href="http://anjo.blogs.com/metis/">Anjo Anjewierden</a> does most of the hard work, but there are others as well &#8211; see below). Some links that could be interesting:</p>
<ul>
<li>Mapping knowledge flows in weblogs (with Anjo, <a href="http://rogierbrussee.blogspot.com/">Rogier Brussee</a> &amp; Robert de Hoog)</li>
</ul>
<ul>
<li><a href="http://anjo.blogs.com/metis/2005/01/knowledge_flows.html">Detecting knowledge flows in weblogs</a> (<a href="http://staff.science.uva.nl/%7Eanjo/kflows_iccs2005.pdf">final version</a>) &#8211; a paper with an overview of blog research tools we are working on and examples of their uses for detecting knowledge flows</li>
<li>
<div><a href="http://blog.mathemagenic.com/2004/09/06.html#a1333">Shared conceptualisations in weblogs</a> &#8211; paper with background of <a href="http://anjo.blogs.com/metis/2004/08/sigmund_goes_pu.html">Sigmund</a> and examples</div>
</li>
<li><a href="http://anjo.blogs.com/metis/2005/01/visual_settleme.html">Visual settlements</a> &#8211; experiments with visual representation of linking/size/connections between weblog posts in a weblog</li>
</ul>
<li>Mapping weblog communities (work with Anjo and <a href="http://www.sumofmyparts.com/blog/">Stephanie Hendrick</a>)</li>
<ul>
<li>Start here with <a href="http://anjo.blogs.com/metis/2005/04/black_holes_in_.html">examples</a></li>
<li><a href="http://blog.mathemagenic.com/2004/11/18.html#a1435">In search for a virtual settlement: An exploration of weblog community boundaries (draft)</a> &#8211; an overview of weblog community research challenges, community indicators, examples of visualisations</li>
<div><span></span></div>
<p> </p>
<p><span></p>
<li><span>My blog posts on <a onmouseover="window.status='See more posts about: blog communities'; return true;" href="http://blog.mathemagenic.com/topics/topicsB.html#blog_communities" target="_self">blog communities</a></span></li>
<p> </p>
<p></span></ul>
<p>Things I&#8217;m not showing, related</p>
<ul>
<li>Work on weblog conversations with <a href="http://growingpains.blogs.com/home/">Aldo de Moor</a></li>
</ul>
<ul>
<li><a href="http://blog.mathemagenic.com/2004/09/15.html#a1353">Beyond personal webpublishing: An exploratory study of conversational blogging practices</a> &#8211; HICSS paper with visualisations and the rest</li>
<li><span>My blog posts on <a onmouseover="window.status='See more posts about: blogging conversations'; return true;" href="http://blog.mathemagenic.com/topics/topicsB.html#blogging_conversations" target="_self">blogging conversations</a></span></li>
</ul>
<li>Background</li>
<ul>
<li>My <span><a href="http://iceberg.notlong.com/">PhD research</a> (personal knowledge management and weblogs)</span></li>
<li>My <a href="http://www.telin.nl/project.cfm?id=340&amp;language=en">publications</a></li>
</ul>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2005/04/26.html#a1566">http://blog.mathemagenic.com/2005/04/26.html#a1566</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1566&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2005%2F04%2F26.html%23a1566">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-research/" title="blog research" rel="tag">blog research</a>, <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/scs/" title="SCS" rel="tag">SCS</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2004/11/18/in-search-for-a-virtual-settlement-an-exploration-of-weblog-community-boundaries-draft/" title="In search for a virtual settlement: An exploration of weblog community boundaries (draft) (November 18, 2004)">In search for a virtual settlement: An exploration of weblog community boundaries (draft)</a> </li>
	<li><a href="http://blog.mathemagenic.com/2005/04/26/social-computing-symposium-community-morning/" title="Social computing symposium: community morning (April 26, 2005)">Social computing symposium: community morning</a> </li>
	<li><a href="http://blog.mathemagenic.com/2004/05/22/can-blogging-replace-communities-of-practice/" title="Can blogging replace communities of practice? (May 22, 2004)">Can blogging replace communities of practice?</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2005/04/26/social-computing-symposium-blogtrace-demo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BlogTrace</title>
		<link>http://blog.mathemagenic.com/2005/01/29/blogtrace/</link>
		<comments>http://blog.mathemagenic.com/2005/01/29/blogtrace/#comments</comments>
		<pubDate>Sat, 29 Jan 2005 10:34:04 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Chapter 4. Conversations]]></category>
		<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[BlogTrace]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2005/01/29.html#a1494</guid>
		<description><![CDATA[Anjo shares details about BlogTrace, weblog analysis tool we are working on (as you can see from Anjo&#8217;s post my main contribution is motivating the work and then going for a vacation :))) There are too many specific comments I have, so at this moment just an image representing BlogTrace architecture. Read Anjo&#8217;s post for [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Anjo shares <a href="http://anjo.blogs.com/metis/2005/01/blogtrace.html">details about BlogTrace</a>, weblog analysis tool we are working on (as you can see from Anjo&#8217;s post my main contribution is motivating the work and then going for a vacation :))) </p>
<p>There are too many specific comments I have, so at this moment just an image representing BlogTrace architecture. Read <a href="http://anjo.blogs.com/metis/2005/01/blogtrace.html">Anjo&#8217;s post</a> for more details.</p>
<p><a href="http://anjo.blogs.com/metis/2005/01/blogtrace.html"><img src="http://anjo.blogs.com/metis/gifs/blogtrace.gif"/></a></p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2005/01/29.html#a1494">http://blog.mathemagenic.com/2005/01/29.html#a1494</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1494&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2005%2F01%2F29.html%23a1494">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/blogtrace/" title="BlogTrace" rel="tag">BlogTrace</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2005/09/04/shout-if-you-want-to-be-heard-or-technorati-blog-finder/" title="Shout if you want to be heard or Technorati blog finder (September 4, 2005)">Shout if you want to be heard or Technorati blog finder</a> </li>
	<li><a href="http://blog.mathemagenic.com/2005/09/29/km-bloggers-community/" title="KM bloggers community (September 29, 2005)">KM bloggers community</a> </li>
	<li><a href="http://blog.mathemagenic.com/2005/08/30/experimenting-with-creating-an-ontology-based-on-weblog-content/" title="Experimenting with creating an ontology based on weblog content (August 30, 2005)">Experimenting with creating an ontology based on weblog content</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2005/01/29/blogtrace/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ontological fingeprinting: documents or people</title>
		<link>http://blog.mathemagenic.com/2005/01/28/ontological-fingeprinting-documents-or-people/</link>
		<comments>http://blog.mathemagenic.com/2005/01/28/ontological-fingeprinting-documents-or-people/#comments</comments>
		<pubDate>Fri, 28 Jan 2005 10:29:11 +0000</pubDate>
		<dc:creator>Lilia Efimova</dc:creator>
				<category><![CDATA[Digital traces]]></category>
		<category><![CDATA[blog research tools]]></category>
		<category><![CDATA[knowledge mapping]]></category>
		<category><![CDATA[ontologies]]></category>

		<guid isPermaLink="false">http://blog.mathemagenic.com/2005/01/28.html#a1493</guid>
		<description><![CDATA[Anjo gives a bit of insight into our internal discussions on uses of ontologies: Andy Boyd came up with a wonderful new term: &#8220;ontological fingerprinting&#8221; and to illustrate how imaginative he is: zero hits on Google! Suppose one has an ontology (lexicon, thesaurus) and some software that can determine whether the terms in the ontology [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Anjo gives <a href="http://anjo.blogs.com/metis/2005/01/ontological_fin.html">a bit of insight</a> into our internal discussions on uses of ontologies:<br />
<blockquote class=cite><a href="http://croeso.typepad.com/">Andy Boyd</a> came up with a wonderful new term: <b>&#8220;ontological fingerprinting&#8221;</b> and to illustrate how imaginative he is: zero hits on Google! Suppose one has an ontology (lexicon, thesaurus) and some software that can determine whether the terms in the ontology are present in a document. Applying the software, one gets a &#8220;fingerprint&#8221; of the concepts in the ontology for a given document. Comparing fingerprints for different documents, such is the assumption, provides a better metric of the similarity between these documents than comparing plain words. Ideas like this simply have to be tested in practice. Fortunately, Andy is making available a lot of real data to try it.</p></blockquote>
<p>I like the term, but find it a bit misleading: usually documents do not have fingers :)</p>
<p>I&#8217;d associate the term with people &#8211; you may think of &#8220;ontological fingerprint&#8221; of a person, which could be something like conceptualisations produced by <a class="blines2" title="Link to another page in this blog" href="http://anjo.blogs.com/metis/2004/09/shared_conceptu.html" target="_blank">Sigmund</a> based on analysis of weblog posts written by someone, set of personal categories someone uses to classify a document or mapping one&#8217;s documents to a shared ontology. Then you can look for others with similar &#8220;fingerprints&#8221; (this was one of uses I imagined for Sigmund, but didn&#8217;t have such a nice term to talk about it :). </p>
<p>May be we should rather talk about &#8220;ontological abstract&#8221; in case of documents&#8230;</p>
<blockquote class="oldblog"><p>Archived version of this entry is available at <a href="http://blog.mathemagenic.com/2005/01/28.html#a1493">http://blog.mathemagenic.com/2005/01/28.html#a1493</a>; comments are <a href="http://radiocomments.userland.com/comments?u=109961&amp;p=1493&amp;link=http%3A%2F%2Fblog.mathemagenic.com%2F2005%2F01%2F28.html%23a1493">here</a>.</p></blockquote>

	Tags: <a href="http://blog.mathemagenic.com/tags/blog-research-tools/" title="blog research tools" rel="tag">blog research tools</a>, <a href="http://blog.mathemagenic.com/tags/knowledge-mapping/" title="knowledge mapping" rel="tag">knowledge mapping</a>, <a href="http://blog.mathemagenic.com/tags/ontologies/" title="ontologies" rel="tag">ontologies</a><br />

	<br>Related posts
	<ul class="st-related-posts">
	<li><a href="http://blog.mathemagenic.com/2004/06/08/my-dream-wikiweblog-tool/" title="My dream wiki/weblog tool (June 8, 2004)">My dream wiki/weblog tool</a> </li>
	<li><a href="http://blog.mathemagenic.com/2006/10/09/noaggregate-what-if-i-dont-want-my-digital-bits-to-be-connected-at-one-place/" title="NOAGGREGATE: what if I don&#8217;t want my digital bits to be connected at one place? (October 9, 2006)">NOAGGREGATE: what if I don&#8217;t want my digital bits to be connected at one place?</a> </li>
	<li><a href="http://blog.mathemagenic.com/2002/09/20/knowledge-workers-time-spent-finding-information/" title="Knowledge workers time spent finding information (September 20, 2002)">Knowledge workers time spent finding information</a> </li>
</ul>

]]></content:encoded>
			<wfw:commentRss>http://blog.mathemagenic.com/2005/01/28/ontological-fingeprinting-documents-or-people/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

