Feed your blog to tOKo and see what comes out

by Lilia Efimova on 11 April 2006

Anjo is moving further in developing a blog-friendly version of tOKo (related to all our earlier work on weblog communities, conversations and topics):

A little bit of progress on the open source version of tOKo (and the like), and in particular making it suitable for bloggers.

The first problem is turning a (your?) blog into a corpus. tOKo is pretty flexible as to what a corpus looks like, but the process must be automated. Jack Vinson and Ton Zijlstra provided great help by converting their blogs to a Movable Type export file and making the result available. Therefore, tOKo now contains a “Create corpus from Movable Type” function. The nice thing is that several blogging platforms provide Movable Type (MT) export. For example, in TypePad (which I use) a MT file can be generated from the web interface. Moreover, an MT file contains all information, including comments and trackbacks.

I’m getting into research fun anticipation – getting hold of comments next to post text would be such a great thing for the analysis :)

And, if want to help to develop the tool you can contribute your blog archives in Movable Type format (WPexport could be handy for WordPress users). This especially makes sense if you feel belonging to KM bloggers community (paper) – or, as Anjo puts it:

If you have linked to Jack, Ton, Lilia or myself in the past, this would be particularly interesting (also if you can only export to Movable Type). The only disadvantage of making your weblog available is that I might ask you to alpha-test tOKo :-).

My email address is: anjo science uva nl (one at, two dots).

You get a bit more insight about this work from Ton’s impressions on the work in progress and Anjo’s visualisations (1, 2, 3, 4).

Archived version of this entry is available at http://blog.mathemagenic.com/2006/04/11.html#a1761; comments are here.

Leave a Comment

You can use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

Previous post:

Next post: