Visualizing English, Spanish, Japanese in the blogosphere

Update (Feb. 2012): The paper is now published and freely available from the Journal of Computer-Mediated Communication: http://onlinelibrary.wiley.com/doi/10.1111/j.1083-6101.2011.01568.x/full.

Update (Dec. 2011): The full paper from which this dataset comes will be published in the Journal of Computer-Mediated Communication in January 2012. The preprint copy of this paper is linked above. In addition, this visualization is now live on the new OII Visualization Gallery.

I recently revisited the data I collected last year following the January earthquake in Haiti. I found a new visualization package, Tulip, and used it to visualize the largest connected component of my network. The result and a description follow:


This diagram represents 5,703 blog posts about the Haitian earthquake and the links between them in the largest connected component of the network. Blog posts are in English (yellow), Spanish (red), and Japanese (blue). The nodes are positioned using a force-directed GEM layout in Tulip.

The overall network consists of 113,117 blog posts collected in a 45-day period following the earthquake. Only about 5% of the links connect posts in different languages. Of these, most link from personal blogs in Japanese and Spanish to media and professional blogs in English. About 1% of links contain human translation of the blog content. Significantly fewer cross-lingual links originate in English posts than in Spanish or Japanese posts.
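For readers who want to try similar measurements on their own link data, here is a minimal sketch of the two quantities mentioned above: the largest connected component and the share of links that cross languages. It is not the code used for the paper. The function names, the tiny example network, and the choice to ignore link direction when finding components are all assumptions made for illustration.

```python
from collections import defaultdict, deque


def largest_component(nodes, links):
    """Return the largest connected component as a set of node ids.

    Link direction is ignored here (an assumption for this sketch).
    """
    neighbors = defaultdict(set)
    for src, dst in links:
        neighbors[src].add(dst)
        neighbors[dst].add(src)

    seen, best = set(), set()
    for start in nodes:
        if start in seen:
            continue
        # Breadth-first search from each node not yet assigned to a component.
        component, queue = {start}, deque([start])
        while queue:
            node = queue.popleft()
            for other in neighbors[node]:
                if other not in component:
                    component.add(other)
                    queue.append(other)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


def cross_lingual_share(language, links):
    """Fraction of links whose source and target posts differ in language."""
    if not links:
        return 0.0
    crossing = sum(1 for src, dst in links if language[src] != language[dst])
    return crossing / len(links)


# Toy data invented for illustration only; it is not from the study.
language = {"a": "en", "b": "en", "c": "es", "d": "ja", "e": "ja", "f": "es"}
links = [("a", "b"), ("c", "a"), ("d", "e"), ("b", "a")]

component = largest_component(language, links)
share = cross_lingual_share(language, links)
print(sorted(component), share)
```

In this toy network, posts a, b, and c form the largest component, and one of the four links (the Spanish post c linking to the English post a) crosses languages.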


