Friday, June 13, 2008
Inspired by Laura’s recent post, I decided it would be fun to run some of the data from DrupalModules.com through Wordle.
So, I wrote a script to parse the entire collection of released Drupal modules. It analyzed over 2,000 descriptions, written by hundreds of different module developers. This image shows the top 150 words used in the module descriptions, sized according to frequency.
Drupal Modules visualized
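The counting step could look roughly like this in Python. It is a minimal sketch, not the original script: the `top_words` function, the stop-word list, and the sample descriptions are all made up for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list, chosen for this sketch only.
STOP_WORDS = {"a", "an", "and", "the", "to", "of", "for", "in",
              "is", "this", "that", "with", "on", "it"}

def top_words(descriptions, limit=150):
    """Return the `limit` most frequent words across all descriptions."""
    counts = Counter()
    for text in descriptions:
        # Lowercase, then keep runs of letters (apostrophes allowed inside a word).
        words = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
        counts.update(w for w in words if w not in STOP_WORDS)
    return counts.most_common(limit)

# Made-up sample descriptions, for illustration only.
sample = [
    "Allows users to create custom content types.",
    "Provides a block for displaying content to users.",
    "Adds custom fields to user profiles.",
]

for word, count in top_words(sample, limit=3):
    print(word, count)
```

Note that "user" and "users" are counted as separate words here, which is exactly the plural duplication that shows up in a cloud built from raw counts.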
Unfortunately, Wordle outputs fairly small images. Fortunately, I figured out a way around that. Here’s a 1680-pixel-wide version!
I wanted to see what it would look like without the duplication caused by plural words, so I wrote a script to identify plurals and convert them to singular form (easier said than done; English is a tricky language). Here are the results: