http://people.apache.org/~robweir/ooo-dev-cloud.png

This looks at the top 1000 terms used in ooo-dev post subjects since
this project moved to Apache in June 2011.  The only thing I removed
was "Re:", since that would have dominated the cloud and is machine,
not user written

In this particular cloud, I used all posts, including responses.  So
if a term was used in a thread that had many responses, it would have
additional weight in this chart.

Technologies used:

Python's mailbox API to extract the post titles.  Could have done this
with any number of command line text tools as well, but it is trivial
in Python as well:

import mailbox

box = mailbox.mbox(fileName)

for message in box:
     print message['Subject']


Then I used Wordle.net to generate the graphic.

Based on the reaction given to the previous word cloud, I know that
some list subscribers are curious to see how often we write about
LibreOffice.  So I'll help you find it in this graphic.  Look for the
big "AOO", then under that see the "COMMIT".  Under COMMIT you can
make out LIBREOFFICE, to the left of USERS.

Regards,

-Rob

Reply via email to