Maybe of interest to some of you: A data challenge is scheduled for the "4th Int'l AAAI Conference on Weblogs and Social Media". It is based on the Spinn3r blog data set.
Among the community-created data resources, there is also a Lucene index of the ICWSM 2009 collection (Dan Knights, JD Power & Assoc.). For more information: http://www.icwsm.org/2010/data.shtml Isabel
