Hi Jason, to get access to the Twitter Data you could use the Twitter Streaming API: http://apiwiki.twitter.com/Streaming-API-Documentation Regards Hannes
On Wed, Jan 20, 2010 at 10:02 PM, Ian Holsman <[email protected]> wrote: > On 1/20/10 2:35 AM, Jason Rutherglen wrote: > >> We've got Newsgroup classification. I'm kinda of interested in >> creating a Twitter classification system, or at least playing >> around with it. Also I think as a relevant growing large data >> set, it seems Twitter fit well with Hadoop based machine >> learning algorithms... Just throwing out into the wild! >> >> >> > Hi Jason. > I think the biggest issues here are twofold. > > 1. access to the data, although I'm sure the ASF could work something out > here > 2. training data. wouldn't you need a set of 'tweets' classified in some > manner? or were you thinking of using a different data source to base it on? >
