On Wednesday 11 February 2009, Grant Ingersoll wrote: > I'm looking for papers that you recommend on text clustering (I can, > of course, go search for them, but I'd like recommendations). New, > old, doesn't matter. Either send them here or add them to the wiki at > http://cwiki.apache.org/confluence/display/MAHOUT/Reference+Reading
Hmm, I know a few books that also cover the topic of clustering texts - maybe one of these would be a good starting point. I like the book "Introduction to Information Retrieval" by Manning, Raghavan and Schütze. It also contains some chapters on the topic. "Data Mining" from Witten and Frank has a chapter on the topic. "Foundations of Statistical Natural Language Processing" has a chapter as well. Are you looking for something in particular? Isabel -- Check it out, send me comments, and dance joyously in the streets, -- Linus Torvalds announcing 2.0.27 |\ _,,,---,,_ Web: <http://www.isabel-drost.de> /,`.-'`' -. ;-;;,_ |,4- ) )-,_..;\ ( `'-' '---''(_/--' `-'\_) (fL) IM: <xmpp://[email protected]>
signature.asc
Description: This is a digitally signed message part.
