I've read a number of papers on it, was just looking for items that people recommend as a way to, potentially, round out my knowledge of the different approaches.

I've got the Data Mining book and the Foundations book, so will refresh my memory on those as well


On Feb 11, 2009, at 12:39 PM, Isabel Drost wrote:

On Wednesday 11 February 2009, Grant Ingersoll wrote:
I'm looking for papers that you recommend on text clustering (I can,
of course, go search for them, but I'd like recommendations).  New,
old, doesn't matter. Either send them here or add them to the wiki at
http://cwiki.apache.org/confluence/display/MAHOUT/Reference+Reading

Hmm, I know a few books that also cover the topic of clustering texts - maybe
one of these would be a good starting point.

I like the book "Introduction to Information Retrieval" by Manning, Raghavan
and Schütze. It also contains some chapters on the topic.

"Data Mining" from Witten and Frank has a chapter on the topic.

"Foundations of Statistical Natural Language Processing" has a chapter as
well.

Are you looking for something in particular?

Isabel


--
Check it out, send me comments, and dance joyously in the streets, -- Linus
Torvalds announcing 2.0.27
 |\      _,,,---,,_       Web:   <http://www.isabel-drost.de>
 /,`.-'`'    -.  ;-;;,_
|,4-  ) )-,_..;\ (  `'-'
'---''(_/--'  `-'\_) (fL)  IM:  <xmpp://[email protected]>

Reply via email to