I've read a number of papers on it; I was just looking for items that
people recommend, as a way to round out my knowledge of the different
approaches.
I've got the Data Mining book and the Foundations book, so I will
refresh my memory on those as well.
On Feb 11, 2009, at 12:39 PM, Isabel Drost wrote:
On Wednesday 11 February 2009, Grant Ingersoll wrote:
I'm looking for papers that you recommend on text clustering (I can,
of course, go search for them, but I'd like recommendations). New,
old, doesn't matter. Either send them here or add them to the wiki at
http://cwiki.apache.org/confluence/display/MAHOUT/Reference+Reading
Hmm, I know a few books that also cover the topic of clustering
texts. Maybe one of these would be a good starting point.
I like the book "Introduction to Information Retrieval" by Manning,
Raghavan and Schütze. It also contains some chapters on the topic.
"Data Mining" by Witten and Frank has a chapter on the topic.
"Foundations of Statistical Natural Language Processing" has a
chapter as well.
Are you looking for something in particular?
Isabel