On May 17, 2008, at 1:15 PM, Supheakmungkol SARIN wrote:
You're right. I want document clustering, specifically of the documents
that are already in the index. I don't know much about the Mahout
project, but it doesn't seem to help much. What I want is
simply to group together similar documents based on
the term vectors.
Anyway, thank you Otis and Grant for your suggestions. I appreciate them.
Regards,
Supheakmungkol
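
The idea described above, grouping similar documents by their term vectors, could be sketched roughly as follows. This is only an illustration under stated assumptions, not something taken from Lucene, Carrot2, or Mahout: the class TermVectorClustering and its methods are hypothetical, the term-frequency maps stand in for whatever term vectors are stored in the index, and the greedy threshold pass is just one simple way to form groups.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper class; it does not use or mirror any Lucene API.
final class TermVectorClustering {

    // Toy stand-in for a stored term vector: counts whitespace-separated terms.
    static Map<String, Integer> termFrequencies(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String term : text.toLowerCase().split("\\s+")) {
            if (!term.isEmpty()) {
                tf.merge(term, 1, Integer::sum);
            }
        }
        return tf;
    }

    // Cosine similarity between two sparse term-frequency vectors.
    static double cosine(Map<String, Integer> a, Map<String, Integer> b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (Map.Entry<String, Integer> e : a.entrySet()) {
            int fa = e.getValue();
            normA += (double) fa * fa;
            Integer fb = b.get(e.getKey());
            if (fb != null) {
                dot += (double) fa * fb;
            }
        }
        for (int fb : b.values()) {
            normB += (double) fb * fb;
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0; // an empty vector is similar to nothing
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    // Greedy single-pass grouping: a document joins the first cluster whose
    // first member is at least `threshold` similar, otherwise it starts a new one.
    static List<List<Integer>> cluster(List<Map<String, Integer>> docs, double threshold) {
        List<List<Integer>> clusters = new ArrayList<>();
        for (int i = 0; i < docs.size(); i++) {
            boolean placed = false;
            for (List<Integer> c : clusters) {
                if (cosine(docs.get(c.get(0)), docs.get(i)) >= threshold) {
                    c.add(i);
                    placed = true;
                    break;
                }
            }
            if (!placed) {
                List<Integer> fresh = new ArrayList<>();
                fresh.add(i);
                clusters.add(fresh);
            }
        }
        return clusters;
    }

    public static void main(String[] args) {
        List<Map<String, Integer>> docs = new ArrayList<>();
        docs.add(termFrequencies("lucene index search"));
        docs.add(termFrequencies("lucene search index query"));
        docs.add(termFrequencies("cooking pasta recipe"));
        // Prints the clusters as lists of document positions.
        System.out.println(cluster(docs, 0.5));
    }
}
```

In a real setup the maps would be filled from the index's stored term vectors rather than from raw strings, and raw counts would likely be replaced by TF-IDF weights. The 0.5 threshold is arbitrary and would need tuning.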
----- Original Message -----
From: Grant Ingersoll <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Friday, May 16, 2008 7:22:39 PM
Subject: Re: Document clustering wit
Do you want search result clustering or document clustering? My
understanding of Carrot2 is it isn't designed for the latter. The
difference being it is designed to work off of shorter snippets of
text, as opposed to the whole document. FWIW, you _might_ find some
help over on the Mahout
Have you tried using Carrot2 with Lucene? They work quite well in tandem!
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
----- Original Message -----
> From: Supheakmungkol SARIN <[EMAIL PROTECTED]>
> To: java-user@lucene.apache.org
> Sent: Wednesday, May 14, 2008 11:23:45 PM