Re: Document clustering with Lucene

2008-05-17 Thread Grant Ingersoll
On May 17, 2008, at 1:15 PM, Supheakmungkol SARIN wrote: You're right. I want document clustering precisely the documents that are already in the index. I don't know much about Mahout project, but it seems that it doesn't help much. What I want is simply to group together similar documents

Re: Document clustering with Lucene

2008-05-17 Thread Supheakmungkol SARIN
he term vectors. Anyway, thank you Otis and Grant for your suggestions. I appreciate them. Regards, Supheakmungkol - Original Message From: Grant Ingersoll <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, May 16, 2008 7:22:39 PM Subject: Re: Document clustering wit

Re: Document clustering with Lucene

2008-05-16 Thread Grant Ingersoll
Do you want search result clustering or document clustering? My understanding of Carrot2 is it isn't designed for the latter. The difference being it is designed to work off of shorter snippets of text, as opposed to the whole document. FWIW, you _might_ find some help over on the Mahout

Re: Document clustering with Lucene

2008-05-15 Thread Otis Gospodnetic
Have you tried using Carrot2 with Lucene? They work quite well in tandem! Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Supheakmungkol SARIN <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Wednesday, May 14, 2008 11:23:45 PM