If you only need to cluster search results rather than the full document
collection, take a look at the Carrot2 project. How to proceed depends on how
you want clustering integrated into your query processing pipeline, but the
Carrot2 demo applications are a reasonable starting point.
http://www.carrot2.org
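On the document-to-point question quoted below, here is a minimal sketch of one common approach: turning each document's tokens into a TF-IDF weighted vector. It uses only the Java standard library and no Lucene or Mahout calls. The class and method names (TfIdfSketch, vectorize, l2Normalize) are hypothetical, and it assumes you have already pulled the text of each hit out of the index and tokenized it. The resulting arrays would still need to be wrapped in whatever vector type your clustering library expects.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

// Hypothetical helper, not part of Lucene, Mahout, or Carrot2.
class TfIdfSketch {

    // Assigns each distinct term a column index, in sorted term order.
    static Map<String, Integer> buildVocabulary(List<List<String>> docs) {
        TreeSet<String> terms = new TreeSet<>();
        for (List<String> doc : docs) {
            terms.addAll(doc);
        }
        Map<String, Integer> vocab = new LinkedHashMap<>();
        for (String term : terms) {
            vocab.put(term, vocab.size());
        }
        return vocab;
    }

    // Returns one row per document; each cell is tf * (ln(N / df) + 1).
    static double[][] vectorize(List<List<String>> docs) {
        Map<String, Integer> vocab = buildVocabulary(docs);
        int n = docs.size();
        double[][] vectors = new double[n][vocab.size()];
        int[] docFreq = new int[vocab.size()];

        // First pass: raw term counts and document frequencies.
        for (int d = 0; d < n; d++) {
            for (String term : docs.get(d)) {
                int col = vocab.get(term);
                if (vectors[d][col] == 0.0) {
                    docFreq[col]++; // first time this term is seen in document d
                }
                vectors[d][col] += 1.0;
            }
        }

        // Second pass: scale counts by inverse document frequency.
        // docFreq[col] is never 0 because the vocabulary comes from these docs.
        for (int d = 0; d < n; d++) {
            for (int col = 0; col < docFreq.length; col++) {
                double idf = Math.log((double) n / docFreq[col]) + 1.0;
                vectors[d][col] *= idf;
            }
        }
        return vectors;
    }

    // Scales a vector to unit length in place; all-zero vectors are left alone.
    static void l2Normalize(double[] v) {
        double sumSquares = 0.0;
        for (double x : v) {
            sumSquares += x * x;
        }
        if (sumSquares == 0.0) {
            return;
        }
        double norm = Math.sqrt(sumSquares);
        for (int i = 0; i < v.length; i++) {
            v[i] /= norm;
        }
    }

    // Very naive tokenizer, for illustration only.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("\\W+")) {
            if (!t.isEmpty()) {
                tokens.add(t);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        List<List<String>> docs = new ArrayList<>();
        docs.add(tokenize("Lucene index"));
        docs.add(tokenize("Lucene search"));
        double[][] vectors = vectorize(docs);
        for (double[] v : vectors) {
            l2Normalize(v);
            System.out.println(java.util.Arrays.toString(v));
        }
    }
}
```

In practice you would probably want the same analyzer for tokenizing that you used at indexing time, and sparse vectors instead of dense arrays once the vocabulary grows.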
Dawid
Max wrote:
Hi list,
I would like to do some clustering of Lucene Documents. I have a Lucene index
and I run my search on the index. The search result is composed of a list of
documents. How can I translate my list of documents into a format suitable for
Mahout? I have seen that this library contains some clustering algorithms, but
it does not provide (at least I haven't found) any translation from a document
to a point. Do I have to implement this myself, or does it already exist?
Thanks in advance.