Yes, I know Carrot, but using it is not an option for me. Isn't there any way to tell Mahout which subset of documents to cluster?

2010/11/2 Ted Dunning <[email protected]>

> Have you looked at Carrot? It works very well.
>
> http://search.carrot2.org/stable/search
>
> On Tue, Nov 2, 2010 at 11:54 AM, Borbála Siklósi <[email protected]> wrote:
>
> > Maybe I have quite a simple question, but I haven't been able to find
> > the solution. I have a Solr index of documents and I run k-means
> > clustering on them. It all works fine. How can I run a keyword search
> > on the Solr index and run the clustering only on the result set? Can I
> > somehow specify which documents the algorithm should cluster?
