Hi,
I was wondering if it's possible to use only part of a Solr index for
clustering. For example, my crawler updates my Solr index every hour with
new documents, and I just want to cluster those new documents, not the old
ones. If I were programming this myself, I could query Solr for the latest
documents with a time constraint and then pass them as vectors to my
clustering program. But since Mahout accepts Solr indices directly, I
thought there might be a simpler way.
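
For reference, the manual approach I have in mind would look roughly like the sketch below. It only builds the query URL and is not tested against a live server. The core URL and the date field name `timestamp` are assumptions for illustration; the actual field depends on the schema.

```python
from urllib.parse import urlencode

# Hypothetical values: adjust the core URL and date field to match your setup.
SOLR_SELECT_URL = "http://localhost:8983/solr/mycore/select"
DATE_FIELD = "timestamp"


def recent_docs_url(hours=1, rows=1000):
    """Build a Solr select URL matching documents dated within the last `hours` hours."""
    params = {
        "q": "*:*",
        # Solr date math: a range from `hours` ago up to the current time.
        "fq": f"{DATE_FIELD}:[NOW-{hours}HOUR TO NOW]",
        "rows": rows,
        "wt": "json",
    }
    return SOLR_SELECT_URL + "?" + urlencode(params)


if __name__ == "__main__":
    print(recent_docs_url())
```

The documents returned by such a query would still have to be turned into vectors before clustering, which is the extra step I am hoping to avoid.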

-- 
Regards,
Ankit Goel
http://about.me/ankitgoel
