Hi Grant, In Carrot2 (and Carrot Search's commercial products) we're not using Lucene as an indexing/ search service directly, but we are re-using a lot of internal infrastructure (like analyzers, ported snowball stemmers and other segmentation stuff). We also plan on using the new language identifiers, automata, tests framework...
I guess this shows that Lucene is a lot _more_ than just a document retrieval library. There are nuggets in the codebase that one can utilize on their own, without the rest of Lucene. If you need details, let me know on prv, I'll scan the sources and provide concrete examples. Dawid On Sun, Oct 23, 2011 at 2:33 AM, Shashi Kant <[email protected]> wrote: > Using Lucene as a recommendation engine. > > On Sat, Oct 22, 2011 at 6:33 PM, Grant Ingersoll <[email protected]> wrote: >> >> On Oct 22, 2011, at 6:03 PM, Sujit Pal wrote: >> >>> Hi Grant, >>> >>> Not sure if this qualifies as a "bet you didn't know", but one could use >>> Lucene term vectors to construct document vectors for similarity, >>> clustering and classification tasks. I found this out recently (although >>> I am probably not the first one), and I think this could be quite >>> useful. >> >> Yep, had these on my list! >> >> --------------------------------------------------------------------- >> To unsubscribe, e-mail: [email protected] >> For additional commands, e-mail: [email protected] >> >> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] > For additional commands, e-mail: [email protected] > > --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
