Re: applying cosine similarity directly

2009-09-12 Thread Anthony Urso
There is a MoreLikeThis similarity search class in Lucene, it should do what you're looking for. http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/search/similar/MoreLikeThis.html Cheers, Anthony On Fri, Sep 11, 2009 at 11:25 PM, Alexy Khrabrov wrote: > Given that I have a field for whi

applying cosine similarity directly

2009-09-11 Thread Alexy Khrabrov
Given that I have a field for which term vector was computed and stored, and that field is the text of a document, I'd like to rank a subset of such documents by similarity to a given held-out document, or query, directly using the cosine measure. How can that be done without going through creatin