Re: Lucene cosine similarity score for more like this query

2015-02-03 Thread Ali Nazemian
Dear Koji, Thank you very much. Do you know the range of the score in this new formula? What is a reasonable threshold for considering two documents similar enough under this formula? Regards. On Tue, Feb 3, 2015 at 1:35 PM, Koji Sekiguchi wrote: > Lucene uses TFIDFSimilarity class to calc

Re: Lucene cosine similarity score for more like this query

2015-02-03 Thread Koji Sekiguchi
Lucene uses the TFIDFSimilarity class to calculate the similarity. It is based on the idea of cosine similarity, but it modifies the cosine formula. Please take a look at "Lucene Practical Scoring Function" in the following Javadoc: http://lucene.apache.org/core/4_10_3/core/org/apache/lucene/
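
To illustrate why a score built this way is not confined to [0, 1], here is a minimal Python sketch that compares textbook cosine similarity with a simplified version of the practical scoring function. It assumes the default TF-IDF choices described in the Lucene 4.x Javadoc (tf = sqrt(freq), idf = 1 + ln(numDocs / (docFreq + 1)), length norm = 1 / sqrt(number of terms), plus coord and queryNorm factors), and it ignores boosts and the lossy single-byte encoding of norms. The function names and the toy index statistics are hypothetical and exist only for this example; this is not Lucene's actual code.

```python
import math


def cosine(a, b):
    """Textbook cosine similarity between two sparse term-weight vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def idf(doc_freq, num_docs):
    """Assumed default idf: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def practical_score(query_terms, doc_term_freqs, doc_freqs, num_docs):
    """Simplified practical scoring function (no boosts, exact norms).

    query_terms    -- list of distinct query terms
    doc_term_freqs -- term -> frequency in the scored document
    doc_freqs      -- term -> number of documents containing the term
    """
    if not query_terms or not doc_term_freqs:
        return 0.0
    idfs = {t: idf(doc_freqs.get(t, 0), num_docs) for t in query_terms}
    # queryNorm normalizes only the query side of the product.
    query_norm = 1.0 / math.sqrt(sum(v * v for v in idfs.values()))
    # The length norm uses the field length, not the Euclidean norm of the
    # document's tf-idf vector, so the document side is not unit length.
    length_norm = 1.0 / math.sqrt(sum(doc_term_freqs.values()))
    matched = [t for t in query_terms if doc_term_freqs.get(t, 0) > 0]
    coord = len(matched) / len(query_terms)
    total = sum(
        math.sqrt(doc_term_freqs[t]) * idfs[t] ** 2 * length_norm
        for t in matched
    )
    return coord * query_norm * total


# Toy case: a one-term query against a one-term document, where the term
# occurs in 1 of 1000 documents.
print(cosine({"lucene": 1.0}, {"lucene": 1.0}))                     # 1.0
print(practical_score(["lucene"], {"lucene": 1}, {"lucene": 1}, 1000))  # about 7.2
```

In this toy case the idf appears squared in the sum but only once in queryNorm, so the result is the idf itself (about 7.2) rather than 1. Under these assumptions the score has no fixed upper bound and depends on the index statistics.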

Re: Lucene cosine similarity score for more like this query

2015-02-02 Thread Dikshant Shahi
39 > > To: solr-user@lucene.apache.org > > Subject: Re: Lucene cosine similarity score for more like this query > > > > Dear Erik, > > Thank you for your response. Would you please tell me why this score > could > > be higher than 1, while cosine similarity ca

RE: Lucene cosine similarity score for more like this query

2015-02-02 Thread Markus Jelsma
ucene cosine similarity score for more like this query > > Dear Erik, > Thank you for your response. Would you please tell me why this score could > be higher than 1, while cosine similarity cannot be higher than 1? > On Feb 2, 2015 7:32 PM, "Erik Hatcher" wrote: > >

Re: Lucene cosine similarity score for more like this query

2015-02-02 Thread Ali Nazemian
Dear Erik, Thank you for your response. Would you please tell me why this score could be higher than 1, while cosine similarity cannot be higher than 1? On Feb 2, 2015 7:32 PM, "Erik Hatcher" wrote: > The scoring is the same as Lucene. To get deeper insight into how a score > is computed, use S

Re: Lucene cosine similarity score for more like this query

2015-02-02 Thread Erik Hatcher
The scoring is the same as Lucene. To get deeper insight into how a score is computed, use Solr’s debug=true mode to see the explain details in the response. Erik > On Feb 2, 2015, at 10:49 AM, Ali Nazemian wrote: > > Hi, > I was wondering what is the range of score is brought by more
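
As a rough illustration of the debug=true suggestion, the sketch below builds a request URL for a MoreLikeThis handler with debugging enabled. The host, core name, handler path (/mlt), document query, and field names are all assumptions made for this example; a MoreLikeThis handler has to be registered in solrconfig.xml for such a path to exist, and your setup may differ.

```python
from urllib.parse import urlencode


def build_mlt_debug_url(base_url, core, doc_query, fields):
    """Build a MoreLikeThis request URL that asks Solr for explain output.

    base_url  -- e.g. "http://localhost:8983/solr" (hypothetical)
    core      -- core or collection name (hypothetical)
    doc_query -- query selecting the source document, e.g. "id:1234"
    fields    -- fields to use for similarity (the mlt.fl parameter)
    """
    params = {
        "q": doc_query,
        "mlt.fl": ",".join(fields),
        "fl": "id,score",   # return the score next to each similar document
        "debug": "true",    # include explain details in the response
        "wt": "json",
    }
    return f"{base_url}/{core}/mlt?{urlencode(params)}"


url = build_mlt_debug_url(
    "http://localhost:8983/solr", "collection1", "id:1234", ["title", "body"]
)
print(url)
```

Fetching that URL should return, alongside the similar documents, an explain section that breaks each score into its tf, idf, norm, and queryNorm parts.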