Re: Lucene scoring components

Adrien Grand Tue, 17 Jul 2018 10:14:14 -0700

You could extend this class and provide your own implementation to
incorporate term frequency into the final score. For the record, you might
want to look into BM25Similarity, which takes term frequency into account,
but in a way that gives a much lower score contribution to hits than
ClassicSimilarity. More generally, BM25Similarity is considered a superior
alternative to ClassicSimilarity (the canonical implementation of
TFIDFSimilarity that you linked).


Le mar. 17 juil. 2018 à 19:04, <baris.ka...@oracle.com> a écrit :

> i forgot to put the doc that i was referring to:
>
>
> https://lucene.apache.org/core/6_0_1/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html
>
> Best regards
>
>
> On 7/17/18 1:01 PM, baris.ka...@oracle.com wrote:
> > Hi,-
> >
> >  is there a way to diminish the tf(t in d) component to 1? i dont want
> > the number of times a word appears to affect the scoring for my app.
> >
> > Best regards
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> > For additional commands, e-mail: java-user-h...@lucene.apache.org
> >
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-user-h...@lucene.apache.org
>
>

Re: Lucene scoring components

Reply via email to