You could extend this class and provide your own implementation of tf() to control how term frequency is incorporated into the final score; returning a constant removes its effect. For the record, you might want to look into BM25Similarity, which still takes term frequency into account, but its term-frequency component saturates, so repeated occurrences of a term contribute much less to the score than they do with ClassicSimilarity. More generally, BM25Similarity is considered a superior alternative to ClassicSimilarity (the canonical implementation of TFIDFSimilarity, which you linked).
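To make the difference concrete, here is a rough standalone sketch of the three term-frequency behaviours discussed above. It does not depend on Lucene and only mirrors the arithmetic: the class name TfDemo and its method names are made up for illustration, and the BM25 variant assumes a document of average length so that length normalization drops out. In Lucene itself, the flat variant would be the body of an overridden tf(float freq) in a subclass of ClassicSimilarity, set on both the IndexWriterConfig and the IndexSearcher.

```java
// Standalone sketch: mirrors the term-frequency arithmetic only, no Lucene dependency.
// TfDemo and its method names are hypothetical, chosen for this illustration.
final class TfDemo {

    // What ClassicSimilarity computes for tf: the square root of the raw frequency.
    static float classicTf(float freq) {
        return (float) Math.sqrt(freq);
    }

    // Flat variant: any matching document gets tf = 1, however often the term occurs.
    // In Lucene this would be the body of an overridden tf(float freq).
    static float flatTf(float freq) {
        return freq > 0 ? 1.0f : 0.0f;
    }

    // BM25-style saturating tf factor. Assumption: the document has average length,
    // so the length-normalization parameter b has no effect. It approaches k1 + 1
    // as freq grows; k1 = 1.2 is the usual default.
    static float bm25Tf(float freq, float k1) {
        return freq * (k1 + 1) / (freq + k1);
    }

    public static void main(String[] args) {
        // Print the three factors side by side for a few raw frequencies.
        for (float freq : new float[] {1f, 4f, 16f, 100f}) {
            System.out.printf("freq=%5.0f classic=%6.2f flat=%4.1f bm25=%5.3f%n",
                    freq, classicTf(freq), flatTf(freq), bm25Tf(freq, 1.2f));
        }
    }
}
```

The flat variant is what the original question asks for; the BM25 column is there only to show how quickly that factor levels off compared with the square root.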
On Tue, Jul 17, 2018 at 19:04, <baris.ka...@oracle.com> wrote:

> i forgot to put the doc that i was referring to:
>
> https://lucene.apache.org/core/6_0_1/core/org/apache/lucene/search/similarities/TFIDFSimilarity.html
>
> Best regards
>
> On 7/17/18 1:01 PM, baris.ka...@oracle.com wrote:
> > Hi,-
> >
> > is there a way to diminish the tf(t in d) component to 1? i don't want
> > the number of times a word appears to affect the scoring for my app.
> >
> > Best regards
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
> > For additional commands, e-mail: java-user-h...@lucene.apache.org