Thank you for the information, Robert.
The argument against the normalized score makes sense for the regular
kind of search where queries are much shorter than the documents.
But MLT is a document-vs-document search. Can't we define a 100% match as
all terms being found in both documents at the same
See https://cwiki.apache.org/confluence/display/LUCENE/ScoresAsPercentages
which has some broken nabble links, but is still valid.
TLDR: Scoring just doesn't work the way you think. Don't try to
interpret it as an absolute value; it is a relative one.
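To illustrate why a raw score has no absolute meaning, here is a minimal sketch in plain Java. It uses a toy TF-IDF sum, which is an assumption made for illustration and is not Lucene's actual scoring formula. The class ToyScoreDemo and its methods are hypothetical names, not part of any Lucene API. The sketch scores a document against itself and against its neighbours.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy TF-IDF scorer. This is NOT Lucene's formula, only an illustration
// of why a score is meaningful relative to other scores, not on its own.
final class ToyScoreDemo {

    // Count how often each whitespace-separated term occurs in the text.
    static Map<String, Integer> termFreqs(String text) {
        Map<String, Integer> tf = new HashMap<>();
        for (String t : text.toLowerCase().split("\\s+")) {
            if (!t.isEmpty()) {
                tf.merge(t, 1, Integer::sum);
            }
        }
        return tf;
    }

    // Sum of tf * idf over the distinct query terms that occur in the document.
    // The idf depends on the whole corpus, so the result does too.
    static double score(String query, String doc, List<String> corpus) {
        Map<String, Integer> docTf = termFreqs(doc);
        double sum = 0.0;
        for (String term : termFreqs(query).keySet()) {
            Integer tf = docTf.get(term);
            if (tf == null) {
                continue; // term missing from the document contributes nothing
            }
            long df = corpus.stream()
                    .filter(d -> termFreqs(d).containsKey(term))
                    .count();
            // The +1 in the denominator avoids division by zero (toy smoothing).
            double idf = Math.log(1.0 + (double) corpus.size() / (df + 1));
            sum += tf * idf;
        }
        return sum;
    }

    public static void main(String[] args) {
        List<String> corpus = Arrays.asList(
                "apache lucene search library",
                "apache solr search server",
                "more like this query");
        String query = corpus.get(0); // use a whole document as the query
        for (String d : corpus) {
            System.out.println(score(query, d, corpus) + "  " + d);
        }
    }
}
```

In this sketch the document scored against itself ranks first, but its score is not 1.0, and it changes as soon as another document is added to the corpus. That is why only the ordering of the scores is safe to rely on.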
On Fri, May 28, 2021 at 1:36 PM TK Solr
I'd like to have suggestions on changing the scoring algorithm
of MoreLikeThis.
When I feed a string identical to the content of a document in the index
to MoreLikeThis.like("field", new StringReader(docContent)),
I get a score below 1.0 (0.944 in one of my test cases) rather than the 1.0 I expect.
What