enable DefaultSimilarity.setDiscountOverlaps by default
-------------------------------------------------------
Key: LUCENE-2286
URL: https://issues.apache.org/jira/browse/LUCENE-2286
Project: Lucene - Java
Issue Type: Improvement
Components: Query/Scoring
Reporter: Robert Muir
I think we should enable setDiscountOverlaps in DefaultSimilarity by default.
If you are using synonyms or commongrams or a number of other
0-posInc-term-injecting methods, these currently screw up your length
normalization.
These terms have a position increment of zero, so they shouldn't count towards
the length of the document.
I've done relevance tests with Persian showing the difference is significant,
and I think it's a big trap for anyone using synonyms, etc.: your relevance can
actually get worse if you don't flip this boolean flag.
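
To make the length-normalization effect concrete, here is a minimal standalone
sketch. It does not call Lucene; the class and method names are hypothetical,
and it assumes the norm is 1/sqrt(numTerms), which is my understanding of what
DefaultSimilarity uses. With discounting on, tokens whose position increment
is 0 are left out of numTerms.

```java
// Hypothetical illustration only: not Lucene code, just the arithmetic.
class DiscountOverlapsSketch {

    // Count the tokens that contribute to field length. When discountOverlaps
    // is true, tokens with a position increment of 0 (injected synonyms,
    // common-grams, etc.) are skipped.
    static int effectiveLength(int[] positionIncrements, boolean discountOverlaps) {
        int length = 0;
        for (int posInc : positionIncrements) {
            if (discountOverlaps && posInc == 0) {
                continue; // overlapping token: same position as the previous one
            }
            length++;
        }
        return length;
    }

    // Assumed norm formula: 1 / sqrt(numTerms).
    static double lengthNorm(int numTerms) {
        return 1.0 / Math.sqrt(numTerms);
    }

    public static void main(String[] args) {
        // Four original tokens, two of which had a synonym injected at the
        // same position (posInc == 0).
        int[] posIncs = {1, 0, 1, 0, 1, 1};

        int counted = effectiveLength(posIncs, false);
        int discounted = effectiveLength(posIncs, true);

        System.out.println("overlaps counted:    length=" + counted
                + " norm=" + lengthNorm(counted));
        System.out.println("overlaps discounted: length=" + discounted
                + " norm=" + lengthNorm(discounted));
    }
}
```

Without discounting, the document looks 50% longer than it really is, so it
gets a lower norm purely because synonyms were injected.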