[
https://issues.apache.org/jira/browse/LUCENE-7347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15339551#comment-15339551
]
Michael McCandless commented on LUCENE-7347:
--------------------------------------------
No modern IR scoring models use coord anymore ... they simply have better term
saturation such that zillions of occurrences of one term, by design, cannot
alter the score as much as one occurrence of one of the other terms in the
query. This makes coord obsolete.
I don't think Lucene should cling to archaic scoring models, especially when
this clinging holds back important improvements, e.g. {{BooleanQuery}} could do
more aggressive rewriting if its hands were not tied by coord.
> Remove queryNorm and coords
> ---------------------------
>
> Key: LUCENE-7347
> URL: https://issues.apache.org/jira/browse/LUCENE-7347
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Assignee: Adrien Grand
>
> These two features are specific to TF-IDF and introduce some complexity (see
> eg. handling of coords in BooleanWeight) and bugs/corner-cases (see eg. how
> taking the query norm into account causes scoring challenges on LUCENE-7337).
> Since we made BM25 the default in 6.0, I propose that we remove these
> TF-IDF-specific features in 7.0.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]