[ https://issues.apache.org/jira/browse/LUCENE-6986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Muir updated LUCENE-6986: -------------------------------- Attachment: LUCENE-6986.patch more docs improvements and mark classes lucene.experimental. I think its ready. > Add more DFI independence measures > ---------------------------------- > > Key: LUCENE-6986 > URL: https://issues.apache.org/jira/browse/LUCENE-6986 > Project: Lucene - Core > Issue Type: Improvement > Reporter: Robert Muir > Attachments: LUCENE-6986.patch, LUCENE-6986.patch, LUCENE-6986.patch > > > Since LUCENE-6818 we have DFISimilarity which implements normalized > chi-squared distance. > But there are other alternatives (as described in > http://trec.nist.gov/pubs/trec21/papers/irra.web.nb.pdf): > * normalized chi-squared: "can be used for tasks that require high precision, > against both short and long queries" > * standardized: "good at tasks that require high recall and high precision, > especially against short queries composed of a few words as in the case of > Internet searches" > * saturated: "for tasks that require high recall against long queries" > I think we should just provide the three independence measures, and let the > user choose. Similar to how we do DFR/IB/etc. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org