[ https://issues.apache.org/jira/browse/LUCENE-7686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated LUCENE-7686:
---------------------------------------
    Attachment: LUCENE-7686.patch

Another iteration ... it was trickier than I first realized because I have to handle the case where another (later) segment has the same suggestion with a better score, and replace the previous one in the priority queue.

So I broke out the dedup handling to a separate collector, {{DeduplicatingTopSuggestDocsCollector}}. And I added a new randomized test case in addition to the dedicated specific test case.

I think it's ready.

> NRT suggester should have option to filter out duplicates
> ---------------------------------------------------------
>
>                 Key: LUCENE-7686
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7686
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Michael McCandless
>            Assignee: Michael McCandless
>             Fix For: master (7.0), 6.5
>
>         Attachments: LUCENE-7686.patch, LUCENE-7686.patch
>
>
> Some of the other suggesters have this ability, and it's quite simple to add
> it to the NRT suggester as long as the thing we are filtering on is the
> suggest key itself, not e.g. another stored field from the document.
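
For readers following the thread, here is a rough, standalone sketch of the tricky case mentioned in the comment: the same suggestion key arriving again from a later segment with a better score, so the earlier entry has to be replaced in the priority queue. This is not the code from the attached patch and does not use any Lucene classes; {{DedupTopN}}, {{Hit}}, {{collect}} and {{topKeys}} are hypothetical names chosen for illustration, and it assumes a higher score is better.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

/** Hypothetical illustration only; not the collector from the patch. */
final class DedupTopN {
    /** A suggestion key plus its score (stand-in for a suggest hit). */
    static final class Hit {
        final String key;
        final float score;
        Hit(String key, float score) { this.key = key; this.score = score; }
    }

    // Weakest hit sorts first; ties are broken by key so distinct keys never compare equal.
    private static final Comparator<Hit> WEAKEST_FIRST = (a, b) -> {
        int c = Float.compare(a.score, b.score);
        return c != 0 ? c : a.key.compareTo(b.key);
    };

    private final int topN;
    private final Map<String, Hit> bestByKey = new HashMap<>();
    private final TreeSet<Hit> queue = new TreeSet<>(WEAKEST_FIRST);

    DedupTopN(int topN) { this.topN = topN; }

    /** Offer one hit; hits may arrive from any segment in any order. */
    void collect(String key, float score) {
        Hit prev = bestByKey.get(key);
        if (prev != null) {
            if (score <= prev.score) {
                return;              // duplicate that is no better: ignore it
            }
            queue.remove(prev);      // duplicate with a better score: drop the old entry
        } else if (queue.size() == topN) {
            Hit weakest = queue.first();
            if (weakest.score >= score) {
                return;              // new key, but not competitive
            }
            queue.pollFirst();       // evict the weakest to make room
            bestByKey.remove(weakest.key);
        }
        Hit hit = new Hit(key, score);
        bestByKey.put(key, hit);
        queue.add(hit);
    }

    /** Deduplicated keys, best score first. */
    List<String> topKeys() {
        List<String> keys = new ArrayList<>();
        for (Hit h : queue.descendingSet()) {
            keys.add(h.key);
        }
        return keys;
    }
}
```

The map from key to its current best hit is what makes the replacement possible: without it, a plain bounded priority queue could not find and remove the earlier, lower-scored copy when a later segment reports the same key. A sorted set is used here instead of a binary heap only because it supports removing an arbitrary entry directly.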