Atri Sharma created LUCENE-8875:
-----------------------------------

             Summary: Should TopScoreDocCollector Always Populate Sentinel Values?
                 Key: LUCENE-8875
                 URL: https://issues.apache.org/jira/browse/LUCENE-8875
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Atri Sharma

TopScoreDocCollector always initializes a HitQueue as its priority queue (PQ) implementation and instructs the HitQueue to prepopulate itself with sentinel values. While this is a useful safety mechanism, for very large datasets where the query's selectivity is high, populating the sentinels can be redundant and can become a significant bottleneck in its own right.

Does it make sense to introduce a new parameter in TopScoreDocCollector that applies a heuristic (say, number of hits > 10k) and skips sentinel population when it is met?
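
To make the trade-off concrete, here is a rough, self-contained sketch of the two strategies in question. It does not use Lucene's HitQueue or TopScoreDocCollector. The class SentinelSketch, its methods, and the choice of negative infinity as the sentinel score are all assumptions made for illustration, and java.util.PriorityQueue stands in for the real queue. It also assumes k >= 1 and that no real hit scores negative infinity.

```java
import java.util.PriorityQueue;

// Hypothetical illustration only; this is NOT Lucene's HitQueue or
// TopScoreDocCollector. All names here are assumptions. Requires k >= 1.
final class SentinelSketch {

    // Strategy 1: pre-fill the min-heap with k sentinel entries.
    // The collect loop needs no size check, but k inserts are paid
    // up front, before the first hit is seen.
    static float[] topKWithSentinels(float[] scores, int k) {
        PriorityQueue<Float> pq = new PriorityQueue<>(k);
        for (int i = 0; i < k; i++) {
            pq.add(Float.NEGATIVE_INFINITY); // assumed sentinel value
        }
        for (float s : scores) {
            if (s > pq.peek()) { // the heap always holds exactly k entries
                pq.poll();
                pq.add(s);
            }
        }
        return drainDescending(pq);
    }

    // Strategy 2: no sentinels. The heap grows as hits arrive, at the
    // price of one extra size check per collected hit until it is full.
    static float[] topKLazy(float[] scores, int k) {
        PriorityQueue<Float> pq = new PriorityQueue<>();
        for (float s : scores) {
            if (pq.size() < k) {
                pq.add(s);
            } else if (s > pq.peek()) {
                pq.poll();
                pq.add(s);
            }
        }
        return drainDescending(pq);
    }

    // Empties the heap, drops any leftover sentinels, and returns the
    // remaining scores from highest to lowest.
    private static float[] drainDescending(PriorityQueue<Float> pq) {
        float[] ascending = new float[pq.size()];
        int n = 0;
        while (!pq.isEmpty()) {
            float s = pq.poll();
            if (s != Float.NEGATIVE_INFINITY) {
                ascending[n++] = s;
            }
        }
        float[] out = new float[n];
        for (int i = 0; i < n; i++) {
            out[i] = ascending[n - 1 - i];
        }
        return out;
    }
}
```

Both methods return the same top scores. They differ only in where the cost lands: the sentinel variant does k inserts before collection starts, while the lazy variant adds a branch to the collect loop. Whether that up-front cost actually matters in TopScoreDocCollector is the open question of this issue and would need benchmarking against the real HitQueue.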