vigyasharma commented on issue #12313: URL: https://github.com/apache/lucene/issues/12313#issuecomment-2107568883
> In another scenario, the results would just return the top doc and not repeat it. I believe this is what the parent-block join implementation for vector values does currently. Collected vector values are deduped within the `DiversifyingNearestChildrenKnnCollector`, and we only keep the top scoring doc per parent. > In one scenario, the search results would be the same as above, and the docs would repeat. So the idea is that query returns the same document multiple times, with different scores? I'd worry that we lose top-k slots to same document scores, which we'd probably want to aggregate per document anyway before using (similar to the parent-block join approach). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org