vigyasharma commented on issue #12313:
URL: https://github.com/apache/lucene/issues/12313#issuecomment-2107568883

   > In another scenario, the results would just return the top doc and not 
repeat it.
   
   I believe this is what the parent-block join implementation for vector 
values does currently. Collected vector values are deduped within the 
`DiversifyingNearestChildrenKnnCollector`, and we only keep the top scoring doc 
per parent.
   
   > In one scenario, the search results would be the same as above, and the 
docs would repeat.
   
   So the idea is that query returns the same document multiple times, with 
different scores? I'd worry that we lose top-k slots to same document scores, 
which we'd probably want to aggregate per document anyway before using (similar 
to the parent-block join approach).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to