[ 
https://issues.apache.org/jira/browse/LUCENE-6185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14279068#comment-14279068
 ] 

Michael McCandless commented on LUCENE-6185:
--------------------------------------------

Hmm one problem with TopDocs.merge is that it doesn't re-base the docIDs.  
Instead, it sets shardIndex for each hit.  I think this patch should sometimes 
fail tests, when newSearcher swaps in an ExecutorService?

TopDocs.merge does this so that you can merge across indices that sum to > 2.1B 
docs.  But in this usage, the number of docs will be < 2.1B ... so maybe we 
need an option to TopDocs.merge to rebase?  Or we rebase afterwards in 
IndexSearcher?

> Fix IndexSearcher with threads to not collect documents out of order
> --------------------------------------------------------------------
>
>                 Key: LUCENE-6185
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6185
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Assignee: Adrien Grand
>            Priority: Blocker
>             Fix For: 5.0, Trunk
>
>         Attachments: LUCENE-6185.patch
>
>
> When created with an executor, IndexSearcher searches all leaves in a 
> different task and eventually merges the results when all tasks are 
> completed. However, this merging logic involves a TopFieldCollector which is 
> collected out-of-order. I think it should just use TopDocs.merge?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to