[jira] [Commented] (SOLR-12291) Async prematurely reports completed status that causes severe shard loss

Mikhail Khludnev (Jira) Thu, 29 Aug 2019 02:50:57 -0700


    [ 
https://issues.apache.org/jira/browse/SOLR-12291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16918455#comment-16918455
 ]


Mikhail Khludnev commented on SOLR-12291:
-----------------------------------------

It does. I haven't thought about porting to 7.x. Would you like me to do so?   

> Async prematurely reports completed status that causes severe shard loss
> ------------------------------------------------------------------------
>
>                 Key: SOLR-12291
>                 URL: https://issues.apache.org/jira/browse/SOLR-12291
>             Project: Solr
>          Issue Type: Bug
>          Components: Backup/Restore, SolrCloud
>            Reporter: Varun Thacker
>            Assignee: Mikhail Khludnev
>            Priority: Major
>             Fix For: 8.1, master (9.0)
>
>         Attachments: SOLR-12291.patch, SOLR-12291.patch, SOLR-12291.patch, 
> SOLR-12291.patch, SOLR-12291.patch, SOLR-12291.patch, SOLR-12291.patch, 
> SOLR-122911.patch
>
>
> The OverseerCollectionMessageHandler sliceCmd assumes only one replica exists 
> on one node
> When multiple replicas of a slice are on the same node we only track one 
> replica's async request. This happens because the async requestMap's key is 
> "node_name"
> I discovered this when [~alabax] shared some logs of a restore issue, where 
> the second replica got added before the first replica had completed it's 
> restorecore action.
> While looking at the logs I noticed that the overseer never called 
> REQUESTSTATUS for the restorecore action , almost as if it had missed 
> tracking that particular async request.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-12291) Async prematurely reports completed status that causes severe shard loss

Reply via email to