[ 
https://issues.apache.org/jira/browse/SOLR-7819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shalin Shekhar Mangar updated SOLR-7819:
----------------------------------------
    Attachment: SOLR-7819.patch

Patch updated to trunk.

Thanks for the review [~andyetitmoves] and sorry for the delay in getting back 
to you.

bq. It goes ahead and does `publishDownState` still if `forcePublishState` is 
true, is that intentional?

Yes, because if the replica somehow became 'active' when the LIR state is still 
'down', we want to force publish its state again. The forcePublishState=true is 
only set in this one scenario.

bq. The caller does check for if the replica is live, but there could be a race. 
Similarly, if our state is suspect due to zk disconnect/session (the block 
before this), should the force be respected?

I think you're right. We should short-circuit the publishing part completely if 
the replica is not live or if our state is suspect.

This patch incorporates both of your review comments.
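
To make the intended behaviour concrete, here is a minimal sketch of the decision in plain Java. It is not the code from the patch: the class name, the method name and the parameters `replicaIsLive`, `ourStateIsSuspect` and `alreadyInLir` are hypothetical, and it assumes that a 'down' state is otherwise published only when the replica is first put into LIR. Only `forcePublishState` is taken from the discussion above.

```java
// Illustrative sketch only; NOT the actual ZkController code from SOLR-7819.patch.
// All names except forcePublishState are hypothetical.
final class LirPublishSketch {

    /**
     * Returns true if the leader should publish a 'down' state for the replica.
     *
     * @param replicaIsLive     whether the replica's node is currently live (assumed input)
     * @param ourStateIsSuspect whether our own view is suspect, e.g. after a ZK
     *                          disconnect or session expiry (assumed input)
     * @param alreadyInLir      whether the replica is already in LIR (assumed input)
     * @param forcePublishState set when the replica became 'active' while its LIR
     *                          state is still 'down'
     */
    public static boolean shouldPublishDownState(boolean replicaIsLive,
                                                 boolean ourStateIsSuspect,
                                                 boolean alreadyInLir,
                                                 boolean forcePublishState) {
        // Short-circuit: never publish for a dead replica or from a suspect
        // state, even if the caller asked to force the publish.
        if (!replicaIsLive || ourStateIsSuspect) {
            return false;
        }
        // Assumption: publish when first entering LIR, or when forced.
        return !alreadyInLir || forcePublishState;
    }

    public static void main(String[] args) {
        // Replica went 'active' while LIR state is 'down': force the publish.
        System.out.println(shouldPublishDownState(true, false, true, true));
        // Same scenario, but the replica is no longer live: skip the publish.
        System.out.println(shouldPublishDownState(false, false, true, true));
    }
}
```

The point of the ordering is that the liveness and suspect-state checks come before the force flag is consulted, so a race after the caller's own liveness check cannot lead to a forced publish.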


> ZkController.ensureReplicaInLeaderInitiatedRecovery does not respect 
> retryOnConnLoss
> ------------------------------------------------------------------------------------
>
>                 Key: SOLR-7819
>                 URL: https://issues.apache.org/jira/browse/SOLR-7819
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>    Affects Versions: 5.2, 5.2.1
>            Reporter: Shalin Shekhar Mangar
>              Labels: Jepsen
>             Fix For: Trunk, 5.4
>
>         Attachments: SOLR-7819.patch, SOLR-7819.patch, SOLR-7819.patch
>
>
> SOLR-7245 added a retryOnConnLoss parameter to 
> ZkController.ensureReplicaInLeaderInitiatedRecovery so that indexing threads 
> do not hang during a partition on ZK operations. However, some of those 
> changes were unintentionally reverted by SOLR-7336 in 5.2.
> I found this while running Jepsen tests on 5.2.1 where a hung update managed 
> to put a leader into a 'down' state (I'm still investigating and will open a 
> separate issue about this problem).



