Mano Kovacs created SOLR-11417:
----------------------------------

             Summary: Crashed leader's hanging emphemral will make restarting 
followers stuck in recovering
                 Key: SOLR-11417
                 URL: https://issues.apache.org/jira/browse/SOLR-11417
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
    Affects Versions: 6.3
            Reporter: Mano Kovacs


If replicas are starting up after leader crash and within the ZK session 
timeout, replicas
* will lose leader election due to hanging ephemerals
* will read stale data from ZK about current leader
* will fail recovery and stuck in recovering state

If leader is down permanently (eg. hardware failure) and all replicas are 
affected, shard will not come up (see also SOLR-7065).

Tested on 6.3. See attached image for details.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to