[jira] [Commented] (SOLR-6530) Commits under network partition can put any node in down state

ASF subversion and git services (JIRA) Thu, 02 Oct 2014 20:44:54 -0700

    [ 
https://issues.apache.org/jira/browse/SOLR-6530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14157648#comment-14157648
 ]


ASF subversion and git services commented on SOLR-6530:
-------------------------------------------------------

Commit 1629108 from sha...@apache.org in branch 'dev/branches/branch_5x'
[ https://svn.apache.org/r1629108 ]

SOLR-6530: Commits under network partitions can put any node in down state

> Commits under network partition can put any node in down state
> --------------------------------------------------------------
>
>                 Key: SOLR-6530
>                 URL: https://issues.apache.org/jira/browse/SOLR-6530
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Shalin Shekhar Mangar
>            Priority: Critical
>             Fix For: 5.0, Trunk
>
>         Attachments: SOLR-6530.patch, SOLR-6530.patch, SOLR-6530.patch, 
> SOLR-6530.patch, SOLR-6530.patch, SOLR-6530.patch, SOLR-6530.patch, 
> SOLR-6530.patch
>
>
> Commits are executed by any node in SolrCloud i.e. they're not routed via the 
> leader like other updates. 
> # Suppose there's 1 collection, 1 shard, 2 replicas (A and B) and A is the 
> leader
> # Suppose a commit request is made to node B during a time where B cannot 
> talk to A due to a partition for any reason (failing switch, heavy GC, 
> whatever)
> # B fails to distribute the commit to A (times out) and asks A to recover
> # This was okay earlier because a leader just ignores recovery requests but 
> with leader initiated recovery code, B puts A in the "down" state and A can 
> never get out of that state.
> tl;dr; During network partitions, if enough commit/optimize requests are sent 
> to the cluster, all the nodes in the cluster will eventually be marked as 
> "down".



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-6530) Commits under network partition can put any node in down state

Reply via email to