[ 
https://issues.apache.org/jira/browse/CASSANDRA-2603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis reassigned CASSANDRA-2603:
-----------------------------------------

    Assignee: Brandon Williams

> node stuck in 'Down' in nodetool ring, until disablegossip/enablegossip 
> flapped it back into submission
> -------------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-2603
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2603
>             Project: Cassandra
>          Issue Type: Bug
>    Affects Versions: 0.7.4
>            Reporter: Peter Schuller
>            Assignee: Brandon Williams
>
> Cluster with 0.7.4 and 9 machines. I was doing rolling restarts so nodes were 
> expected to have flappted up/down a bit.
> After cleanup, I noticed that one of the nodes 'nodetool ring' claimed that 
> another node was Down. I'll call the node that considered the *other* one to 
> be down "UpNode" and the node that was considered *down* "DownNode".
> DownNode was the next successor on the ring relative to UpNode. Only UpNode 
> thought it was down; all others members of the clusters agreed it was up. 
> This stayed the case for almost 24 hours.
> In system.log on UpNode, it is clearly visible that DownNode flapped to state 
> UP recently with no notification of flapping to state DOWN afterwards. Yet 
> 'nodetool ring' reported Down.
> Today, I did disablegossip+wait-for-a-bit+enablegossip on DownNode. This 
> caused 'nodetool ring' on UpNode to again reflect reality that DownNode is in 
> fact up.
> I do not have a reproducable test case but wanted to file it since I don't 
> remember seeing, and didn't easily find, a JIRA bug indicating a bug with 
> this effect has recently been fixed.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to