Re: Cluster not recovering when a single node dies
Sounds reasonable. Thanks. On Fri, May 27, 2011 at 7:12 PM, Jonathan Ellis wrote: > It does not. (Most failures are transient, so Cassandra doesn't > inflict the non-negligible performance impact of re-replicating a full > node's worth of data until you tell it "that guys' not coming back > this time.") > > On Fri, May 27, 2011 at 10:47 AM, Paul Loy wrote: > > I guess my next question is: the data should be complete somewhere in the > > ring with RF = 2. Does cassandra not redistribute the replication ring > > without a nodetool decommission call? > > > > On Fri, May 27, 2011 at 4:45 PM, Paul Loy wrote: > >> > >> ahh, thanks. > >> > >> On Fri, May 27, 2011 at 4:43 PM, Jonathan Ellis > wrote: > >>> > >>> Quorum of 2 is 2. You need at least RF=3 for quorum to tolerate losing > >>> a node indefinitely. > >>> > >>> On Fri, May 27, 2011 at 10:37 AM, Paul Loy > wrote: > >>> > We have a 4 node cluster with a replication factor of 2. When one > node > >>> > dies, > >>> > the other nodes throw UnavailableExceptions for quorum reads (as > >>> > expected > >>> > initially). They never get out of that state. > >>> > > >>> > Is there something we can do in nodetool to make the remaining nodes > >>> > function? > >>> > > >>> > Thanks. > >>> > > >>> > -- > >>> > - > >>> > Paul Loy > >>> > p...@keteracel.com > >>> > http://uk.linkedin.com/in/paulloy > >>> > > >>> > >>> > >>> > >>> -- > >>> Jonathan Ellis > >>> Project Chair, Apache Cassandra > >>> co-founder of DataStax, the source for professional Cassandra support > >>> http://www.datastax.com > >> > >> > >> > >> -- > >> - > >> Paul Loy > >> p...@keteracel.com > >> http://uk.linkedin.com/in/paulloy > > > > > > > > -- > > - > > Paul Loy > > p...@keteracel.com > > http://uk.linkedin.com/in/paulloy > > > > > > -- > Jonathan Ellis > Project Chair, Apache Cassandra > co-founder of DataStax, the source for professional Cassandra support > http://www.datastax.com > -- - Paul Loy p...@keteracel.com http://uk.linkedin.com/in/paulloy
Re: Cluster not recovering when a single node dies
It does not. (Most failures are transient, so Cassandra doesn't inflict the non-negligible performance impact of re-replicating a full node's worth of data until you tell it "that guys' not coming back this time.") On Fri, May 27, 2011 at 10:47 AM, Paul Loy wrote: > I guess my next question is: the data should be complete somewhere in the > ring with RF = 2. Does cassandra not redistribute the replication ring > without a nodetool decommission call? > > On Fri, May 27, 2011 at 4:45 PM, Paul Loy wrote: >> >> ahh, thanks. >> >> On Fri, May 27, 2011 at 4:43 PM, Jonathan Ellis wrote: >>> >>> Quorum of 2 is 2. You need at least RF=3 for quorum to tolerate losing >>> a node indefinitely. >>> >>> On Fri, May 27, 2011 at 10:37 AM, Paul Loy wrote: >>> > We have a 4 node cluster with a replication factor of 2. When one node >>> > dies, >>> > the other nodes throw UnavailableExceptions for quorum reads (as >>> > expected >>> > initially). They never get out of that state. >>> > >>> > Is there something we can do in nodetool to make the remaining nodes >>> > function? >>> > >>> > Thanks. >>> > >>> > -- >>> > - >>> > Paul Loy >>> > p...@keteracel.com >>> > http://uk.linkedin.com/in/paulloy >>> > >>> >>> >>> >>> -- >>> Jonathan Ellis >>> Project Chair, Apache Cassandra >>> co-founder of DataStax, the source for professional Cassandra support >>> http://www.datastax.com >> >> >> >> -- >> - >> Paul Loy >> p...@keteracel.com >> http://uk.linkedin.com/in/paulloy > > > > -- > - > Paul Loy > p...@keteracel.com > http://uk.linkedin.com/in/paulloy > -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of DataStax, the source for professional Cassandra support http://www.datastax.com
Re: Cluster not recovering when a single node dies
I guess my next question is: the data should be complete somewhere in the ring with RF = 2. Does cassandra not redistribute the replication ring without a nodetool decommission call? On Fri, May 27, 2011 at 4:45 PM, Paul Loy wrote: > ahh, thanks. > > On Fri, May 27, 2011 at 4:43 PM, Jonathan Ellis wrote: > >> Quorum of 2 is 2. You need at least RF=3 for quorum to tolerate losing >> a node indefinitely. >> >> On Fri, May 27, 2011 at 10:37 AM, Paul Loy wrote: >> > We have a 4 node cluster with a replication factor of 2. When one node >> dies, >> > the other nodes throw UnavailableExceptions for quorum reads (as >> expected >> > initially). They never get out of that state. >> > >> > Is there something we can do in nodetool to make the remaining nodes >> > function? >> > >> > Thanks. >> > >> > -- >> > - >> > Paul Loy >> > p...@keteracel.com >> > http://uk.linkedin.com/in/paulloy >> > >> >> >> >> -- >> Jonathan Ellis >> Project Chair, Apache Cassandra >> co-founder of DataStax, the source for professional Cassandra support >> http://www.datastax.com >> > > > > -- > - > Paul Loy > p...@keteracel.com > http://uk.linkedin.com/in/paulloy > -- - Paul Loy p...@keteracel.com http://uk.linkedin.com/in/paulloy
Re: Cluster not recovering when a single node dies
ahh, thanks. On Fri, May 27, 2011 at 4:43 PM, Jonathan Ellis wrote: > Quorum of 2 is 2. You need at least RF=3 for quorum to tolerate losing > a node indefinitely. > > On Fri, May 27, 2011 at 10:37 AM, Paul Loy wrote: > > We have a 4 node cluster with a replication factor of 2. When one node > dies, > > the other nodes throw UnavailableExceptions for quorum reads (as expected > > initially). They never get out of that state. > > > > Is there something we can do in nodetool to make the remaining nodes > > function? > > > > Thanks. > > > > -- > > - > > Paul Loy > > p...@keteracel.com > > http://uk.linkedin.com/in/paulloy > > > > > > -- > Jonathan Ellis > Project Chair, Apache Cassandra > co-founder of DataStax, the source for professional Cassandra support > http://www.datastax.com > -- - Paul Loy p...@keteracel.com http://uk.linkedin.com/in/paulloy
Re: Cluster not recovering when a single node dies
Quorum of 2 is 2. You need at least RF=3 for quorum to tolerate losing a node indefinitely. On Fri, May 27, 2011 at 10:37 AM, Paul Loy wrote: > We have a 4 node cluster with a replication factor of 2. When one node dies, > the other nodes throw UnavailableExceptions for quorum reads (as expected > initially). They never get out of that state. > > Is there something we can do in nodetool to make the remaining nodes > function? > > Thanks. > > -- > - > Paul Loy > p...@keteracel.com > http://uk.linkedin.com/in/paulloy > -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of DataStax, the source for professional Cassandra support http://www.datastax.com
Cluster not recovering when a single node dies
We have a 4 node cluster with a replication factor of 2. When one node dies, the other nodes throw UnavailableExceptions for quorum reads (as expected initially). They never get out of that state. Is there something we can do in nodetool to make the remaining nodes function? Thanks. -- - Paul Loy p...@keteracel.com http://uk.linkedin.com/in/paulloy