On 6/16/2011 10:58 AM, Dan Hendry wrote:
> I think this would add a lot of complexity behind the scenes and be
> conceptually confusing, particularly for new users.
I'm not so sure about this. Cass is already somewhat sophisticated, and
I don't see how this could trip up anyone who can already grasp the
basics. The only thing I am adding to the CL concept is the distinction
between available replica nodes and total replica nodes. But don't
forget: a competitor to Cass is probably in the works this very minute,
so constant improvement is a good thing.
> The Cassandra consistency model is pretty elegant and this type of approach
> breaks that elegance in many ways. It would also only really be useful when the
> value has a high probability of being updated between a node going down and the
> value being read.
I'm not sure what you mean. A node can be down for days, during which
time the value can be updated. The intention is to use whatever nodes
are available, even if their number falls below the RF. If there is only
one node available for accepting a replica, that should be enough given
the conditions I stated and have updated below.
> Perhaps the simpler approach which is fairly trivial and does not require any
> Cassandra change is to simply downgrade your read from ALL to QUORUM when you
> get an unavailable exception for this particular read.
It's not so trivial, especially since you would have to build that into
your client at many levels. I think it would be more appropriate (if
this idea survives) to put it into Cass itself.
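For what it's worth, here is roughly what that client-side fallback
would look like. The CassandraClient/Row/UnavailableException names are
made-up stand-ins, not any real client API, so treat it as a sketch:

enum ConsistencyLevel { ONE, QUORUM, ALL }

class UnavailableException extends Exception {}

class Row {}

interface CassandraClient {
    // Hypothetical read call; real client libraries differ.
    Row read(String key, ConsistencyLevel cl) throws UnavailableException;
}

class DowngradingReader {
    private final CassandraClient client;

    DowngradingReader(CassandraClient client) { this.client = client; }

    Row readAllOrQuorum(String key) throws UnavailableException {
        try {
            return client.read(key, ConsistencyLevel.ALL);
        } catch (UnavailableException e) {
            // Not enough replicas for ALL; retry at QUORUM. Note this
            // weakens the guarantee exactly when a replica is down,
            // which is my whole objection to doing it client-side.
            return client.read(key, ConsistencyLevel.QUORUM);
        }
    }
}

And every client, at every call site that needs this, has to carry that
logic around.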
> I think the general answer for 'maximum consistency' is QUORUM reads/writes.
> Based on the fact you are using CL=ALL for reads I assume you are using CL=ONE
> for writes: this itself strikes me as a bad idea if you require 'maximum
> consistency for one critical operation'.
Very true. Specifying QUORUM for BOTH reads and writes provides the 100%
consistency because the read and write replica sets must overlap. But
only if the number of available nodes is not less than the RF.
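To make the overlap concrete (this is just the standard quorum
arithmetic, nothing new): with RF = 3, QUORUM = floor(3/2) + 1 = 2. A
quorum write lands on at least 2 of the 3 replicas and a quorum read
consults at least 2; since 2 + 2 > 3, the two sets must share at least
one replica, so the read always sees the latest write.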
Upon further reflection, this idea can be applied to any consistency
level. The general thrust of my argument is: if a particular value can
be overwritten by one process regardless of its prior value, then that
implies that the value on the down node is no longer up to date and can
be disregarded. Just work with the nodes that are available.
Actually, now that I think about it...
ALL_AVAIL guarantees 100% consistency iff the latest timestamp of the
value is greater than the latest unavailability time of every
unavailable replica node for that value's row key. "Unavailable" means a
node whose Cass process is not reachable from ANY node in the cluster in
the same data center. If the node in question is reachable from at least
one node, then the read should fail, since there is a possibility that
the value could have been updated some other way.
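Expressed as code, the rule is something like the following. The
FailureDetector interface and its two methods are my own stand-ins for
whatever gossip/failure-detection state Cass actually keeps, so this is
a sketch of the rule, not of the real internals:

import java.util.List;

class AllAvailCheck {
    // Hypothetical view onto the failure detector; not Cassandra's API.
    interface FailureDetector {
        boolean isReachableFromAnyNode(String node); // same-DC reachability
        long downSinceMillis(String node);           // when it became unavailable
    }

    // Safe to serve the read from live replicas only if every dead
    // replica went down before the value's latest write: whatever a
    // dead replica holds is then known to be stale.
    static boolean safeToServe(long latestValueTimestamp,
                               List<String> unavailableReplicas,
                               FailureDetector fd) {
        for (String node : unavailableReplicas) {
            if (fd.isReachableFromAnyNode(node)) {
                // Some node can still reach it, so it may have taken
                // writes we can't see; the read must fail.
                return false;
            }
            if (latestValueTimestamp <= fd.downSinceMillis(node)) {
                // The value has not been overwritten since this replica
                // went down; its copy might still be the newest.
                return false;
            }
        }
        return true;
    }
}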
After looking at the code, it doesn't look like it will be difficult.
Instead of skipping the request when fewer than CL nodes are available,
it would go ahead and request the values from the available nodes as
usual, then look at the timestamps (which it does anyway) and compare
them to the latest unavailability times of the relevant replica nodes.
The code that keeps track of which nodes are down would simply record
the time each went down. But I've only been looking at the code for a
few days, so I'm not claiming to know everything by any stretch.
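So the read path change would look roughly like this. Again a sketch:
VersionedValue and the reconciliation step stand in for what Cass
already does by timestamp, and it reuses the FailureDetector stand-in
from above rather than any real internal class:

import java.util.Comparator;
import java.util.List;

class ReadPathSketch {
    // Minimal versioned value; replicas are reconciled by timestamp.
    record VersionedValue(byte[] value, long timestamp) {}

    static VersionedValue readAllAvail(List<VersionedValue> liveResponses,
                                       List<String> deadReplicas,
                                       AllAvailCheck.FailureDetector fd) {
        // 1. Query the live replicas as usual and keep the newest value.
        VersionedValue newest = liveResponses.stream()
                .max(Comparator.comparingLong(VersionedValue::timestamp))
                .orElseThrow();
        // 2. Instead of failing up front because live < CL, fail only if
        //    some dead replica could still hold a newer version.
        if (!AllAvailCheck.safeToServe(newest.timestamp(), deadReplicas, fd)) {
            throw new IllegalStateException(
                    "a down replica may hold a newer value; read must fail");
        }
        return newest;
    }
}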
Dan, thanks for your reply. I still welcome critiques.