Re: Problem with deleting keys

Nico Meyer Thu, 16 Jun 2011 02:58:56 -0700

Hello David,

this behaviour is quite expected if you think about how Riak works.

Assuming you use the default replication factor of n=3, each key isstored on all of your three nodes. If you delete a key while one node(let's call it A) is down, the key is deleted from the two nodes thatare still up (let's call them B and C), and remains on the downed node A.Once node A is up again, the situation is indistinguishable from B and Chaving a hard drive crash and loosing all their data, in that A has thekey and B and C know nothing about it.

If you do a GET of the deleted key at this point, the result depends onthe r-value that you choose. For r>1 you will get a not_found on thefirst get. For r=1 you might get the data or a not_found, depending onwhich two nodes answer first (seehttps://issues.basho.com/show_bug.cgi?id=992 about basic quorum for anexplanation). Also, at that point read repair will kick in andre-replicate the key to all nodes, so subsequent GETs will always returnthe original datum.

listing keys on the other hand does not use quorum but just does a setunion of all keys of all the nodes in you cluster. Essentially it isequivalent to r=1 without basic quorum. The same is true for map/reducequeries to my knowledge

The essential problem is that a real physical delete isindistinguishable from data loss (or never having had the data in thefirst place), while those two things are logically different.If you want to be sure that a key is deleted with all its replicas youmust delete it with a write quorum setting of w=n. Also you need to tellRiak not to count fallback vnodes toward you write quorum. This featureis quite new and I believe only available in the head revision. Also Iforgot the name of the parameter and don't know if it is even applicablefor DELETEs.Anyhow, if you do all this, your DELETEs will simply fail if any of thenodes that has a copy of the key is down (so in your case, if any nodeis down).

If you only want to logically delete, and don't care about freeing thedisk space and RAM that is used by the key, you should use a specialvalue, which is interpreted by your application as a not found. That wayyou also get proper conflict resolution between DELETEs and PUTs (sayone client deletes a key while another one updates it).


Cheers,
Nico

Am 16.06.2011 00:55, schrieb David Mitchell:

Erlang: R13B04

Riak: 0.14.2
I have a three node cluster, and while one node was down, I deletedevery key in a certain bucket. Then, I started the node that wasdown, and it joined the cluster.
Now, when do a listing on these keys in this bucket, and I get theentire list. I can also get the values of the bucket. However, whenI try to delete the keys, the keys are not deleted.
Can anyone help me get the nodes back in a consistent state? I havetried restarting the nodes.
David


_______________________________________________
riak-users mailing list
[email protected]
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

_______________________________________________
riak-users mailing list
[email protected]
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Re: Problem with deleting keys

Reply via email to