Alex Stapleton wrote:
As Steve says in his follow-up, we also need to handle collisions while creating the continuum. Simply incrementing the point value by 1 until we find an empty place is going to give us a rather biased continuum. I think that incrementing the point number by 1 and recalculating the point value from that should give a better distribution while being fairly easy to actually implement.
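
For concreteness, here is a minimal sketch of the increment-and-rehash idea Alex describes, assuming an MD5-based point function; point_for, insert_points, and the 10-points-per-node default are illustrative names, not from any particular implementation:

    import hashlib

    def point_for(node, i):
        # Hash the node name plus a point index to a 32-bit continuum position.
        digest = hashlib.md5(f"{node}-{i}".encode()).digest()
        return int.from_bytes(digest[:4], "big")

    def insert_points(continuum, node, points_per_node=10):
        # Alex's scheme: on a collision, advance the point index and
        # recompute the point value, rather than nudging the value by 1.
        placed, i = 0, 0
        while placed < points_per_node:
            p = point_for(node, i)
            i += 1
            if p in continuum:
                continue  # bucket taken: recompute from the next point index
            continuum[p] = node
            placed += 1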

I'm not a big fan of that idea, because an implementer would have to be very careful to avoid insertion-order bugs. You need the mapping from a key to a host to be the same whether the host list was initialized as (A,B,C,D) or as (A,B,D) with C added later, and any kind of "oops, that bucket is taken already, do something else with the node I'm trying to insert" algorithm can, unless it's implemented carefully, give you a subtly different data structure in those two cases.
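
To see the divergence concretely, here is a toy demonstration that deliberately uses a tiny hash space so collisions actually occur; with naive "bump the value by 1 until free" resolution, two insertion orders of the same node set can produce different continuums (the function names and parameters are made up for the demo):

    import hashlib

    def point_for(node, i, space=64):
        # Tiny hash space so the demo actually collides.
        digest = hashlib.md5(f"{node}-{i}".encode()).digest()
        return int.from_bytes(digest[:4], "big") % space

    def build_naive(nodes, points_per_node=4, space=64):
        # Naive resolution: bump a colliding point value by 1 until free.
        continuum = {}
        for node in nodes:
            for i in range(points_per_node):
                p = point_for(node, i, space)
                while p in continuum:
                    p = (p + 1) % space
                continuum[p] = node
        return continuum

    # Same node set, two insertion orders:
    print(build_naive(["A", "B", "C", "D"]) == build_naive(["D", "C", "B", "A"]))
    # May print False: the final structure depends on who grabbed each slot first.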

I'm much more in favor of making "n" large enough that it's reasonable to say something like, "Oops, that bucket is taken already; overwrite it with me if the address of the node it currently points to is numerically less than mine, else don't insert myself here." Then you will deterministically end up with exactly the same search tree no matter what order the nodes are inserted. If you are inserting each node 10 times in a tree and your search space is a full 32 bits, the occasional node that's inserted 9 times instead is going to be basically irrelevant to the health of the system. Not perfectly balanced, sure, but there will almost never be any collisions to begin with and it isn't worth optimizing such a rare edge case.
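
A sketch of that tie-break, using string comparison on the node name as a stand-in for "numerically less" on the address; each bucket simply keeps the maximum of its contenders, so the result is the same for every insertion order:

    import hashlib
    import itertools

    def point_for(node, i):
        digest = hashlib.md5(f"{node}-{i}".encode()).digest()
        return int.from_bytes(digest[:4], "big")

    def build_deterministic(nodes, points_per_node=10):
        continuum = {}
        for node in nodes:
            for i in range(points_per_node):
                p = point_for(node, i)
                # The "larger" node wins the bucket; the loser just ends up
                # with one fewer point, which is harmless in a 32-bit space.
                if p not in continuum or continuum[p] < node:
                    continuum[p] = node
        return continuum

    base = build_deterministic(["A", "B", "C", "D"])
    assert all(build_deterministic(list(order)) == base
               for order in itertools.permutations(["A", "B", "C", "D"]))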

-Steve
