Konstantin Shvachko wrote:
And the only remaining step is to implement a fail-over mechanism.
:)
Colleagues of mine work on the HA stuff; I try to steer clear of it as it
gets complex fast. Test case: what happens when a network failure
splits the datacentre in two? You now have two clusters, each with half
the data and possibly a primary/secondary master in each one. Leave the
partition up for a while, perform inconsistent operations on each side,
then have the network come back up. Then work out how to merge the state.
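Merging the divergent state afterwards is the hard part; the usual dodge is to
stop the divergence happening in the first place with fencing tokens/epoch
numbers, so the stale master gets its writes rejected once a newer master has
registered. A rough sketch of that idea below -- plain Java, not anything from
HDFS, all the names are made up:

import java.util.concurrent.atomic.AtomicLong;

public class FencedStore {
    // highest master epoch this store has ever seen
    private final AtomicLong highestEpochSeen = new AtomicLong(0);

    /** Called by a newly elected master; returns its fencing token. */
    public long registerMaster(long proposedEpoch) {
        long current;
        do {
            current = highestEpochSeen.get();
            if (proposedEpoch <= current) {
                throw new IllegalStateException(
                    "Epoch " + proposedEpoch + " is stale; current is " + current);
            }
        } while (!highestEpochSeen.compareAndSet(current, proposedEpoch));
        return proposedEpoch;
    }

    /** Every mutation carries the caller's epoch; stale masters are fenced off. */
    public void write(long epoch, String op) {
        if (epoch < highestEpochSeen.get()) {
            throw new IllegalStateException(
                "Rejecting '" + op + "': master with epoch " + epoch + " has been fenced");
        }
        System.out.println("Applied '" + op + "' at epoch " + epoch);
    }

    public static void main(String[] args) {
        FencedStore store = new FencedStore();
        long oldMaster = store.registerMaster(1);   // master on one side of the split
        long newMaster = store.registerMaster(2);   // master elected on the other side
        store.write(newMaster, "mkdir /a");         // accepted
        try {
            store.write(oldMaster, "mkdir /b");     // rejected: old master is fenced
        } catch (IllegalStateException fenced) {
            System.out.println(fenced.getMessage());
        }
    }
}

It only stops both sides committing to the same store, of course; it doesn't
help you reconcile whatever each side did locally during the partition.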
Looking at the Facebook/Google "multi-master" solutions, I think they
don't worry about consistency; they just let the masters drift apart.
See also Johan's recent talk on HDFS: http://www.slideshare.net/steve_l/hdfs