Touretsky, Gregory wrote:
> Hi,
> Does anyone have experience running an HDFS cluster stretched over
> high-latency WAN connections?
> Any specific concerns/options/recommendations?
> I'm trying to set up an HDFS cluster with nodes located in the US, Israel
> and India - considering it as a potential solution for cross-site data
> sharing...
I would back up Todd here and say "don't do it - yet". I think there are
some minor placeholders in the rack hierarchy for an explicit notion
of different sites, but nobody has done the work yet. Cross-datacentre
data balancing and work scheduling is complex, and all the code in
Hadoop, ZooKeeper, etc. is built on the assumption that latency is low,
all machines' clocks are moving forward at roughly the same rate, the
network is fairly reliable, routers are unlikely to corrupt data, and so on.
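For context, the rack hierarchy mentioned above is normally fed to Hadoop through a user-supplied topology script (configured via `net.topology.script.file.name`): the framework invokes it with hostnames/IPs and reads back one network path per argument. A minimal sketch of a script that adds a site level above the rack, assuming a purely hypothetical hostname convention like `us-rack1-node3`:

```python
#!/usr/bin/env python
# Sketch of a Hadoop topology script. Hadoop calls it with one or more
# hostnames/IPs as arguments and expects one slash-separated network
# path per argument on stdout. The "<site>-<rack>-<node>" naming
# convention here is an assumption for illustration, not a Hadoop rule.
import sys

def to_path(host):
    parts = host.split("-")
    if len(parts) >= 3:
        site, rack = parts[0], parts[1]
        return "/%s/%s" % (site, rack)    # e.g. /us/rack1
    return "/default-rack"                # fallback for unrecognised names

if __name__ == "__main__":
    print(" ".join(to_path(h) for h in sys.argv[1:]))
```

Note that stock Hadoop treats every level of this path the same way; nothing in the schedulers or the balancer yet interprets the top level as "a different site over a WAN", which is exactly the missing work.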
Now, if you do want to do more than one site, it would be a profound and
useful development - I'd expect the MR scheduler, or even the Pig/Hive code
generators, to take datacentre locality into account, doing as much
work per site as possible. The problem of block distribution changes
too, as you would want one copy of each block in the other datacentre.
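The changed placement problem could be sketched roughly as follows - with replication factor 3, keep two replicas in the writer's site and push one to a remote site. This is an illustrative toy, not the stock HDFS `BlockPlacementPolicy`; the function and site names are invented for the example:

```python
# Hypothetical cross-site replica placement: two replicas local to the
# writer, one in a different datacentre. Illustrative only - real HDFS
# placement also weighs rack spread, node load and free space.
import random

def place_replicas(writer_site, sites, replication=3):
    """sites: dict mapping site name -> list of node names in that site."""
    local_nodes = sites[writer_site]
    remote_sites = [s for s in sites if s != writer_site]
    # Two replicas on distinct nodes in the writer's own site.
    chosen = random.sample(local_nodes, min(2, len(local_nodes)))
    # One replica in another datacentre, if one exists.
    if remote_sites and len(chosen) < replication:
        remote = random.choice(remote_sites)
        chosen.append(random.choice(sites[remote]))
    return chosen

# Example topology (invented names):
sites = {"us": ["us-1", "us-2", "us-3"], "il": ["il-1", "il-2"]}
```

Even this toy makes the cost visible: every block write now pays a WAN round trip for its third replica, which is why latency and link reliability dominate the design.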
Even then, I'd start with sites in a single city, on a MAE or other link
where bandwidth matters less. Note that (as discussed below) at the MAN
scale, things can start to go wrong in ways that are unlikely within a
single datacentre, and it's those failures that will burn you.
Worth reading:
http://status.aws.amazon.com/s3-20080720.html
http://www.allthingsdistributed.com/2008/12/eventually_consistent.html
-Steve