Using HDFS to replicate data is problematic because the map-reduce programs will tend to send data across the link between the data centers (and a cluster can generate a LOT of traffic).
My guess is that you want two clusters with a scripted mirroring capability. On 3/13/08 2:05 PM, "Cagdas Gerede" <[EMAIL PROTECTED]> wrote: >> If your data center fails, then you probably have to worry more about how to >> get your data. > > I assume having multiple data centers. I know thanks to HDFS > replication data in the other data center will be enough. > However, as much as I see for now, HDFS has no support for replication > of namenode. > Is this true? > If there is no automated support, and If I need to do this replication > with some custom code or manual intervention, > what are the steps to do this replication? > > Any help is appreciated. > > Cagdas