Using HDFS replication to mirror data between data centers is problematic
because map-reduce programs will tend to send data across the link between
them (and a cluster can generate a LOT of traffic).

My guess is that you want two clusters with a scripted mirroring capability.
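
As a rough sketch of what scripted mirroring between two clusters could look
like, the Python below builds and optionally runs one "hadoop distcp -update"
job per directory. The namenode addresses, the paths, and the helper names
(build_mirror_command, mirror) are placeholders made up for illustration, and
it assumes the hadoop command is on the PATH of the machine running the script.
Treat it as a starting point, not a tested recipe.

```python
import subprocess


def build_mirror_command(src_namenode, dst_namenode, path, update=True):
    """Return the argv list for one distcp run that copies `path`
    from the source cluster to the same path on the destination cluster.

    src_namenode / dst_namenode are "host:port" strings (hypothetical values).
    """
    cmd = ["hadoop", "distcp"]
    if update:
        # -update asks distcp to skip files that already match on the target
        cmd.append("-update")
    cmd.append(f"hdfs://{src_namenode}{path}")
    cmd.append(f"hdfs://{dst_namenode}{path}")
    return cmd


def mirror(src_namenode, dst_namenode, paths, dry_run=True):
    """Mirror each path in `paths`; with dry_run=True only print the commands."""
    commands = [build_mirror_command(src_namenode, dst_namenode, p) for p in paths]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True raises CalledProcessError if distcp exits non-zero
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Placeholder namenodes and directories; replace with real ones.
    mirror("nn-a.example.com:8020", "nn-b.example.com:8020",
           ["/data/logs", "/data/tables"], dry_run=True)
```

Run from cron (or similar) on a schedule that suits how stale the second
cluster is allowed to be. The copy is then one bulk transfer you control,
rather than ordinary job traffic crossing the link between the data centers.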


On 3/13/08 2:05 PM, "Cagdas Gerede" <[EMAIL PROTECTED]> wrote:

>> If your data center fails, then you probably have to worry more about how to
>> get your data.
> 
> I assume I have multiple data centers. I know that, thanks to HDFS
> replication, the data in the other data center will be enough.
> However, as far as I can see for now, HDFS has no support for replication
> of the namenode.
> Is this true?
> If there is no automated support, and if I need to do this replication
> with some custom code or manual intervention,
> what are the steps to do this replication?
> 
> Any help is appreciated.
> 
> Cagdas
