DistCp is the standard way to copy data between clusters. What it does is run a mapreduce job to copy data between a source cluster and a destination cluster. See http://hadoop.apache.org/core/docs/r0.19.1/distcp.html
On Mon, Apr 6, 2009 at 9:49 PM, Mithila Nagendra <mnage...@asu.edu> wrote: > Hey all > I'm trying to connect two separate Hadoop clusters. Is it possible to do > so? > I need data to be shuttled back and forth between the two clusters. Any > suggestions? > > Thank you! > Mithila Nagendra > Arizona State University >