I've run into an interesting problem syncing a couple of clusters using distcp. We've validated that it works from our remote cluster to a local installation. I suspect our firewalls may be responsible for the problem we're experiencing. We're using ports 9000, 9001 and 50010. I've verified that all three ports are reachable on the namenodes and datanodes in both directions. Is there something else we're missing?
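
For anyone who wants to repeat the port check, here is a minimal sketch of the kind of test I mean. It is not the exact procedure we used. The function names are made up for illustration, and only the ports (9000, 9001, 50010) and the hosts hnn1 and hnn2 come from our setup; datanode hostnames would need to be added to the list. It only tells you whether a TCP connection can be opened from the machine it runs on, so it would have to be run from every node that executes distcp map tasks, since those tasks connect to the remote datanodes directly.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def check_ports(hosts, ports, timeout=3.0):
    """Map every (host, port) pair to True (reachable) or False (not reachable)."""
    return {
        (host, port): port_is_open(host, port, timeout)
        for host in hosts
        for port in ports
    }
```

Calling check_ports(["hnn1", "hnn2"], [9000, 9001, 50010]) on a given node returns a dictionary keyed by (host, port), so any pair mapped to False points at a path the firewall (or name resolution) may be blocking from that node.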

It looks like it gets to 80% before it fails. Here's what we're seeing:

# u...@hnn1:~$ hadoop distcp hdfs://hnn1:9000/user/testing hdfs://hnn2:9000/user

10/12/03 15:58:10 INFO tools.DistCp: srcPaths=[hdfs://hnn1:9000/user/testing]

10/12/03 15:58:10 INFO tools.DistCp: destPath=hdfs://hnn2:9000/user

10/12/03 15:58:11 INFO tools.DistCp: srcCount=6

10/12/03 15:58:11 INFO mapred.JobClient: Running job: job_201011221457_0019

10/12/03 15:58:12 INFO mapred.JobClient:  map 0% reduce 0%

10/12/03 15:58:36 INFO mapred.JobClient:  map 19% reduce 0%

10/12/03 15:58:45 INFO mapred.JobClient:  map 39% reduce 0%

10/12/03 15:59:03 INFO mapred.JobClient:  map 60% reduce 0%

10/12/03 15:59:12 INFO mapred.JobClient:  map 80% reduce 0%

10/12/03 15:59:32 INFO mapred.JobClient: Task Id : attempt_201011221457_0019_m_000000_0, Status : FAILED

java.io.IOException: Copied: 0 Skipped: 0 Failed: 5

        at org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)

        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)

        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)

        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)

        at org.apache.hadoop.mapred.Child.main(Child.java:170)

10/12/03 15:59:33 INFO mapred.JobClient:  map 0% reduce 0%

10/12/03 15:59:55 INFO mapred.JobClient:  map 19% reduce 0%

10/12/03 16:00:04 INFO mapred.JobClient:  map 39% reduce 0%

10/12/03 16:00:22 INFO mapred.JobClient:  map 60% reduce 0%

10/12/03 16:00:31 INFO mapred.JobClient:  map 80% reduce 0%

10/12/03 16:00:51 INFO mapred.JobClient: Task Id : attempt_201011221457_0019_m_000000_1, Status : FAILED

java.io.IOException: Copied: 0 Skipped: 0 Failed: 5

        at org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)

        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)

        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)

        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)

        at org.apache.hadoop.mapred.Child.main(Child.java:170)

Thanks!
