distcp returns success but does not copy files due to connection problem. Error
is logged on target HDFS log directory
----------------------------------------------------------------------------------------------------------------------
Key: MAPREDUCE-684
URL: https://issues.apache.org/jira/browse/MAPREDUCE-684
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: distcp
Affects Versions: 0.20.1
Reporter: Suhas Gogate
Distcp returns success even though files are not copied due to connection
problem. It creates empty directory structure on the target and log the error
message on the target HDFS log directory.
distcp command is run on hadoop 20 fetching data from hadoop 18 cluster.
-bash-3.1$ hadoop distcp -Dmapred.job.queue.name=xxxx -i -p -update -delete
hftp://xxx.mydomain.com:50070/user/gogate/mirror_test2
hdfs://yyy.mydomain.com:8020/user/gogate/mirror_test2'
09/06/30 18:41:29 INFO tools.DistCp:
srcPaths=[hftp://xxx.mydomain.com:50070/user/gogate/mirror_test2]
09/06/30 18:41:29 INFO tools.DistCp:
destPath=hdfs://yyy.mydomain.com:8020/user/gogate/mirror_test2
09/06/30 18:41:30 INFO tools.DistCp:
hdfs://yyy.mydomain.com:8020/user/gogate/mirror_test2 does not exist.
09/06/30 18:41:30 INFO tools.DistCp: srcCount=4
09/06/30 18:41:36 INFO mapred.JobClient: Running job: job_200906290541_3336
09/06/30 18:41:37 INFO mapred.JobClient: map 0% reduce 0%
09/06/30 18:43:05 INFO mapred.JobClient: map 100% reduce 0%
09/06/30 18:43:28 INFO mapred.JobClient: Job complete: job_200906290541_3336
echo $?
09/06/30 18:43:35 INFO mapred.JobClient: Counters: 8
09/06/30 18:43:35 INFO mapred.JobClient: Job Counters
09/06/30 18:43:35 INFO mapred.JobClient: Launched map tasks=1
09/06/30 18:43:35 INFO mapred.JobClient: FileSystemCounters
09/06/30 18:43:35 INFO mapred.JobClient: HDFS_BYTES_READ=534
09/06/30 18:43:35 INFO mapred.JobClient: HDFS_BYTES_WRITTEN=3655
09/06/30 18:43:35 INFO mapred.JobClient: distcp
09/06/30 18:43:35 INFO mapred.JobClient: Files failed=2
09/06/30 18:43:35 INFO mapred.JobClient: Map-Reduce Framework
09/06/30 18:43:35 INFO mapred.JobClient: Map input records=3
09/06/30 18:43:35 INFO mapred.JobClient: Spilled Records=0
09/06/30 18:43:35 INFO mapred.JobClient: Map input bytes=434
09/06/30 18:43:35 INFO mapred.JobClient: Map output records=2
-bash-3.1$ echo $?
0
target HDFS log directory message.
-bash-3.1$ hadoop fs -cat /user/gogate/_distcp_logs_f7twl9/part-00000
FAIL pig_1245890239320.log : java.net.ConnectException: Connection refused
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.Socket.connect(Socket.java:519)
at java.net.Socket.connect(Socket.java:469)
at sun.net.NetworkClient.doConnect(NetworkClient.java:157)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:394)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:529)
at sun.net.www.http.HttpClient.<init>(HttpClient.java:233)
at sun.net.www.http.HttpClient.New(HttpClient.java:306)
at sun.net.www.http.HttpClient.New(HttpClient.java:323)
at
sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:788)
at
sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:729)
at
sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:654)
at
sun.net.www.protocol.http.HttpURLConnection.followRedirect(HttpURLConnection.java:1868)
at
sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1172)
at org.apache.hadoop.hdfs.HftpFileSystem.open(HftpFileSystem.java:142)
at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:351)
at org.apache.hadoop.tools.DistCp$CopyFilesMapper.copy(DistCp.java:410)
at org.apache.hadoop.tools.DistCp$CopyFilesMapper.map(DistCp.java:537)
at org.apache.hadoop.tools.DistCp$CopyFilesMapper.map(DistCp.java:306)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:356)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.Child.main(Child.java:170)
FAIL dir1/xxx.pig : java.net.ConnectException: Connection refused
at java.net.PlainSocketImpl.socketConnect(Native Method)
at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
at java.net.Socket.connect(Socket.java:519)
at java.net.Socket.connect(Socket.java:469)
at sun.net.NetworkClient.doConnect(NetworkClient.java:157)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:394)
at sun.net.www.http.HttpClient.openServer(HttpClient.java:529)
at sun.net.www.http.HttpClient.<init>(HttpClient.java:233)
at sun.net.www.http.HttpClient.New(HttpClient.java:306)
at sun.net.www.http.HttpClient.New(HttpClient.java:323)
at
sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(HttpURLConnection.java:788)
at
sun.net.www.protocol.http.HttpURLConnection.plainConnect(HttpURLConnection.java:729)
at
sun.net.www.protocol.http.HttpURLConnection.connect(HttpURLConnection.java:654)
at
sun.net.www.protocol.http.HttpURLConnection.followRedirect(HttpURLConnection.java:1868)
at
sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1172)
at org.apache.hadoop.hdfs.HftpFileSystem.open(HftpFileSystem.java:142)
at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:351)
at org.apache.hadoop.tools.DistCp$CopyFilesMapper.copy(DistCp.java:410)
at org.apache.hadoop.tools.DistCp$CopyFilesMapper.map(DistCp.java:537)
at org.apache.hadoop.tools.DistCp$CopyFilesMapper.map(DistCp.java:306)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:356)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.Child.main(Child.java:170)
-bash-3.1$
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.