Hi,

I have launched an AWS Spark cluster using the spark-ec2 script
(--hadoop-major-version=1). The ephemeral-HDFS is setup correctly and I can
see the name node at <master hostname>:50070. When I try to copy files from
S3 into ephemeral-HDFS using distcp using the following command:

ephemeral-hdfs/bin/hadoop distcp <S3 URL> hdfs://<url of machine hosting
namenode>:9001/data-platform/backfill/weekly-vel-acc-data-tables/data/drive-sample-distcp

I get the following:

Copy failed: java.net.ConnectException: Call to
ec2-54-89-53-102.compute-1.amazonaws.com/10.146.200.172:9001 failed on
connection exception: java.net.ConnectException: Connection refused

at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099)

at org.apache.hadoop.ipc.Client.call(Client.java:1075)

at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225)

at org.apache.hadoop.mapred.$Proxy2.getProtocolVersion(Unknown Source)

at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396)

at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379)

at org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480)

at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474)

at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457)

at org.apache.hadoop.tools.DistCp.setup(DistCp.java:1015)

at org.apache.hadoop.tools.DistCp.copy(DistCp.java:666)

at org.apache.hadoop.tools.DistCp.run(DistCp.java:881)

at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)

at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)

at org.apache.hadoop.tools.DistCp.main(DistCp.java:908)

Caused by: java.net.ConnectException: Connection refused

at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)

at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)

at
org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)

at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489)

at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:434)

at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:560)

at org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:184)

at org.apache.hadoop.ipc.Client.getConnection(Client.java:1206)

at org.apache.hadoop.ipc.Client.call(Client.java:1050)

... 13 more

Reply via email to