Are your datanodes and namenode machines able to see each other - ping etc? Is the /etc/hosts configured correctly? Is the namenode process(seen through jps on master) up ?
Thanks and Regards, Sonal <https://github.com/sonalgoyal/hiho>Hadoop ETL and Data Integration<https://github.com/sonalgoyal/hiho> Nube Technologies <http://www.nubetech.co> <http://in.linkedin.com/in/sonalgoyal> On Tue, Apr 12, 2011 at 11:19 AM, prasunb <[email protected]>wrote: > > Hello, > > I am trying to configure Hadoop in fully distributed mode on three virtual > Fedora machines. During configuring I am not getting any error. Even when I > am executing the script "start-dfs.sh", there aren't any error. > > But practically the namenode isn't able to connect the datanodes. These are > the error snippents from the "hadoop-root-datanode-hadoop2.log" files of > both datanodes.... > > ================================================================== > > 2011-04-08 15:33:03,549 INFO > org.apache.hadoop.security.UserGroupInformation: JAAS Configuration already > set up for Hadoop, not re-installing. > 2011-04-08 15:33:03,691 ERROR > org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException: Call > to hadoop1/192.168.161.198:8020 failed on local exception: > java.io.IOException: Connection reset by peer > at org.apache.hadoop.ipc.Client.wrapException(Client.java:1139) > at org.apache.hadoop.ipc.Client.call(Client.java:1107) > at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:226) > at $Proxy4.getProtocolVersion(Unknown Source) > at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:398) > at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:342) > at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:317) > at org.apache.hadoop.ipc.RPC.waitForProxy(RPC.java:297) > at > > org.apache.hadoop.hdfs.server.datanode.DataNode.startDataNode(DataNode.java:338) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.(DataNode.java:280) > at > > org.apache.hadoop.hdfs.server.datanode.DataNode.makeInstance(DataNode.java:1527) > at > > org.apache.hadoop.hdfs.server.datanode.DataNode.instantiateDataNode(DataNode.java:1467) > at > > org.apache.hadoop.hdfs.server.datanode.DataNode.createDataNode(DataNode.java:1485) > at > > org.apache.hadoop.hdfs.server.datanode.DataNode.secureMain(DataNode.java:1610) > at > org.apache.hadoop.hdfs.server.datanode.DataNode.main(DataNode.java:1620) > Caused by: java.io.IOException: Connection reset by peer > at sun.nio.ch.FileDispatcher.read0(Native Method) > at sun.nio.ch.SocketDispatcher.read(SocketDispatcher.java:21) > at sun.nio.ch.IOUtil.readIntoNativeBuffer(IOUtil.java:202) > at sun.nio.ch.IOUtil.read(IOUtil.java:175) > at sun.nio.ch.SocketChannelImpl.read(SocketChannelImpl.java:243) > at > > org.apache.hadoop.net.SocketInputStream$Reader.performIO(SocketInputStream.java:55) > at > > org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:142) > at > org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:155) > at > org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:128) > at java.io.FilterInputStream.read(FilterInputStream.java:116) > at > > org.apache.hadoop.ipc.Client$Connection$PingInputStream.read(Client.java:375) > at java.io.BufferedInputStream.fill(BufferedInputStream.java:218) > at java.io.BufferedInputStream.read(BufferedInputStream.java:237) > at java.io.DataInputStream.readInt(DataInputStream.java:370) > at > org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:812) > at org.apache.hadoop.ipc.Client$Connection.run(Client.java:720) > > 2011-04-08 15:33:03,692 INFO > org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG: > /************************************************************ > SHUTDOWN_MSG: Shutting down DataNode at hadoop2/127.0.0.1 > ************************************************************/ > STARTUP_MSG: Starting DataNode > STARTUP_MSG: host = hadoop2/127.0.0.1 > STARTUP_MSG: args = [] > STARTUP_MSG: version = 0.20.2-CDH3B4 > STARTUP_MSG: build = -r 3aa7c91592ea1c53f3a913a581dbfcdfebe98bfe; > compiled by 'root' on Mon Feb 21 17:31:12 EST 2011 > ************************************************************/ > 2011-04-08 15:47:46,738 INFO > org.apache.hadoop.security.UserGroupInformation: JAAS Configuration already > set up for Hadoop, not re-installing. > 2011-04-08 15:47:47,839 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 0 time(s). > 2011-04-08 15:47:48,849 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 1 time(s). > 2011-04-08 15:47:49,859 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 2 time(s). > 2011-04-08 15:47:50,869 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 3 time(s). > 2011-04-08 15:47:51,878 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 4 time(s). > 2011-04-08 15:47:52,889 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 5 time(s). > 2011-04-08 15:47:53,900 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 6 time(s). > 2011-04-08 15:47:54,908 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 7 time(s). > 2011-04-08 15:47:55,917 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 8 time(s). > 2011-04-08 15:47:56,926 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 9 time(s). > 2011-04-08 15:47:56,928 INFO org.apache.hadoop.ipc.RPC: Server at > hadoop1/192.168.161.198:8020 not available yet, Zzzzz... > 2011-04-08 15:47:58,944 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 0 time(s). > 2011-04-08 15:47:59,953 INFO org.apache.hadoop.ipc.Client: Retrying connect > to server: hadoop1/192.168.161.198:8020. Already tried 1 time(s). > > ====================================================================== > > Can anyone please help me to understand the problem. > > Thanks in advance. > -- > View this message in context: > http://old.nabble.com/%22Retrying-connect%22-error-while-configuring-hadoop-tp31376243p31376243.html > Sent from the Hadoop core-user mailing list archive at Nabble.com. >
