I have 2 clusters:
30 nodes running 0.18.3
and
36 nodes running 0.20.1
I've intermittently seen the following errors on both of my clusters; they
happen when writing files.
I was hoping this would go away with the new version, but I see the same
behavior on both.
The namenode logs don't show any problems; the errors always show up on the
client and the datanodes.
Below is an example from this morning; unfortunately I haven't found a bug
report or config setting that specifically addresses this issue.
Any insight would be greatly appreciated.
Client log:
09/11/25 10:54:15 INFO hdfs.DFSClient: Exception in createBlockOutputStream java.net.SocketTimeoutException: 69000 millis timeout while waiting for channel to be ready for read. ch : java.nio.channels.SocketChannel[connected local=/10.1.75.11:37852 remote=/10.1.75.125:50010]
09/11/25 10:54:15 INFO hdfs.DFSClient: Abandoning block blk_-105422935413230449_22608
09/11/25 10:54:15 INFO hdfs.DFSClient: Waiting to find target node: 10.1.75.125:50010
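
If I'm reading the 0.20 DFSClient right, that 69000 ms is the base
dfs.socket.timeout (60000 ms by default) plus 3000 ms per datanode in a
three-node pipeline, so the client is giving up after the full read timeout
while waiting on the first datanode. For reference, these are the
hdfs-site.xml keys I believe are involved; the values below are only
illustrative, not settings I'm recommending:

<property>
  <name>dfs.socket.timeout</name>
  <value>180000</value>  <!-- read timeout in ms; default is 60000 -->
</property>
<property>
  <name>dfs.datanode.socket.write.timeout</name>
  <value>960000</value>  <!-- write timeout in ms; default is 480000 -->
</property>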
Datanode log:
2009-11-25 10:54:51,170 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: DatanodeRegistration(10.1.75.125:50010, storageID=DS-1401408597-10.1.75.125-50010-1258737830230, infoPort=50075, ipcPort=50020):DataXceiver
java.net.SocketTimeoutException: 120000 millis timeout while waiting for channel to be ready for connect. ch : java.nio.channels.SocketChannel[connection-pending remote=/10.1.75.104:50010]
    at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:213)
    at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:404)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.writeBlock(DataXceiver.java:282)
    at org.apache.hadoop.hdfs.server.datanode.DataXceiver.run(DataXceiver.java:103)
    at java.lang.Thread.run(Thread.java:619)
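
The datanode error looks a bit different: the 120000 ms timeout fires while
the first datanode is still trying to open a TCP connection to the next one
in the pipeline (10.1.75.104:50010), which makes me suspect the network or an
overloaded peer rather than HDFS itself. To rule that out I've been running a
throwaway connectivity check between datanodes; ConnectCheck below is my own
sketch (plain java.net, nothing from Hadoop), using the same 120000 ms budget
the log shows:

import java.net.InetSocketAddress;
import java.net.Socket;

public class ConnectCheck {
    public static void main(String[] args) throws Exception {
        // Target a peer datanode's data transfer port (50010 by default).
        String host = args.length > 0 ? args[0] : "10.1.75.104";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 50010;
        Socket s = new Socket();
        long start = System.currentTimeMillis();
        try {
            // Same 120000 ms connect budget as in the DataXceiver log above.
            s.connect(new InetSocketAddress(host, port), 120000);
            System.out.println("connected in "
                + (System.currentTimeMillis() - start) + " ms");
        } finally {
            s.close();
        }
    }
}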