[ https://issues.apache.org/jira/browse/HDFS-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12787900#action_12787900 ]
Zheng Shao commented on HDFS-770: --------------------------------- I saw the same thing with hadoop 0.19 while doing heavy-weight writes. {code} 09/10/29 01:05:18 WARN hdfs.DFSClient: DataStreamer Exception: java.net.SocketTimeoutException: 30000 millis timeout while waiting for channel to be ready for write. ch : java.nio.channels.\ SocketChannel[connected local=/aa.bb.cc.dd:55040 remote=/ee.ff.gg.hh:50010] at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:162) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:146) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:107) at java.io.BufferedOutputStream.write(BufferedOutputStream.java:105) at java.io.DataOutputStream.write(DataOutputStream.java:90) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2323) {code} > SocketTimeoutException: timeout while waiting for channel to be ready for read > ------------------------------------------------------------------------------ > > Key: HDFS-770 > URL: https://issues.apache.org/jira/browse/HDFS-770 > Project: Hadoop HDFS > Issue Type: Bug > Components: contrib/libhdfs, data-node, hdfs client, name-node > Affects Versions: 0.20.1 > Environment: Ubuntu Linux 8.04 > Reporter: Leon Mergen > Attachments: client.txt, datanode.txt, namenode.txt > > > We're having issues with timeouts occurring in our client: for some reason, a > timeout of 63000 milliseconds is triggered while writing HDFS data. Since we > currently have a single-server setup, this results in our client terminating > with a "All datanodes are bad" IOException. > We're running all services, including the client, on our single server, so it > cannot be a network error. The load on the client is extremely low during > this period: only a few kilobytes a minute were being written around the time > the error occured. > After browsing a bit online, a lot of people talk about setting > "dfs.datanode.socket.write.timeout" to 0 as a solution for this problem. Due > to the low load of our system during this period, however, I do feel this is > a real error and a timeout that should not be occurring. I have attached 3 > logs of the namenode, datanode and client. > It could be that this is related to > http://issues.apache.org/jira/browse/HDFS-693 > Any pointers on how I can assist to resolve this issue will be greatly > appreciated. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.