[ https://issues.apache.org/jira/browse/HDFS-770?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12787900#action_12787900 ]

Zheng Shao commented on HDFS-770:
---------------------------------

I saw the same thing with Hadoop 0.19 under a heavy write load.

{code}
09/10/29 01:05:18 WARN hdfs.DFSClient: DataStreamer Exception:
java.net.SocketTimeoutException: 30000 millis timeout while waiting for channel to be ready for write. ch : java.nio.channels.SocketChannel[connected local=/aa.bb.cc.dd:55040 remote=/ee.ff.gg.hh:50010]
        at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:162)
        at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:146)
        at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:107)
        at java.io.BufferedOutputStream.write(BufferedOutputStream.java:105)
        at java.io.DataOutputStream.write(DataOutputStream.java:90)
        at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream$DataStreamer.run(DFSClient.java:2323)
{code}
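
For readers who want to see the failure mode in isolation: the exception in the trace is the JDK's own java.net.SocketTimeoutException, raised when a connected peer makes no progress within the allowed time. The sketch below reproduces that with plain blocking sockets on the loopback interface. It is only an illustration and is not Hadoop's SocketIOWithTimeout code; the class name TimeoutDemo, the method readTimesOut and the 200 ms timeout are made up for this example, and it shows the read side rather than the write side seen in the trace above.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.InetAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketTimeoutException;

// Hypothetical demo class, not part of Hadoop.
public class TimeoutDemo {

    /**
     * Connects to a loopback server that accepts the connection but never
     * sends anything, then attempts a blocking read with the given timeout.
     * Returns true if the read fails with SocketTimeoutException.
     */
    public static boolean readTimesOut(int timeoutMillis) {
        try (ServerSocket server = new ServerSocket(0, 1, InetAddress.getLoopbackAddress());
             Socket client = new Socket(server.getInetAddress(), server.getLocalPort());
             Socket peer = server.accept()) {
            // Blocking reads on this socket give up after timeoutMillis.
            client.setSoTimeout(timeoutMillis);
            try {
                client.getInputStream().read();
                return false; // data or end-of-stream arrived in time
            } catch (SocketTimeoutException e) {
                return true;  // the peer stayed connected but silent
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("timed out: " + readTimesOut(200));
    }
}
```

The point of the sketch is that the connection itself is healthy (both ends are on the same host), yet the timeout still fires because the other side does nothing. That matches the shape of the report: a single-server setup where a network fault is unlikely.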


> SocketTimeoutException: timeout while waiting for channel to be ready for read
> ------------------------------------------------------------------------------
>
>                 Key: HDFS-770
>                 URL: https://issues.apache.org/jira/browse/HDFS-770
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: contrib/libhdfs, data-node, hdfs client, name-node
>    Affects Versions: 0.20.1
>         Environment: Ubuntu Linux 8.04
>            Reporter: Leon Mergen
>         Attachments: client.txt, datanode.txt, namenode.txt
>
>
> We're having issues with timeouts occurring in our client: for some reason, a 
> timeout of 63000 milliseconds is triggered while writing HDFS data. Since we 
> currently have a single-server setup, this results in our client terminating 
> with an "All datanodes are bad" IOException.
> We're running all services, including the client, on our single server, so it 
> cannot be a network error. The load on the client was extremely low during 
> this period: only a few kilobytes a minute were being written around the time 
> the error occurred. 
> After browsing a bit online, I found that a lot of people suggest setting 
> "dfs.datanode.socket.write.timeout" to 0 as a workaround for this problem. 
> Given the low load on our system during this period, however, I believe this 
> is a real error and that the timeout should not be occurring. I have attached 
> three logs: namenode, datanode and client.
> It could be that this is related to 
> http://issues.apache.org/jira/browse/HDFS-693
> Any pointers on how I can help resolve this issue would be greatly 
> appreciated.
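
For reference, the workaround mentioned in the description (setting "dfs.datanode.socket.write.timeout" to 0) would look roughly like the fragment below. This is a sketch under assumptions: the file name hdfs-site.xml applies to 0.20-style configuration layouts and is not stated in the report, and the setting needs to be visible to whichever process opens the socket. As the reporter notes, this hides the timeout rather than explaining why it fires.

```xml
<!-- Assumed location: conf/hdfs-site.xml (0.20-style layout) -->
<property>
  <name>dfs.datanode.socket.write.timeout</name>
  <!-- 0 is the value commonly suggested to disable the write timeout -->
  <value>0</value>
</property>
```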

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
