[ https://issues.apache.org/jira/browse/HDFS-1056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14079488#comment-14079488 ]
Allen Wittenauer commented on HDFS-1056: ---------------------------------------- Ping! I suspect fixed though. > Multi-node RPC deadlocks during block recovery > ---------------------------------------------- > > Key: HDFS-1056 > URL: https://issues.apache.org/jira/browse/HDFS-1056 > Project: Hadoop HDFS > Issue Type: Improvement > Components: datanode > Affects Versions: 0.20.2, 0.21.0, 0.22.0 > Reporter: Todd Lipcon > Fix For: 0.20-append > > Attachments: > 0013-HDFS-1056.-Fix-possible-multinode-deadlocks-during-b.patch > > > Believe it or not, I'm seeing HADOOP-3657 / HADOOP-3673 in a 5-node 0.20 > cluster. I have many concurrent writes on the cluster, and when I kill a DN, > some percentage of the time I get one of these cross-node deadlocks among 3 > of the nodes (replication 3). All of the DN RPC server threads are tied up > waiting on RPC clients to other datanodes. -- This message was sent by Atlassian JIRA (v6.2#6252)