[ https://issues.apache.org/jira/browse/HDFS-3541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13396850#comment-13396850 ]
Kihwal Lee commented on HDFS-3541: ---------------------------------- With the proposed fix, the DataXceiver thread would break out of responder.join(), but wouldn't PacketResponder keep running and do finalizeBlock() and closeBlock()? It would even try to send an ack back, which would probably fail anyway. Rather than letting PacketResponder asynchronously modify the state, can we make it stop? > Deadlock between recovery, xceiver and packet responder > ------------------------------------------------------- > > Key: HDFS-3541 > URL: https://issues.apache.org/jira/browse/HDFS-3541 > Project: Hadoop HDFS > Issue Type: Bug > Components: data-node > Affects Versions: 0.23.3, 2.0.1-alpha > Reporter: suja s > Assignee: Vinay > Attachments: DN_dump.rar > > > Block Recovery initiated while write in progress at Datanode side. Found a > lock between recovery, xceiver and packet responder. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira