[ https://issues.apache.org/jira/browse/HDFS-4859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13669960#comment-13669960 ]
Suresh Srinivas commented on HDFS-4859: --------------------------------------- [~cmccabe] Please stop preaching to the choir. Kihwal has been contributing to hadoop and supporting many large clusters for a long time and understands the issues more than you give him credit for in these comments. If he wants to explore making FJM more robust, it is up to him. I also believe that not everyone may want to use QJM and that is e reason why these interfaces are plug gable. > Add timeout in FileJournalManager > --------------------------------- > > Key: HDFS-4859 > URL: https://issues.apache.org/jira/browse/HDFS-4859 > Project: Hadoop HDFS > Issue Type: Bug > Components: ha, namenode > Affects Versions: 2.0.4-alpha > Reporter: Kihwal Lee > > Due to absence of explicit timeout in FileJournalManager, error conditions > that incur long delay (usually until driver timeout) can make namenode > unresponsive for long time. This directly affects NN's failure detection > latency, which is critical in HA. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira