[ 
https://issues.apache.org/jira/browse/HDFS-4859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13669960#comment-13669960
 ] 

Suresh Srinivas commented on HDFS-4859:
---------------------------------------

[~cmccabe] Please stop preaching to the choir.

Kihwal has been contributing to hadoop and supporting many large clusters for a 
long time and understands the issues more than you give him credit for in these 
comments. If he wants to explore making FJM more robust, it is up to him. I 
also believe that not everyone may want to use QJM and that is e reason why 
these interfaces are plug gable. 
                
> Add timeout in FileJournalManager
> ---------------------------------
>
>                 Key: HDFS-4859
>                 URL: https://issues.apache.org/jira/browse/HDFS-4859
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: ha, namenode
>    Affects Versions: 2.0.4-alpha
>            Reporter: Kihwal Lee
>
> Due to absence of explicit timeout in FileJournalManager, error conditions 
> that incur long delay (usually until driver timeout) can make namenode 
> unresponsive for long time. This directly affects NN's failure detection 
> latency, which is critical in HA.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to