[ 
https://issues.apache.org/jira/browse/HDFS-3504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13292096#comment-13292096
 ] 

Sanjay Radia commented on HDFS-3504:
------------------------------------

* Not sure if exponential backoff is flexible enough. Typically one wants to 
retry every 10 sec till about a minute and then retry every 60 sec.
* You forgot about the connection retry.
* Why is MiniDfsCluster changes needed?
                
> Configurable retry in DFSClient
> -------------------------------
>
>                 Key: HDFS-3504
>                 URL: https://issues.apache.org/jira/browse/HDFS-3504
>             Project: Hadoop HDFS
>          Issue Type: Improvement
>    Affects Versions: 1.0.0, 2.0.0-alpha
>            Reporter: Siddharth Seth
>            Assignee: Tsz Wo (Nicholas), SZE
>         Attachments: h3504_20120607.patch, h3504_20120608.patch
>
>
> When NN maintenance is performed on a large cluster, jobs end up failing. 
> This is particularly bad for long running jobs. The client retry policy could 
> be made configurable so that jobs don't need to be restarted.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to