[jira] [Comment Edited] (HDFS-13119) RBF: Manage unavailable clusters

Yiqun Lin (JIRA) Wed, 14 Feb 2018 05:52:28 -0800

    [ 
https://issues.apache.org/jira/browse/HDFS-13119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16364058#comment-16364058
 ]


Yiqun Lin edited comment on HDFS-13119 at 2/14/18 1:51 PM:
-----------------------------------------------------------

Thanks for the review, [~elgoiri].
{quote}Otherwise, we could just do:
{noformat}
 if (isClusterUnAvailable(nsId) && retryCount > 0) {
 throw new IOException("No namenode available under nameservice " + nsId, ioe);
 }
{noformat}
Then, the default logic takes care of the first retry.
{quote}
Actually the default logic won't takes care of the first retry. Here we use the 
retry policy {{FailoverOnNetworkExceptionRetry}}, it will firstly jump into 
logic of {{RetryDecision.FAILOVER_AND_RETRY}} and throw {{StandbyException}}. 
In the failover rerty, the retry count is passing as 0 again.

 

Attach the new patch to fix some warnings.


was (Author: linyiqun):
Thanks for the review, [~elgoiri].
{quote}Otherwise, we could just do:
{noformat}
 if (isClusterUnAvailable(nsId) && retryCount > 0) {
 throw new IOException("No namenode available under nameservice " + nsId, ioe);
 }
{noformat}
Then, the default logic takes care of the first retry.
{quote}
Actually the default logic won't takes care of the first retry. Here we use the 
retry policy {{FailoverOnNetworkExceptionRetry}}, it will firstly jump into 
logic of {{RetryDecision.FAILOVER_AND_RETRY}} and throw {{StandbyException}}. 
In the failover rerty, the retry count is passing as 0 again.

> RBF: Manage unavailable clusters
> --------------------------------
>
>                 Key: HDFS-13119
>                 URL: https://issues.apache.org/jira/browse/HDFS-13119
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Íñigo Goiri
>            Assignee: Yiqun Lin
>            Priority: Major
>         Attachments: HDFS-13119.001.patch, HDFS-13119.002.patch, 
> HDFS-13119.003.patch
>
>
> When a federated cluster has one of the subcluster down, operations that run 
> in every subcluster ({{RouterRpcClient#invokeAll()}}) may take all the RPC 
> connections.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

[jira] [Comment Edited] (HDFS-13119) RBF: Manage unavailable clusters

Reply via email to