[ 
https://issues.apache.org/jira/browse/HDFS-10985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mingliang Liu updated HDFS-10985:
---------------------------------
    Description: {{TestZKFailoverController#testGracefulFailoverMultipleZKfcs}} 
uses fixed time sleep before assertions. This may fail sometimes though 10 
seconds are generally long enough. I think we can use 
{{GenericTestUtils.waitFor()}} to retry the assertions.  (was: h6. Error Message
{quote}
Unable to become active. Local node did not get an opportunity to do so from 
ZooKeeper, or the local node took too long to transition to active.
{quote}
h6. Stacktrace
{quote}
org.apache.hadoop.ha.ServiceFailedException: Unable to become active. Local 
node did not get an opportunity to do so from ZooKeeper, or the local node took 
too long to transition to active.
        at 
org.apache.hadoop.ha.ZKFailoverController.doGracefulFailover(ZKFailoverController.java:690)
        at 
org.apache.hadoop.ha.ZKFailoverController.access$400(ZKFailoverController.java:62)
        at 
org.apache.hadoop.ha.ZKFailoverController$3.run(ZKFailoverController.java:607)
        at 
org.apache.hadoop.ha.ZKFailoverController$3.run(ZKFailoverController.java:604)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:422)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1795)
        at 
org.apache.hadoop.ha.ZKFailoverController.gracefulFailoverToYou(ZKFailoverController.java:604)
        at 
org.apache.hadoop.ha.ZKFCRpcServer.gracefulFailover(ZKFCRpcServer.java:94)
        at 
org.apache.hadoop.ha.TestZKFailoverController.testGracefulFailoverMultipleZKfcs(TestZKFailoverController.java:590)
{quote}

See recent failing build:
# 
https://builds.apache.org/job/PreCommit-HADOOP-Build/10705/testReport/org.apache.hadoop.ha/TestZKFailoverController/testGracefulFailoverMultipleZKfcs/
# to add more

I think we can use {{GenericTestUtils.waitFor()}} to retry the assertions.)

> o.a.h.ha.TestZKFailoverController#testGracefulFailoverMultipleZKfcs fails 
> intermittently
> ----------------------------------------------------------------------------------------
>
>                 Key: HDFS-10985
>                 URL: https://issues.apache.org/jira/browse/HDFS-10985
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: ha, test
>            Reporter: Mingliang Liu
>            Assignee: Mingliang Liu
>         Attachments: HDFS-10985.000.patch
>
>
> {{TestZKFailoverController#testGracefulFailoverMultipleZKfcs}} uses fixed 
> time sleep before assertions. This may fail sometimes though 10 seconds are 
> generally long enough. I think we can use {{GenericTestUtils.waitFor()}} to 
> retry the assertions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to