[ https://issues.apache.org/jira/browse/YARN-6860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16098141#comment-16098141 ]
Akira Ajisaka commented on YARN-6860:
-------------------------------------

The test fails in the following code:
{code}
    // the first app0 get kicked out from both rmContext and state store
    Assert.assertNull(rm2.getRMContext().getRMApps()
        .get(app0.getApplicationId()));
    Assert.assertNull(rmAppState.get(app0.getApplicationId()));
{code}
RMAppManager removes app0 from rmContext through a blocking API, but removes it from the state store through a non-blocking API (see {{RMStateStore#removeApplication}} for details). As a result, the second assertion can run before the state store removal has completed, and then it fails. I think the issue can be fixed by waiting for the removal with {{GenericTestUtils#waitFor}}. I'll attach a patch shortly.

> TestRMRestart.testFinishedAppRemovalAfterRMRestart fails intermittently
> -----------------------------------------------------------------------
>
>                 Key: YARN-6860
>                 URL: https://issues.apache.org/jira/browse/YARN-6860
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: test
>            Reporter: Akira Ajisaka
>            Assignee: Akira Ajisaka
>
> https://builds.apache.org/job/PreCommit-YARN-Build/16528/testReport/org.apache.hadoop.yarn.server.resourcemanager/TestRMRestart/testFinishedAppRemovalAfterRMRestart/
> {noformat}
> java.lang.AssertionError: expected null, but was:<submit_time: 1500886835534
> application_submission_context { application_id { id: 1 cluster_timestamp:
> 1500886835515 } application_name: "" queue: "default" priority { priority: 0
> } am_container_spec { } cancel_tokens_when_complete: true maxAppAttempts: 2
> resource { memory: 1024 virtual_cores: 1 } applicationType: "YARN"
> keep_containers_across_application_attempts: false
> attempt_failures_validity_interval: 0 am_container_resource_request {
> priority { priority: 0 } resource_name: "*" capability { memory: 1024
> virtual_cores: 1 } num_containers: 1 relax_locality: true
> node_label_expression: "" execution_type_request { execution_type: GUARANTEED
> enforce_execution_type: false } } } user: "jenkins" start_time: 1500886835535
> application_state: RMAPP_FINISHED finish_time: 1500886835559>
>       at org.junit.Assert.fail(Assert.java:88)
>       at org.junit.Assert.failNotNull(Assert.java:664)
>       at org.junit.Assert.assertNull(Assert.java:646)
>       at org.junit.Assert.assertNull(Assert.java:656)
>       at org.apache.hadoop.yarn.server.resourcemanager.TestRMRestart.testFinishedAppRemovalAfterRMRestart(TestRMRestart.java:1673)
> {noformat}
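
For illustration only, the race described in the comment above and the proposed wait can be sketched without any Hadoop classes. Everything in this sketch is an assumption made for the example: the in-memory map stands in for the state store, the single-threaded executor stands in for the non-blocking removal path, and the local waitFor helper is a hypothetical stand-in that only mimics the poll-until-true-or-timeout idea behind {{GenericTestUtils#waitFor}}. It is not the attached patch.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.BooleanSupplier;

class WaitForRemovalSketch {

  // Hypothetical stand-in for a polling helper: re-evaluates the check every
  // checkEveryMillis until it returns true, or throws after waitForMillis.
  static void waitFor(BooleanSupplier check, long checkEveryMillis,
      long waitForMillis) throws TimeoutException, InterruptedException {
    long deadline =
        System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(waitForMillis);
    while (!check.getAsBoolean()) {
      if (System.nanoTime() - deadline >= 0) {
        throw new TimeoutException(
            "Condition not met within " + waitForMillis + " ms");
      }
      Thread.sleep(checkEveryMillis);
    }
  }

  // Removes "app0" on a background thread (the non-blocking path), then polls
  // for the removal instead of asserting right away.
  // Returns true if the entry was gone before the timeout.
  static boolean removeAsyncThenWait(long removalDelayMillis,
      long waitForMillis) {
    Map<String, String> store = new ConcurrentHashMap<>();
    store.put("app0", "RMAPP_FINISHED");
    ExecutorService dispatcher = Executors.newSingleThreadExecutor();
    try {
      dispatcher.execute(() -> {
        try {
          Thread.sleep(removalDelayMillis); // simulated store latency
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();
          return;
        }
        store.remove("app0");
      });
      // An immediate assertNull(store.get("app0")) here would race with the
      // background removal; polling closes that window.
      waitFor(() -> !store.containsKey("app0"), 10, waitForMillis);
      return store.get("app0") == null;
    } catch (TimeoutException e) {
      return false;
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      return false;
    } finally {
      dispatcher.shutdownNow();
    }
  }

  public static void main(String[] args) {
    System.out.println(removeAsyncThenWait(200, 5000));
  }
}
```

In the real test, the condition passed to the wait would presumably be the same null check on rmAppState that currently fails, with the timeout chosen generously enough for slow build machines.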