[ https://issues.apache.org/jira/browse/YARN-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16916312#comment-16916312 ]
Hudson commented on YARN-9640: ------------------------------ FAILURE: Integrated in Jenkins build Hadoop-trunk-Commit #17189 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/17189/]) YARN-9640. Slow event processing could cause too many attempt unregister (rohithsharmaks: rev d70f5231a794804d9baf0bdb7e6833533fe60db5) * (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/ApplicationMasterServiceTestBase.java * (edit) hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/ApplicationMasterService.java > Slow event processing could cause too many attempt unregister events > -------------------------------------------------------------------- > > Key: YARN-9640 > URL: https://issues.apache.org/jira/browse/YARN-9640 > Project: Hadoop YARN > Issue Type: Bug > Reporter: Bibin A Chundatt > Assignee: Bibin A Chundatt > Priority: Critical > Labels: scalability > Fix For: 3.3.0 > > Attachments: YARN-9640.001.patch, YARN-9640.002.patch, > YARN-9640.003.patch > > > We found in one of our test cluster verification that the number attempt > unregister events is about 300k+. > # AM all containers completed. > # AMRMClientImpl send finishApplcationMaster > # AMRMClient check event 100ms the finish Status using > finishApplicationMaster request. > # AMRMClientImpl#unregisterApplicationMaster > {code:java} > while (true) { > FinishApplicationMasterResponse response = > rmClient.finishApplicationMaster(request); > if (response.getIsUnregistered()) { > break; > } > LOG.info("Waiting for application to be successfully unregistered."); > Thread.sleep(100); > } > {code} > # ApplicationMasterService finishApplicationMaster interface sends > unregister events on every status update. > We should send unregister event only once and cache event send , ignore and > send not unregistered response back to AM not overloading the event queue. -- This message was sent by Atlassian Jira (v8.3.2#803003) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org