Junping Du created YARN-4325:
--------------------------------

             Summary: purge app state from NM state-store should be independent of log aggregation
                 Key: YARN-4325
                 URL: https://issues.apache.org/jira/browse/YARN-4325
             Project: Hadoop YARN
          Issue Type: Bug
    Affects Versions: 2.6.0
            Reporter: Junping Du
            Assignee: Junping Du
            Priority: Critical
On a long-running cluster, we found tens of thousands of stale apps still
being recovered during NM restart recovery. The cause was a misconfigured
log aggregation setting: the end-of-log-aggregation events were never
received, so the stale apps were never purged from the NM state-store.
We should make sure the removal of app state is independent of the log
aggregation life cycle.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
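The coupling described in this issue can be illustrated with a small toy model. This is only a sketch under stated assumptions: it is not YARN code and not the eventual patch. Every name in it (`ToyStateStore`, `CoupledNM`, `DecoupledNM`, `AppEvent`, `simulate`) is hypothetical and was invented for this illustration. It assumes that log aggregation never reports completion, as with the misconfiguration above, and compares what a restart would recover in each design.

```python
from enum import Enum, auto


class AppEvent(Enum):
    """Hypothetical events; these are not real YARN event types."""
    APP_FINISHED = auto()           # the application itself has completed
    LOG_AGGREGATION_DONE = auto()   # may never arrive if aggregation is misconfigured


class ToyStateStore:
    """In-memory stand-in for the NM state-store (illustration only)."""

    def __init__(self):
        self._apps = set()

    def store_app(self, app_id):
        self._apps.add(app_id)

    def remove_app(self, app_id):
        self._apps.discard(app_id)

    def recover_apps(self):
        # What an NM restart would load back from the store.
        return sorted(self._apps)


class CoupledNM:
    """Purges app state only when log aggregation reports completion."""

    def __init__(self, store):
        self.store = store

    def handle(self, app_id, event):
        if event is AppEvent.LOG_AGGREGATION_DONE:
            self.store.remove_app(app_id)


class DecoupledNM:
    """Purges app state when the app finishes, whatever log aggregation does."""

    def __init__(self, store):
        self.store = store

    def handle(self, app_id, event):
        if event in (AppEvent.APP_FINISHED, AppEvent.LOG_AGGREGATION_DONE):
            self.store.remove_app(app_id)


def simulate(nm_cls, n_apps):
    """Run n_apps to completion with no aggregation events, then 'restart'."""
    store = ToyStateStore()
    nm = nm_cls(store)
    for i in range(1, n_apps + 1):
        app_id = f"app_{i}"
        store.store_app(app_id)
        nm.handle(app_id, AppEvent.APP_FINISHED)  # LOG_AGGREGATION_DONE never sent
    return store.recover_apps()


if __name__ == "__main__":
    print("coupled:  ", simulate(CoupledNM, 3))
    print("decoupled:", simulate(DecoupledNM, 3))
```

In this toy model the coupled design keeps every finished app in the store, so all of them are recovered after a restart, while the decoupled design leaves nothing stale behind. A real change would also have to decide when it is safe to drop state that log aggregation itself still needs, which this sketch does not attempt to model.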