[ https://issues.apache.org/jira/browse/YARN-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16550703#comment-16550703 ]
Bibin A Chundatt commented on YARN-8558:
----------------------------------------

On container removal, the following keys are not deleted from the state store. {{CONTAINER_START_TIME_KEY_SUFFIX}} causes most of the problem, because it is added on every container start call.

{code}
CONTAINER_START_TIME_KEY_SUFFIX
CONTAINER_VERSION_KEY_SUFFIX
CONTAINER_REMAIN_RETRIES_KEY_SUFFIX
CONTAINER_RESTART_TIMES_SUFFIX
CONTAINER_WORK_DIR_KEY_SUFFIX
CONTAINER_LOG_DIR_KEY_SUFFIX
{code}

> NM recovery level db not cleaned up properly on container finish
> ----------------------------------------------------------------
>
> Key: YARN-8558
> URL: https://issues.apache.org/jira/browse/YARN-8558
> Project: Hadoop YARN
> Issue Type: Bug
> Affects Versions: 3.0.0, 3.1.0
> Reporter: Bibin A Chundatt
> Assignee: Bibin A Chundatt
> Priority: Critical
>
> {code}
> 2018-07-20 16:49:23,117 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl: Application application_1531994217928_0054 transitioned from NEW to INITING
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000018 with incomplete records
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000019 with incomplete records
> 2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000020 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000021 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000022 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000023 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000024 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000025 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000038 with incomplete records
> 2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000039 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000041 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000044 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000046 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000049 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000052 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000054 with incomplete records
> 2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000073 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000074 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000075 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000078 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000079 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000082 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000083 with incomplete records
> 2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000085 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627738 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627742 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627746 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627749 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627753 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627757 with incomplete records
> 2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627761 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627765 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627769 with incomplete records
> 2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627773 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627679 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627681 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627684 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627690 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627695 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627696 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627702 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627706 with incomplete records
> 2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627710 with incomplete records
> 2018-07-20 16:49:23,211 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627712 with incomplete records
> {code}
> NM state store size could increase in long running scenarios, and recovery could be slow
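
To illustrate the cleanup gap described in the comment above, here is a minimal sketch of removing every per-container key by prefix rather than deleting a fixed list of suffixes. It is not the actual patch: a {{TreeMap}} stands in for leveldb, and the key prefix, the suffix string values, and the class and method names are all hypothetical. Only the suffix constant names come from the comment.

```java
import java.util.SortedMap;
import java.util.TreeMap;

class ContainerKeyCleanupSketch {
  // Hypothetical key prefix; the real NM state store layout may differ.
  static final String CONTAINERS_KEY_PREFIX = "containers/";

  // Placeholder values standing in for the suffix constants named in the comment.
  static final String[] KEY_SUFFIXES = {
      "/starttime",      // CONTAINER_START_TIME_KEY_SUFFIX
      "/version",        // CONTAINER_VERSION_KEY_SUFFIX
      "/remainRetries",  // CONTAINER_REMAIN_RETRIES_KEY_SUFFIX
      "/restartTimes",   // CONTAINER_RESTART_TIMES_SUFFIX
      "/workDir",        // CONTAINER_WORK_DIR_KEY_SUFFIX
      "/logDir"          // CONTAINER_LOG_DIR_KEY_SUFFIX
  };

  // Simulates the records written for one container over its lifetime.
  static void recordContainer(TreeMap<String, String> store, String containerId) {
    for (String suffix : KEY_SUFFIXES) {
      store.put(CONTAINERS_KEY_PREFIX + containerId + suffix, "value");
    }
  }

  // Deletes every key under the container's prefix, so a suffix added later
  // cannot be forgotten. Returns the number of keys removed.
  static int removeContainer(TreeMap<String, String> store, String containerId) {
    // The trailing "/" keeps ..._000018 from matching ..._0000181.
    String prefix = CONTAINERS_KEY_PREFIX + containerId + "/";
    SortedMap<String, String> range =
        store.subMap(prefix, prefix + Character.MAX_VALUE);
    int removed = range.size();
    range.clear(); // subMap is a view, so this removes from the backing map
    return removed;
  }

  public static void main(String[] args) {
    TreeMap<String, String> store = new TreeMap<>();
    recordContainer(store, "container_1531994217928_0001_01_000018");
    recordContainer(store, "container_1531994217928_0001_01_000019");
    int removed = removeContainer(store, "container_1531994217928_0001_01_000018");
    System.out.println("removed=" + removed + " remaining=" + store.keySet());
  }
}
```

With a real leveldb handle the same idea would presumably be an iterator seek to the prefix followed by batched deletes; that part is left out here because it depends on the store's actual API usage.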