[ https://issues.apache.org/jira/browse/SAMZA-563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14647244#comment-14647244 ]
Richard Lee commented on SAMZA-563:
-----------------------------------

You need to enable RM restart phase 2 to see the problem w/ Samza. In particular, the addition of the 'epoch' information seems to be what screws up the AM.

See https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/ResourceManagerRestart.html

"ContainerId string format is changed if RM restarts with work-preserving recovery enabled. It used to be such format:
Container_{clusterTimestamp}_{appId}_{attemptId}_{containerId}, e.g. Container_1410901177871_0001_01_000005.
It is now changed to:
Container_e{epoch}_{clusterTimestamp}_{appId}_{attemptId}_{containerId}, e.g. Container_e17_1410901177871_0001_01_000005."

> Upgrade Samza to YARN 2.6.0
> ---------------------------
>
>                 Key: SAMZA-563
>                 URL: https://issues.apache.org/jira/browse/SAMZA-563
>             Project: Samza
>          Issue Type: Bug
>          Components: yarn
>    Affects Versions: 0.9.0
>            Reporter: Chris Riccomini
>            Assignee: Aleksandar Pejakovic
>         Attachments: HELLO-SAMZA-563.0.patch, SAMZA-563.0.patch, SAMZA-563.2.patch
>
>
> Samza is currently running on YARN 2.4.0. We should upgrade it to YARN 2.6.0.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
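
To illustrate the ContainerId format change quoted in the comment, here is a minimal sketch of a parser that accepts both the old string form and the new one carrying the epoch. It is not Samza's or YARN's actual parsing code; the names ContainerId and parse_container_id are hypothetical, and matching the "Container_" prefix case-insensitively is an assumption made for this example.

```python
import re
from typing import NamedTuple, Optional


class ContainerId(NamedTuple):
    """Hypothetical holder for the fields named in the quoted format."""
    epoch: Optional[int]        # None when the ID uses the pre-epoch format
    cluster_timestamp: int
    app_id: int
    attempt_id: int
    container_id: int


# The "e{epoch}_" segment is optional, so one pattern covers both formats.
# Case-insensitive matching of the prefix is an assumption of this sketch.
_CONTAINER_ID = re.compile(
    r"^container_(?:e(?P<epoch>\d+)_)?"
    r"(?P<ts>\d+)_(?P<app>\d+)_(?P<attempt>\d+)_(?P<cid>\d+)$",
    re.IGNORECASE,
)


def parse_container_id(text: str) -> ContainerId:
    """Parse a container ID string in either the old or the epoch format."""
    match = _CONTAINER_ID.match(text)
    if match is None:
        raise ValueError(f"unrecognized container ID: {text!r}")
    epoch = match.group("epoch")
    return ContainerId(
        epoch=int(epoch) if epoch is not None else None,
        cluster_timestamp=int(match.group("ts")),
        app_id=int(match.group("app")),
        attempt_id=int(match.group("attempt")),
        container_id=int(match.group("cid")),
    )


if __name__ == "__main__":
    # The two example IDs from the quoted Hadoop documentation.
    print(parse_container_id("Container_1410901177871_0001_01_000005"))
    print(parse_container_id("Container_e17_1410901177871_0001_01_000005"))
```

A parser that splits on "_" and expects a fixed number of fields would see one extra field once the epoch segment appears, which is one plausible way code written against the old format could fail.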