[ https://issues.apache.org/jira/browse/YARN-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15169531#comment-15169531 ]
Li Lu commented on YARN-4700: ----------------------------- I think the redundant events are coming from the work preserving RM restart, where the RM tries to "replay" application lifecycle events in the state store. I don't remember the JIRA number for fixing this for SMP (but I do remember [~Naganarasimha] was involved in the discussion), but seems like the conclusion was to handle this on the SMP/storage side rather than the RM side. For us, most of the tables are fine, but the flow activity table we need to distinguish a "real" activity from a replayed activity. > ATS storage has one extra record each time the RM got restarted > --------------------------------------------------------------- > > Key: YARN-4700 > URL: https://issues.apache.org/jira/browse/YARN-4700 > Project: Hadoop YARN > Issue Type: Sub-task > Components: timelineserver > Affects Versions: YARN-2928 > Reporter: Li Lu > Assignee: Naganarasimha G R > Labels: yarn-2928-1st-milestone > > When testing the new web UI for ATS v2, I noticed that we're creating one > extra record for each finished application (but still hold in the RM state > store) each time the RM got restarted. It's quite possible that we add the > cluster start timestamp into the default cluster id, thus each time we're > creating a new record for one application (cluster id is a part of the row > key). We need to fix this behavior, probably by having a better default > cluster id. -- This message was sent by Atlassian JIRA (v6.3.4#6332)