Sangjin Lee created YARN-5095: --------------------------------- Summary: flow activities and flow runs are populated with wrong timestamp when RM restarts w/ recovery enabled Key: YARN-5095 URL: https://issues.apache.org/jira/browse/YARN-5095 Project: Hadoop YARN Issue Type: Sub-task Components: timelineserver Affects Versions: YARN-2928 Reporter: Sangjin Lee Priority: Critical
I have the RM recovery enabled. I see that upon restart the RM populates records into flow activity and flow runs but with *wrong* timestamps. What I mean by the timestamp is the part of the row key: - flow activity: row created with the day of the RM restart - flow run: row created with the RM start time as the "run id" The following illustrates an example flow run: {noformat} metrics: [ ], events: [ ], id: "sjlee@Sleep job/1463433569917", type: "YARN_FLOW_RUN", createdtime: 1463422860987, info: { UID: "yarn_cluster!sjlee!Sleep job!1463433569917", SYSTEM_INFO_FLOW_RUN_ID: 1463433569917, SYSTEM_INFO_FLOW_NAME: "Sleep job", SYSTEM_INFO_FLOW_RUN_END_TIME: 1463422865033, SYSTEM_INFO_USER: "sjlee" }, isrelatedto: { }, relatesto: { } {noformat} The created time and the end time are correct (i.e. original time), whereas the timestamp in the row key (= run id: 1463433569917) is actually later than the end time and coincides with the RM restart. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org