[ 
https://issues.apache.org/jira/browse/YARN-4700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15170387#comment-15170387
 ] 

Varun Saxena commented on YARN-4700:
------------------------------------

If we do not fix it in RM, as was the conclusion in those 2 JIRAs' and we use 
current timestamp, we will have to peek into the app table to see if the app 
already exists or not and if yes, do not enter in activity table.
The reason as it seems from the discussion on those JIRAs' is that ATS events 
are asynchronous(because we use a dispatcher in between) so its better to 
replay the events.

Maybe we can give priority to app start and finish events and make it 
synchronous for V2 as the app collector will be running within RM but this 
would block app finishing till ATS event completes.

> ATS storage has one extra record each time the RM got restarted
> ---------------------------------------------------------------
>
>                 Key: YARN-4700
>                 URL: https://issues.apache.org/jira/browse/YARN-4700
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>    Affects Versions: YARN-2928
>            Reporter: Li Lu
>            Assignee: Naganarasimha G R
>              Labels: yarn-2928-1st-milestone
>
> When testing the new web UI for ATS v2, I noticed that we're creating one 
> extra record for each finished application (but still hold in the RM state 
> store) each time the RM got restarted. It's quite possible that we add the 
> cluster start timestamp into the default cluster id, thus each time we're 
> creating a new record for one application (cluster id is a part of the row 
> key). We need to fix this behavior, probably by having a better default 
> cluster id. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to