[ 
https://issues.apache.org/jira/browse/YARN-6027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15817073#comment-15817073
 ] 

Varun Saxena commented on YARN-6027:
------------------------------------

bq. This should not happen. There should be exactly one row for a flow on a 
given day.
Yes. I think they were retrieving data based on the last 24 hours instead of 
specific dates. That's why duplicate records showed up.
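
To illustrate the point above, here is a minimal sketch of why a rolling 24-hour window can return two rows for one flow while a date-aligned query returns one. It assumes flow activity is bucketed by the start of the UTC day; the class DayBuckets and its methods are hypothetical names for illustration and are not part of the timeline service code.

```java
import java.util.ArrayList;
import java.util.List;

public class DayBuckets {
    static final long DAY_MS = 24L * 60 * 60 * 1000;

    // Start of the UTC day containing ts (assumed bucketing rule).
    static long topOfDay(long ts) {
        return ts - (ts % DAY_MS);
    }

    // Every day bucket that the half-open range [start, end) overlaps.
    static List<Long> bucketsTouched(long start, long end) {
        List<Long> buckets = new ArrayList<>();
        for (long day = topOfDay(start); day < end; day += DAY_MS) {
            buckets.add(day);
        }
        return buckets;
    }

    public static void main(String[] args) {
        // 15:00 UTC on an arbitrary day.
        long now = 10 * DAY_MS + 15L * 60 * 60 * 1000;

        // Rolling window: overlaps yesterday's and today's buckets.
        System.out.println(bucketsTouched(now - DAY_MS, now).size()); // prints 2

        // Date-aligned window: exactly one bucket.
        long dayStart = topOfDay(now);
        System.out.println(bucketsTouched(dayStart, dayStart + DAY_MS).size()); // prints 1
    }
}
```

A flow that was active on both days would have one row in each of the two buckets, so a rolling-window query would list it twice even though each day holds only one row for it.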

bq. We do have a lot of runs of a flow on a given day, for instance hRaven is 
running constantly on our cluster. So we do expect several runs of a flow in a 
day.
How many do we typically expect? Can it run into the thousands? I had raised a 
JIRA to limit flow runs within a flow. We should probably have that support 
then.

> Support fromId for flows API 
> -----------------------------
>
>                 Key: YARN-6027
>                 URL: https://issues.apache.org/jira/browse/YARN-6027
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Rohith Sharma K S
>            Assignee: Rohith Sharma K S
>              Labels: yarn-5355-merge-blocker
>
> In YARN-5585, fromId is supported for retrieving entities. We need a similar 
> filter for flows, flow runs, and flow run apps as well. 
> Along with supporting fromId, this JIRA should also discuss the following points:
> * Should we throw an exception for entities/entity retrieval if duplicates 
> are found?
> * TimelineEntity:
> ** Should the equals method also check idPrefix?
> ** Is idPrefix part of the identifiers?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
