[ https://issues.apache.org/jira/browse/YARN-6027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15817073#comment-15817073 ]
Varun Saxena commented on YARN-6027:
------------------------------------

bq. This should not happen. There should be exactly one row for a flow on a given day.

Yes. I think they were retrieving data based on the last 24 hours instead of specific dates. That is why duplicate records appeared.

bq. We do have a lot of runs of a flow on a given day, for instance hRaven is running constantly on our cluster. So we do expect several runs of a flow in a day.

How many do we typically expect? Can it run into the thousands? I had raised a JIRA to limit flow runs within a flow. We should probably have that support then.

> Support fromId for flows API
> ----------------------------
>
>                 Key: YARN-6027
>                 URL: https://issues.apache.org/jira/browse/YARN-6027
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Rohith Sharma K S
>            Assignee: Rohith Sharma K S
>              Labels: yarn-5355-merge-blocker
>
> In YARN-5585, fromId is supported for retrieving entities. We need a similar
> filter for flows/flowRun apps, flow runs, and flows as well.
> Along with supporting fromId, this JIRA should also discuss the following points:
> * Should we throw an exception for entities/entity retrieval if duplicates
> are found?
> * TimelineEntity:
> ** Should the equals method also check idPrefix?
> ** Is idPrefix part of the identifiers?