[ 
https://issues.apache.org/jira/browse/OOZIE-1532?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14034844#comment-14034844
 ] 

Shwetha G S commented on OOZIE-1532:
------------------------------------

Purging service is enabled/configured by oozie admins(prod operations team). 
Coord action get is called by the oozie users(coord owners).  Coord owners 
can't query oozie config to check if purging service is enabled or whats the 
interval.

Take an example,
Currently for a running coord, the DB has all the actions(say C@0 to C@100). 
Coord action get with start as 10 and len 10 returns actions C@10 to C@19. 

Now assume that purging of actions for running coords is enabled and configured 
to run every 2 hours and should delete the first 10 actions(min action in DB is 
C@10). Without the changes to CoordJobGetActionsSubsetJPAExecutor, depending on 
when the user calls coord action get, the user will get different results. 
i.e., If the user makes a call before the purge service runs, the same coord 
action get(with start as 11 and len 10) returns actions C@10 to C@19. After the 
purge service runs, the coord action get(with start as 11 and len 10) returns 
actions C@20 to C@29. This is confusing for the users and there is no way for 
the users to know when the purge service runs or how oozie service is 
configured. Why should the users care about oozie maintenance(thats admins 
responsibility).

Instead if CoordJobGetActionsSubsetJPAExecutor calibrates start param with the 
min action in DB, start value of 11 maps to 1(11 - deleted actions) and the 
same coord action get(with start as 11 and len 10) returns actions C@10 to C@19 
irrespective of what time the user calls coord action get.

Makes sense?

> Purging should remove completed children job for long running coordinator jobs
> ------------------------------------------------------------------------------
>
>                 Key: OOZIE-1532
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1532
>             Project: Oozie
>          Issue Type: New Feature
>            Reporter: Srikanth Sundarrajan
>            Assignee: Bowen Zhang
>         Attachments: oozie-1532.patch, oozie-1532.patch
>
>
> Specifically, this is for long running coordinator jobs with high frequency. 
> all child workflows are never purged as the coord job is still running.
> Oozie server configuration that indicates how many coordinator actions 
> frequency ticks to keep. By doing this it would be possible to purge running 
> coord jobs. By default this would not be enabled and the current logic would 
> remain.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to