[ 
https://issues.apache.org/jira/browse/FALCON-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14946467#comment-14946467
 ] 

Peeyush Bishnoi commented on FALCON-1433:
-----------------------------------------

On further debugging this issue, I have found that if any action node in 
sub-workflow fails then entire sub-workflow is getting executed on 1st retry. 
This is similar to Pig/Hive query, where if any job fails in query, then entire 
Pig/Hive query execute. 
So, as such I have not seen any impact of property "oozie.wf.rerun.failnodes" 
to "true" for ensuring that only failed action nodes execute from sub-workflow. 
With this behavior, we should think to set "oozie.wf.rerun.failnodes" to 
"false" and ensure that all instances must run till the defined number of 
attempts in case of failure.

Thoughts please.

> Falcon retry instance do not proceed on failure
> -----------------------------------------------
>
>                 Key: FALCON-1433
>                 URL: https://issues.apache.org/jira/browse/FALCON-1433
>             Project: Falcon
>          Issue Type: Bug
>          Components: rerun
>            Reporter: Peeyush Bishnoi
>            Assignee: Peeyush Bishnoi
>            Priority: Critical
>             Fix For: trunk
>
>
> When retry is specified in Falcon entity with number of attempts more than 1 
> and if workflow failed, on retrying Falcon instance workflow (if action fail 
> again) seems like stuck and further attempts also does not get retry.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to