[
https://issues.apache.org/jira/browse/OOZIE-3584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mate Juhasz updated OOZIE-3584:
-------------------------------
Fix Version/s: trunk
> Fork-join action issue when action param cannot be resolved
> -----------------------------------------------------------
>
> Key: OOZIE-3584
> URL: https://issues.apache.org/jira/browse/OOZIE-3584
> Project: Oozie
> Issue Type: Bug
> Components: core
> Affects Versions: 5.1.0
> Reporter: Janos Makai
> Assignee: Janos Makai
> Priority: Major
> Fix For: trunk
>
> Attachments: OOZIE-3584-001.patch, forkjoin_actionparam_issue.log
>
>
> _Current bug_
> *=========*
> There is a sub workflow run in independent mode that runs a fork action which
> contains two (or more) actions.
> These actions inside the fork action run in parallel mode, and they have some
> seconds delay in between them.
> If a parameter is passed to one of these actions, that cannot be resolved,
> then it changes its status to FAILED, and also the workflow’s state to
> FAILED. The other action’s state which are not started yet will stuck in PREP
> state forever. The correct behaviour would be to KILL the remaining actions
> as well as the workflow.
> Note: this bug only occurs when it is run in independent mode. If it has a
> parent workflow, then the parent workflow will kill this workflow after 10
> minutes because of the callback process.
>
> _Log_
> *===*
> 2020-02-14 11:59:26,698 ERROR org.apache.oozie.command.wf.SignalXCommand:
> SERVER[quasar-nqrrjp-4.quasar-nqrrjp.root.hwx.site] USER[admin] GROUP[-]
> TOKEN[] APP[Sub flow fork join] JOB[0000005-200214101441478-oozie-oozi-W]
> ACTION[0000005-200214101441478-oozie-oozi-W@fork-4a1c] Error running forked
> jobs parallely
> org.apache.oozie.command.CommandException: E0718: Workflow already completed
> at
> org.apache.oozie.command.wf.ActionXCommand.failJob(ActionXCommand.java:213)
> at
> org.apache.oozie.command.wf.ActionXCommand.failJob(ActionXCommand.java:185)
> at
> org.apache.oozie.command.wf.SignalXCommand.startForkedActions(SignalXCommand.java:498)
> at
> org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:462)
> at org.apache.oozie.command.wf.SignalXCommand.execute(SignalXCommand.java:82)
> at org.apache.oozie.command.XCommand.call(XCommand.java:291)
> at
> org.apache.oozie.command.wf.ActionEndXCommand.execute(ActionEndXCommand.java:283)
> at
> org.apache.oozie.command.wf.ActionEndXCommand.execute(ActionEndXCommand.java:62)
> at org.apache.oozie.command.XCommand.call(XCommand.java:291)
> at
> org.apache.oozie.command.wf.ActionStartXCommand.callActionEnd(ActionStartXCommand.java:352)
> at
> org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:338)
> at
> org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:68)
> at org.apache.oozie.command.XCommand.call(XCommand.java:291)
> at
> org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:363)
> at
> org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:292)
> at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
> at
> org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:210)
> at
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
> at
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
> at java.base/java.lang.Thread.run(Thread.java:834)
> Caused by: org.apache.oozie.workflow.WorkflowException: E0718: Workflow
> already completed
> at
> org.apache.oozie.workflow.lite.LiteWorkflowInstance.fail(LiteWorkflowInstance.java:337)
> at
> org.apache.oozie.command.wf.ActionXCommand.failJob(ActionXCommand.java:201)
> ... 19 more
> Full log added as attachment.
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)