[
https://issues.apache.org/jira/browse/OOZIE-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950183#comment-13950183
]
Rohini Palaniswamy commented on OOZIE-1735:
-------------------------------------------
Test failure unrelated (TestEventGeneration.testCoordinatorActionEvent). RAT
warning is due to patch not applying fully cleanly.
patching file docs/src/site/twiki/DG_CommandLineTool.twiki
Hunk #1 succeeded at 321 (offset 3 lines).
Unapproved licenses:
docs/src/site/twiki/DG_CommandLineTool.twiki.orig
> Support resuming of failed coordinator job and rerun of a failed coordinator
> action
> -----------------------------------------------------------------------------------
>
> Key: OOZIE-1735
> URL: https://issues.apache.org/jira/browse/OOZIE-1735
> Project: Oozie
> Issue Type: Bug
> Reporter: Purshotam Shah
> Assignee: Purshotam Shah
> Fix For: trunk
>
> Attachments: OOZIE-1735-V2.patch, OOZIE-1735-V2.patch,
> OOZIE-1735-V3.patch, OOZIE-1735_v1.patch
>
>
> We should support resuming of failed coordinator job. Job are set to failed
> if there are runtime error( like SQL timeout).
> In current scenario there is no way to recover beside running SQL.
> Resuming of failed coordinator job should also set pending to 1 ,reset
> doneMaterialization and last modified to current time. So that
> materialization continues.
> We should also provide an option of resuming failed action. The behavior will
> be same as killed option.
--
This message was sent by Atlassian JIRA
(v6.2#6252)