[ https://issues.apache.org/jira/browse/OOZIE-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15235691#comment-15235691 ]
Satish Subhashrao Saley commented on OOZIE-1735:
------------------------------------------------

Created a new JIRA to update the documentation - [OOZIE-2508 | https://issues.apache.org/jira/browse/OOZIE-2508]

> Support resuming of failed coordinator job and rerun of a failed coordinator action
> -----------------------------------------------------------------------------------
>
>                 Key: OOZIE-1735
>                 URL: https://issues.apache.org/jira/browse/OOZIE-1735
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Purshotam Shah
>            Assignee: Purshotam Shah
>             Fix For: 4.1.0
>
>         Attachments: OOZIE-1735-V2.patch, OOZIE-1735-V2.patch, OOZIE-1735-V3.patch, OOZIE-1735_v1.patch
>
>
> We should support resuming a failed coordinator job. Jobs are set to failed
> if there is a runtime error (like an SQL timeout).
> Currently there is no way to recover besides running SQL.
> Resuming a failed coordinator job should also set pending to 1, reset
> doneMaterialization, and set last modified to the current time, so that
> materialization continues.
> We should also provide an option for resuming a failed action. The behavior
> will be the same as the killed option.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)