Purshotam Shah created OOZIE-3006:
-------------------------------------

             Summary: Oozie submit duplicate job if action has retry and oozie 
is not able to lookup hadoop job
                 Key: OOZIE-3006
                 URL: https://issues.apache.org/jira/browse/OOZIE-3006
             Project: Oozie
          Issue Type: Bug
            Reporter: Purshotam Shah


{code:title=JavaActionExecutor.java}
Element actionXml = XmlUtils.parseXml(action.getConf());
            FileSystem actionFs = context.getAppFileSystem();
            JobConf jobConf = createBaseHadoopConf(context, actionXml);
            jobClient = createJobClient(context, jobConf);
            RunningJob runningJob = getRunningJob(context, action, jobClient);
            if (runningJob == null) {
                context.setExecutionData(FAILED, null);
                throw new 
ActionExecutorException(ActionExecutorException.ErrorType.FAILED, "JA017",
                        "Could not lookup launched hadoop Job ID [{0}] which 
was associated with " +
                        " action [{1}].  Failing this action!", 
getActualExternalId(action), action.getId());
            }
{code}

Oozie fails the action if it's not able to look up Hadoop job. If the action 
has retry, oozie will submit a new job, and it could end up running multiple 
jobs for the same action.

Not able to look up Hadoop job != job failure.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to