[
https://issues.apache.org/jira/browse/OOZIE-1953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
peter liu updated OOZIE-1953:
-----------------------------
Summary: oozie workflow is stuck in RUNNING states if folder 'oozie-oozi'
was deleted durning it's running (was: oozie workflow is stuck in RUNNING
states if folder 'oozie-oozi' was deleted durning t's running)
> oozie workflow is stuck in RUNNING states if folder 'oozie-oozi' was deleted
> durning it's running
> -------------------------------------------------------------------------------------------------
>
> Key: OOZIE-1953
> URL: https://issues.apache.org/jira/browse/OOZIE-1953
> Project: Oozie
> Issue Type: Bug
> Affects Versions: 4.0.0
> Reporter: peter liu
>
> Step to reproduce:
> After started a workflow, delete the auto-created folder 'oozie-oozi' on
> HDFS, then the workflow will be stuck in RUNNING status and never get killed.
> From below log it seems in an infinite loop to try to find the generated
> action files:
> {quote}
> Caused by: java.io.FileNotFoundException: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
> at
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:277)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:263)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper.getActionData(LauncherMapperHelper.java:263)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1073)
> ... 7 more
> 2014-07-30 20:11:09,622 WARN MapReduceActionExecutor:542 - USER[test_user1]
> GROUP[-] TOKEN[] APP[WorkFlowSDKAction]
> JOB[0002981-140710172352455-oozie-oozi-W]
> ACTION[0002981-140710172352455-oozie-oozi-W@SDKAction] Exception in check().
> Message[File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.]
> java.io.FileNotFoundException: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
> at
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:277)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:263)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper.getActionData(LauncherMapperHelper.java:263)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1073)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:177)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:56)
> at org.apache.oozie.command.XCommand.call(XCommand.java:280)
> at
> org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> 2014-07-30 20:11:09,622 WARN ActionCheckXCommand:542 - USER[test_user1]
> GROUP[-] TOKEN[] APP[WorkFlowSDKAction]
> JOB[0002981-140710172352455-oozie-oozi-W]
> ACTION[0002981-140710172352455-oozie-oozi-W@SDKAction] Exception while
> executing check(). Error Code [JA008], Message[JA008: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.]
> org.apache.oozie.action.ActionExecutorException: JA008: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:412)
> at
> org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:396)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1163)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:177)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:56)
> at org.apache.oozie.command.XCommand.call(XCommand.java:280)
> at
> org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> Caused by: java.io.FileNotFoundException: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
> at
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:277)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:263)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper.getActionData(LauncherMapperHelper.java:263)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1073)
> ... 7 more
> 2014-07-30 20:22:09,737 WARN MapReduceActionExecutor:542 - USER[test_user1]
> GROUP[-] TOKEN[] APP[WorkFlowSDKAction]
> JOB[0002981-140710172352455-oozie-oozi-W]
> ACTION[0002981-140710172352455-oozie-oozi-W@SDKAction] Exception in check().
> Message[File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.]
> java.io.FileNotFoundException: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
> at
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:277)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:263)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper.getActionData(LauncherMapperHelper.java:263)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1073)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:177)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:56)
> at org.apache.oozie.command.XCommand.call(XCommand.java:280)
> at
> org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> 2014-07-30 20:22:09,737 WARN ActionCheckXCommand:542 - USER[test_user1]
> GROUP[-] TOKEN[] APP[WorkFlowSDKAction]
> JOB[0002981-140710172352455-oozie-oozi-W]
> ACTION[0002981-140710172352455-oozie-oozi-W@SDKAction] Exception while
> executing check(). Error Code [JA008], Message[JA008: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.]
> org.apache.oozie.action.ActionExecutorException: JA008: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.oozie.action.ActionExecutor.convertExceptionHelper(ActionExecutor.java:412)
> at
> org.apache.oozie.action.ActionExecutor.convertException(ActionExecutor.java:396)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1163)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:177)
> at
> org.apache.oozie.command.wf.ActionCheckXCommand.execute(ActionCheckXCommand.java:56)
> at org.apache.oozie.command.XCommand.call(XCommand.java:280)
> at
> org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> Caused by: java.io.FileNotFoundException: File
> hdfs://d0e003ash1013.mgmt.symcpe.net/user/test_user1/oozie-oozi/0002981-140710172352455-oozie-oozi-W/SDKAction--map-reduce
> does not exist.
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
> at
> org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:277)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper$1.run(LauncherMapperHelper.java:263)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
> at
> org.apache.oozie.action.hadoop.LauncherMapperHelper.getActionData(LauncherMapperHelper.java:263)
> at
> org.apache.oozie.action.hadoop.JavaActionExecutor.check(JavaActionExecutor.java:1073)
> ... 7 more
> {quote}
--
This message was sent by Atlassian JIRA
(v6.2#6252)