[ 
https://issues.apache.org/jira/browse/YARN-8335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16484379#comment-16484379
 ] 

Eric Badger commented on YARN-8335:
-----------------------------------

Hmm, good point. I wouldn't say it's exactly a dup of YARN-7904, but I 
think both issues can be fixed in YARN-7904, so I'm fine closing this as a 
dup. I'll link this JIRA so that we have it as a reference. 

> Privileged docker containers' jobSubmitDir does not get successfully cleaned 
> up
> -------------------------------------------------------------------------------
>
>                 Key: YARN-8335
>                 URL: https://issues.apache.org/jira/browse/YARN-8335
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Eric Badger
>            Priority: Major
>              Labels: Docker
>
> The jobSubmitDir directory is owned by root and is being cleaned up as the 
> submitting user, which appears to be why it is failing to clean up.
> {noformat}
> 2018-05-21 19:46:15,124 WARN  [DeletionService #0] 
> privileged.PrivilegedOperationExecutor 
> (PrivilegedOperationExecutor.java:executePrivilegedOperation(174)) - Shell 
> execution returned exit code: 255. Privileged Execution Operation Stderr:
> Stdout: main : command provided 3
> main : run as user is ebadger
> main : requested yarn user is ebadger
> failed to unlink 
> /tmp/hadoop-local3/usercache/ebadger/appcache/application_1526931492976_0007/container_1526931492976_0007_01_000001/jobSubmitDir/job.split:
>  Permission denied
> failed to unlink 
> /tmp/hadoop-local3/usercache/ebadger/appcache/application_1526931492976_0007/container_1526931492976_0007_01_000001/jobSubmitDir/job.splitmetainfo:
>  Permission denied
> failed to rmdir jobSubmitDir: Directory not empty
> Error while deleting 
> /tmp/hadoop-local3/usercache/ebadger/appcache/application_1526931492976_0007/container_1526931492976_0007_01_000001:
>  39 (Directory not empty)
> Full command array for failed execution:
> [/hadoop-3.2.0-SNAPSHOT/bin/container-executor, ebadger, ebadger, 3, 
> /tmp/hadoop-local3/usercache/ebadger/appcache/application_1526931492976_0007/container_1526931492976_0007_01_000001]
> 2018-05-21 19:46:15,124 ERROR [DeletionService #0] 
> nodemanager.LinuxContainerExecutor 
> (LinuxContainerExecutor.java:deleteAsUser(848)) - DeleteAsUser for 
> /tmp/hadoop-local3/usercache/ebadger/appcache/application_1526931492976_0007/container_1526931492976_0007_01_000001
>  returned with exit code: 255
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationException:
>  ExitCodeException exitCode=255:
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:180)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:206)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.deleteAsUser(LinuxContainerExecutor.java:844)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.deletion.task.FileDeletionTask.run(FileDeletionTask.java:135)
>         at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
>         at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>         at java.lang.Thread.run(Thread.java:748)
> Caused by: ExitCodeException exitCode=255:
>         at org.apache.hadoop.util.Shell.runCommand(Shell.java:1009)
>         at org.apache.hadoop.util.Shell.run(Shell.java:902)
>         at 
> org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1227)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:152)
>         ... 10 more
> {noformat}
> {noformat}
> [foo@bar hadoop]$ ls -l 
> /tmp/hadoop-local3/usercache/ebadger/appcache/application_1526931492976_0007/container_1526931492976_0007_01_000001/
> total 4
> drwxr-sr-x 2 root users 4096 May 21 19:45 jobSubmitDir
> {noformat}
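
The log above suggests the unlink calls fail because the submitting user
cannot write to the root-owned jobSubmitDir (mode drwxr-sr-x). Below is a
minimal sketch, not part of YARN, of how one might scan a container
directory for such directories before deletion is attempted. The function
names and the uid/gid arguments are hypothetical, and the check only looks
at plain mode bits: it ignores ACLs, sticky bits and capabilities.

```python
import os
import stat


def can_modify_dir(st, uid, gids):
    """Return True if the mode bits in `st` let (uid, gids) add or remove
    entries in the directory. Simplified: no ACLs, no sticky bit."""
    if uid == 0:
        # root bypasses ordinary permission checks
        return True
    if st.st_uid == uid:
        need = stat.S_IWUSR | stat.S_IXUSR
    elif st.st_gid in gids:
        need = stat.S_IWGRP | stat.S_IXGRP
    else:
        need = stat.S_IWOTH | stat.S_IXOTH
    return (st.st_mode & need) == need


def blocking_dirs(root, uid, gids):
    """List directories under `root` (inclusive) whose entries the given
    user could not unlink, judged by mode bits alone."""
    blocked = []
    for dirpath, _dirnames, _filenames in os.walk(root):
        if not can_modify_dir(os.lstat(dirpath), uid, gids):
            blocked.append(dirpath)
    return blocked
```

Run against the container directory from the report with the submitting
user's uid and groups, a scan like this would be expected to flag
jobSubmitDir, since it is owned by root and grants the group only r-x.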


