[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13551557#comment-13551557
 ] 

Jason Lowe commented on MAPREDUCE-4938:
---------------------------------------

Snippet of client output from a streaming job submission when the problem 
occurs:

{noformat}
2013-01-11 21:40:35,818 INFO  [main] mapreduce.JobSubmitter 
(JobSubmitter.java:printTokens(439)) - Submitting tokens for job: 
job_1357940284444_0001
2013-01-11 21:40:36,117 INFO  [main] client.YarnClientImpl 
(YarnClientImpl.java:submitApplication(138)) - Submitted application 
application_1357940284444_0001 to ResourceManager at xx
2013-01-11 21:40:36,164 INFO  [main] mapreduce.Job (Job.java:submit(1222)) - 
The url to track the job: http://xx/proxy/application_1357940284444_0001/
2013-01-11 21:40:36,166 INFO  [main] mapreduce.Job 
(Job.java:monitorAndPrintJob(1267)) - Running job: job_1357940284444_0001
2013-01-11 21:40:36,175 INFO  [main] mapreduce.Job 
(Job.java:monitorAndPrintJob(1288)) - Job job_1357940284444_0001 running in 
uber mode : false
2013-01-11 21:40:36,175 INFO  [main] mapreduce.Job 
(Job.java:monitorAndPrintJob(1295)) -  map 0% reduce 0%
2013-01-11 21:40:36,185 INFO  [main] mapreduce.Job 
(Job.java:monitorAndPrintJob(1308)) - Job job_1357940284444_0001 failed with 
state FAILED due to: Application appattempt_1357940284444_0001_000001 submitted 
by user someuser to unknown queue: badqueuename
2013-01-11 21:40:36,217 INFO  [main] mapreduce.Job 
(Job.java:monitorAndPrintJob(1313)) - Counters: 0
2013-01-11 21:40:36,217 ERROR [main] streaming.StreamJob 
(StreamJob.java:submitAndMonitorJob(1019)) - Job not Successful!
Streaming Command Failed!
{noformat}

And the same section of output from the client when it does not occur:

{noformat}
2013-01-11 21:49:51,292 INFO  [main] mapreduce.JobSubmitter 
(JobSubmitter.java:printTokens(439)) - Submitting tokens for job: 
job_1357940966359_0002
2013-01-11 21:49:51,556 INFO  [main] client.YarnClientImpl 
(YarnClientImpl.java:submitApplication(138)) - Submitted application 
application_1357940966359_0002 to ResourceManager at xx
2013-01-11 21:49:51,566 INFO  [main] mapreduce.JobSubmitter 
(JobSubmitter.java:submitJobInternal(401)) - Cleaning up the staging area 
/tmp/hadoop-yarn/staging/someuser/.staging/job_1357940966359_0002
2013-01-11 21:49:51,586 ERROR [main] security.UserGroupInformation 
(UserGroupInformation.java:doAs(1453)) - PriviledgedActionException as:someuser 
(auth:SIMPLE) cause:java.io.IOException: Failed to run job : Application 
appattempt_1357940966359_0002_000001 submitted by user someuser to unknown 
queue: badqueuename
2013-01-11 21:49:51,586 ERROR [main] security.UserGroupInformation 
(UserGroupInformation.java:doAs(1453)) - PriviledgedActionException as:someuser 
(auth:SIMPLE) cause:java.io.IOException: Failed to run job : Application 
appattempt_1357940966359_0002_000001 submitted by user someuser to unknown 
queue: badqueuename
2013-01-11 21:49:51,586 ERROR [main] streaming.StreamJob 
(StreamJob.java:submitAndMonitorJob(1034)) - Error Launching job : Failed to 
run job : Application appattempt_1357940966359_0002_000001 submitted by user 
someuser to unknown queue: badqueuename
Streaming Command Failed!
{noformat}
                
> Job submission to unknown queue can leave staging directory behind
> ------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4938
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4938
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: client
>    Affects Versions: 2.0.3-alpha, 0.23.5
>            Reporter: Jason Lowe
>
> There is a race where submitting a job to an unknown queue can appear to 
> succeed to the client and then subsequently fail later.  Since there was no 
> AM ever launched, there was nothing left to cleanup the staging directory.  
> At that point the client is the only thing that can cleanup the staging 
> directory.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to