[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13636558#comment-13636558
 ] 

Owen O'Malley edited comment on MAPREDUCE-5169 at 4/19/13 9:07 PM:
-------------------------------------------------------------------

Client Logs
{code}
/usr/lib/hadoop/bin/hadoop jar /usr/lib/hadoop/hadoop-examples-1.X.Y.jar 
wordcount "-Dmapreduce.reduce.input.limit=-1" 
/user/hrt_qa/test_mapred_ha/medium_wordcount_input 
/user/hrt_qa/test_mapred_ha/jobtracker-near-submit_wc_output
13/04/17 22:04:10 INFO input.FileInputFormat: Total input paths to process : 20
13/04/17 22:04:10 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library
13/04/17 22:04:10 INFO lzo.LzoCodec: Successfully loaded & initialized 
native-lzo library [hadoop-lzo rev cf4e7cbf8ed0f0622504d008101c2729dc0c9ff3]
13/04/17 22:04:10 WARN snappy.LoadSnappy: Snappy native library is available
13/04/17 22:04:10 INFO util.NativeCodeLoader: Loaded the native-hadoop library
13/04/17 22:04:10 INFO snappy.LoadSnappy: Snappy native library loaded
13/04/17 22:04:11 INFO mapred.JobClient: Running job: job_201304172049_0008
13/04/17 22:04:12 INFO mapred.JobClient:  map 0% reduce 0%
13/04/17 22:04:32 INFO ipc.Client: Retrying connect to server: host:50300. 
Already tried 0 time(s); retry policy is MultipleLinearRandomRetry[6x10000ms, 
10x60000ms]
java.io.IOException: The job appears to have been removed.
at 
org.apache.hadoop.mapred.JobClient$NetworkedJob.updateStatus(JobClient.java:241)
at 
org.apache.hadoop.mapred.JobClient$NetworkedJob.isComplete(JobClient.java:321)
at org.apache.hadoop.mapred.JobClient.monitorAndPrintJob(JobClient.java:1382)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:583)
at org.apache.hadoop.examples.WordCount.main(WordCount.java:82)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:64)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:160)
{code}


Jobtracker logs

{code}
2013-04-17 22:04:11,305 INFO org.apache.hadoop.mapred.JobInProgress: 
job_201304172049_0008: nMaps=180 nReduces=1 max=-1
2013-04-17 22:04:11,357 INFO org.apache.hadoop.mapred.JobQueuesManager: Job 
job_201304172049_0008 submitted to queue default
2013-04-17 22:04:11,357 INFO org.apache.hadoop.mapred.JobTracker: Job 
job_201304172049_0008 added successfully for user 'hrt_qa' to queue 'default'
2013-04-17 22:04:14,268 INFO org.apache.hadoop.mapred.JobInitializationPoller: 
Passing to Initializer Job Id :job_201304172049_0008 User: user Queue : default
2013-04-17 22:04:15,181 INFO org.apache.hadoop.mapred.JobTracker: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down JobTracker at 
************************************************************/
2013-04-17 22:04:15,204 INFO org.apache.hadoop.mapred.JobInitializationPoller: 
Initializing job : job_201304172049_0008 in Queue default For user : user
2013-04-17 22:04:15,204 INFO org.apache.hadoop.mapred.JobTracker: Initializing 
job_201304172049_0008
2013-04-17 22:04:15,204 INFO org.apache.hadoop.mapred.JobInProgress: 
Initializing job_201304172049_0008
2013-04-17 22:04:30,900 INFO org.apache.hadoop.mapred.JobTracker: STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting JobTracker
STARTUP_MSG:   host = 
STARTUP_MSG:   args = []
STARTUP_MSG:   version = 
STARTUP_MSG:   build = 
STARTUP_MSG:   java = 1.6.0_31
************************************************************/
...
...
...
2013-04-17 22:04:31,862 WARN org.apache.hadoop.mapred.JobTracker: Job 
job_201304172049_0008 does not have valid info/token file so ignoring for 
recovery
{code}
                
      was (Author: arpitgupta):
    Client Logs
{code}
/usr/lib/hadoop/bin/hadoop jar 
/usr/lib/hadoop/hadoop-examples-1.3.0.1.3.0.0-15.jar wordcount 
"-Dmapreduce.reduce.input.limit=-1" 
/user/hrt_qa/test_mapred_ha/medium_wordcount_input 
/user/hrt_qa/test_mapred_ha/jobtracker-near-submit_wc_output
13/04/17 22:04:10 INFO input.FileInputFormat: Total input paths to process : 20
13/04/17 22:04:10 INFO lzo.GPLNativeCodeLoader: Loaded native gpl library
13/04/17 22:04:10 INFO lzo.LzoCodec: Successfully loaded & initialized 
native-lzo library [hadoop-lzo rev cf4e7cbf8ed0f0622504d008101c2729dc0c9ff3]
13/04/17 22:04:10 WARN snappy.LoadSnappy: Snappy native library is available
13/04/17 22:04:10 INFO util.NativeCodeLoader: Loaded the native-hadoop library
13/04/17 22:04:10 INFO snappy.LoadSnappy: Snappy native library loaded
13/04/17 22:04:11 INFO mapred.JobClient: Running job: job_201304172049_0008
13/04/17 22:04:12 INFO mapred.JobClient:  map 0% reduce 0%
13/04/17 22:04:32 INFO ipc.Client: Retrying connect to server: 
hor1n02.gq1.ygridcore.net/68.142.244.21:50300. Already tried 0 time(s); retry 
policy is MultipleLinearRandomRetry[6x10000ms, 10x60000ms]
java.io.IOException: The job appears to have been removed.
at 
org.apache.hadoop.mapred.JobClient$NetworkedJob.updateStatus(JobClient.java:241)
at 
org.apache.hadoop.mapred.JobClient$NetworkedJob.isComplete(JobClient.java:321)
at org.apache.hadoop.mapred.JobClient.monitorAndPrintJob(JobClient.java:1382)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:583)
at org.apache.hadoop.examples.WordCount.main(WordCount.java:82)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at 
org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68)
at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139)
at org.apache.hadoop.examples.ExampleDriver.main(ExampleDriver.java:64)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:160)
{code}


Jobtracker logs

{code}
2013-04-17 22:04:11,305 INFO org.apache.hadoop.mapred.JobInProgress: 
job_201304172049_0008: nMaps=180 nReduces=1 max=-1
2013-04-17 22:04:11,357 INFO org.apache.hadoop.mapred.JobQueuesManager: Job 
job_201304172049_0008 submitted to queue default
2013-04-17 22:04:11,357 INFO org.apache.hadoop.mapred.JobTracker: Job 
job_201304172049_0008 added successfully for user 'hrt_qa' to queue 'default'
2013-04-17 22:04:14,268 INFO org.apache.hadoop.mapred.JobInitializationPoller: 
Passing to Initializer Job Id :job_201304172049_0008 User: user Queue : default
2013-04-17 22:04:15,181 INFO org.apache.hadoop.mapred.JobTracker: SHUTDOWN_MSG:
/************************************************************
SHUTDOWN_MSG: Shutting down JobTracker at 
************************************************************/
2013-04-17 22:04:15,204 INFO org.apache.hadoop.mapred.JobInitializationPoller: 
Initializing job : job_201304172049_0008 in Queue default For user : user
2013-04-17 22:04:15,204 INFO org.apache.hadoop.mapred.JobTracker: Initializing 
job_201304172049_0008
2013-04-17 22:04:15,204 INFO org.apache.hadoop.mapred.JobInProgress: 
Initializing job_201304172049_0008
2013-04-17 22:04:30,900 INFO org.apache.hadoop.mapred.JobTracker: STARTUP_MSG:
/************************************************************
STARTUP_MSG: Starting JobTracker
STARTUP_MSG:   host = 
STARTUP_MSG:   args = []
STARTUP_MSG:   version = 
STARTUP_MSG:   build = 
STARTUP_MSG:   java = 1.6.0_31
************************************************************/
...
...
...
2013-04-17 22:04:31,862 WARN org.apache.hadoop.mapred.JobTracker: Job 
job_201304172049_0008 does not have valid info/token file so ignoring for 
recovery
{code}
                  
> Job recovery fails if job tracker is restarted after the job is submitted but 
> before its initialized
> ----------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-5169
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5169
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 1.2.0
>            Reporter: Arpit Gupta
>
> This was noticed when within 5 seconds of submitting a word count job, the 
> job tracker was restarted. Upon restart the job failed to recover

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to