Hey Lance,

Did you see any error messages in the JobTracker logs around the time this started? I think I understand how this might happen.

Thanks,
-Todd

On Thu, Apr 30, 2009 at 10:45 AM, Lance Riedel <la...@dotspots.com> wrote:

> I have not been able to reproduce. We are using version 19.1 with the
> following patches:
> 4780-2v19.patch (Jira 4780)
> closeAll3.patch (Jira 3998)
>
> Thanks,
> Lance
>
> On Apr 30, 2009, at 10:40 AM, Todd Lipcon wrote:
>
>> Hi Lance,
>>
>> Can I ask what version you were running when you saw this? Is it
>> reproducible? I'm trying to look at the code path that might produce such a
>> behavior and want to make sure I'm looking at the right version.
>>
>> Thanks
>> -Todd
>>
>> On Thu, Apr 30, 2009 at 9:33 AM, Lance Riedel <la...@dotspots.com> wrote:
>>
>>> Has anyone seen this before? Our task tracker produced a 2.7 gig log file
>>> in a few hours. The entry is all the same (every 2 ms):
>>>
>>> 2009-04-30 02:34:40,207 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,398 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,403 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,411 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,414 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,417 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,420 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> ... (And on and on and on...)
>>>
>>> These are the few lines before it started:
>>>
>>> 2009-04-30 02:34:29,780 INFO org.apache.hadoop.mapred.TaskTracker.clienttrace: src: xxx.xxx.xxx.xxx:50060, dest: 10.253.178.95:40268, bytes: 3341324, op: MAPRED_SHUFFLE, cliID: attempt_200904291917_0352_m_000006_0
>>> 2009-04-30 02:34:31,522 INFO org.apache.hadoop.mapred.TaskTracker: Sent out 418891 bytes for reduce: 12 from map: attempt_200904291917_0352_m_000007_0 given 418891/418887 from 4301462 with (22, 171)
>>> 2009-04-30 02:34:31,522 INFO org.apache.hadoop.mapred.TaskTracker.clienttrace: src: xxx.xxx.xxx.xxx:50060, dest: xxx.xxx.xxx.xxx:40268, bytes: 418891, op: MAPRED_SHUFFLE, cliID: attempt_200904291917_0352_m_000007_0
>>> 2009-04-30 02:34:35,382 INFO org.apache.hadoop.mapred.TaskTracker: attempt_200904291917_0352_r_000003_0 0.3030303% reduce > copy (10 of 11 at 0.32 MB/s) >
>>> 2009-04-30 02:34:38,385 INFO org.apache.hadoop.mapred.TaskTracker: attempt_200904291917_0352_r_000003_0 0.3030303% reduce > copy (10 of 11 at 0.32 MB/s) >
>>> 2009-04-30 02:34:40,207 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,398 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,403 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,411 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,414 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,417 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>> 2009-04-30 02:34:40,420 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'ec2-xx-xx-xx-xx.compute-1.amazonaws.com' with reponseId '5341
>>>
>>> --And on for 2+ gigs