Check that you are not getting disk-full errors on the nodes running the reducers.

Miles
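One way to run that check by hand is sketched below. It is only a rough illustration, not part of Hadoop: the helper name low_space_dirs and the 10 GiB threshold are made up for this example, and it assumes you pass in the directories configured under mapred.local.dir on each TaskTracker node (the temp directory used here is just a stand-in so the script runs anywhere).

```python
import shutil
import tempfile


def low_space_dirs(local_dirs, min_free_bytes):
    """Return (path, free_bytes) for every directory whose filesystem
    has fewer than min_free_bytes available."""
    low = []
    for path in local_dirs:
        # shutil.disk_usage reports total, used and free bytes for the
        # filesystem that contains `path`.
        free = shutil.disk_usage(path).free
        if free < min_free_bytes:
            low.append((path, free))
    return low


if __name__ == "__main__":
    # Assumption: replace this list with the directories from
    # mapred.local.dir on the node being checked.
    dirs = [tempfile.gettempdir()]
    threshold = 10 * 1024**3  # hypothetical threshold: 10 GiB
    for path, free in low_space_dirs(dirs, threshold):
        print(f"{path}: only {free / 1024**3:.1f} GiB free")
```

If every configured directory shows up in the output while the reduce-side merge is running, that would be consistent with the "Could not find any valid local directory" error.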
2008/9/29 Elia Mazzawi <[EMAIL PROTECTED]>:
> In more detail, my program is happily chugging along until the reducer fails
> with that exception, then it looks like it retries and fails by itself.
> The same Hadoop program works fine on a subset of the data.
> I'm rerunning on all the subsets to see if there is anything in the data
> that is causing this,
>
> but can someone explain what this error means?
> I'm running hadoop 17.0; maybe it's time to update.
>
>
> 08/09/27 05:59:01 INFO mapred.JobClient: map 96% reduce 31%
> 08/09/27 06:02:17 INFO mapred.JobClient: map 96% reduce 32%
> 08/09/27 06:24:46 INFO mapred.JobClient: map 97% reduce 32%
> 08/09/27 06:49:38 INFO mapred.JobClient: map 98% reduce 32%
> 08/09/27 07:14:12 INFO mapred.JobClient: map 99% reduce 32%
> 08/09/27 07:17:09 INFO mapred.JobClient: map 99% reduce 33%
> 08/09/27 07:37:50 INFO mapred.JobClient: map 100% reduce 33%
> 08/09/27 07:56:11 INFO mapred.JobClient: map 100% reduce 0%
> 08/09/27 07:56:11 INFO mapred.JobClient: Task Id : task_200809041356_0042_r_000000_2, Status : FAILED
> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for task_200809041356_0042_r_000000_2/intermediate.9
>     at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:313)
>     at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)
>     at org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue.merge(SequenceFile.java:2851)
>     at org.apache.hadoop.io.SequenceFile$Sorter.merge(SequenceFile.java:2586)
>     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:352)
>     at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2124)
>
> 08/09/27 07:57:10 INFO mapred.JobClient: map 100% reduce 1%
> 08/09/27 07:57:55 INFO mapred.JobClient: map 100% reduce 2%
> 08/09/27 07:58:46 INFO mapred.JobClient: map 100% reduce 3%
> 08/09/27 07:59:36 INFO mapred.JobClient: map 100% reduce 4%
> 08/09/27 08:00:26 INFO mapred.JobClient: map 100% reduce 5%
> 08/09/27 08:01:16 INFO mapred.JobClient: map 100% reduce 6%
> 08/09/27 08:02:06 INFO mapred.JobClient: map 100% reduce 7%
> 08/09/27 08:02:55 INFO mapred.JobClient: map 100% reduce 8%
> 08/09/27 08:03:45 INFO mapred.JobClient: map 100% reduce 9%
> 08/09/27 08:04:36 INFO mapred.JobClient: map 100% reduce 10%
> 08/09/27 08:05:26 INFO mapred.JobClient: map 100% reduce 11%
> 08/09/27 08:06:18 INFO mapred.JobClient: map 100% reduce 12%
> 08/09/27 08:07:09 INFO mapred.JobClient: map 100% reduce 13%
> 08/09/27 08:08:00 INFO mapred.JobClient: map 100% reduce 14%
> 08/09/27 08:08:50 INFO mapred.JobClient: map 100% reduce 15%
> 08/09/27 08:09:45 INFO mapred.JobClient: map 100% reduce 16%
> 08/09/27 08:10:31 INFO mapred.JobClient: map 100% reduce 17%
> 08/09/27 08:11:26 INFO mapred.JobClient: map 100% reduce 18%
> 08/09/27 08:12:16 INFO mapred.JobClient: map 100% reduce 19%
> 08/09/27 08:13:08 INFO mapred.JobClient: map 100% reduce 20%
> 08/09/27 08:14:02 INFO mapred.JobClient: map 100% reduce 21%
> 08/09/27 08:14:50 INFO mapred.JobClient: map 100% reduce 22%
> 08/09/27 08:15:41 INFO mapred.JobClient: map 100% reduce 23%
> 08/09/27 08:16:36 INFO mapred.JobClient: map 100% reduce 24%
> 08/09/27 08:17:26 INFO mapred.JobClient: map 100% reduce 25%
> 08/09/27 08:18:14 INFO mapred.JobClient: map 100% reduce 26%
> 08/09/27 08:19:02 INFO mapred.JobClient: map 100% reduce 27%
> 08/09/27 08:19:55 INFO mapred.JobClient: map 100% reduce 28%
> 08/09/27 08:21:12 INFO mapred.JobClient: map 100% reduce 71%
> java.io.IOException: Job failed!
>     at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1062)
>     at org.myorg.binAnalysis.main(binAnalysis.java:99)
>     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>     at java.lang.reflect.Method.invoke(Method.java:585)
>     at org.apache.hadoop.util.RunJar.main(RunJar.java:155)
>     at org.apache.hadoop.mapred.JobShell.run(JobShell.java:194)
>     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>     at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
>     at org.apache.hadoop.mapred.JobShell.main(JobShell.java:220)
>
> You have new mail in /var/spool/mail/root
>
>
> Elia Mazzawi wrote:
>>
>> What does this exception mean?
>>
>> 08/09/27 07:56:11 INFO mapred.JobClient: Task Id : task_200809041356_0042_r_000000_2, Status : FAILED
>> org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any valid local directory for task_200809041356_0042_r_000000_2/intermediate.9
>>     at org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:313)
>>     at org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)
>>     at org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue.merge(SequenceFile.java:2851)
>>     at org.apache.hadoop.io.SequenceFile$Sorter.merge(SequenceFile.java:2586)
>>     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:352)
>>     at org.apache.hadoop.mapred.TaskTracker$Child.main(TaskTracker.java:2124)
>>
>
>

--
The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.