Hi everyone,
I am setting up data analytics, and I got the following error while
trying to create the category-based splits of the Wikipedia dataset:
13/03/01 19:31:39 INFO mapred.JobClient: Task Id :
attempt_201303011913_0001_r_000000_1, Status : FAILED
java.io.IOException: Task: attempt_201303011913_0001_r_000000_1 - The
reduce copier failed
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:380)
at org.apache.hadoop.mapred.Child.main(Child.java:170)
Caused by: org.apache.hadoop.util.DiskChecker$DiskErrorException: Could
not find any valid local directory for
taskTracker/jobcache/job_201303011913_0001/attempt_201303011913_0001_r_000000_1/output/map_159.out
at
org.apache.hadoop.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:343)
at
org.apache.hadoop.fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)
at
org.apache.hadoop.mapred.MapOutputFile.getInputFileForWrite(MapOutputFile.java:160)
at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$InMemFSMergeThread.doInMemMerge(ReduceTask.java:2537)
at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$InMemFSMergeThread.run(ReduceTask.java:2501)
13/03/01 19:31:40 INFO mapred.JobClient: map 61% reduce 10%
13/03/01 19:31:49 INFO mapred.JobClient: map 62% reduce 10%
13/03/01 19:31:52 INFO mapred.JobClient: map 62% reduce 12%
13/03/01 19:31:55 INFO mapred.JobClient: map 62% reduce 13%
13/03/01 19:32:05 INFO mapred.JobClient: map 62% reduce 15%
13/03/01 19:32:08 INFO mapred.JobClient: map 63% reduce 15%
13/03/01 19:32:17 INFO mapred.JobClient: map 63% reduce 10%
13/03/01 19:32:19 INFO mapred.JobClient: Task Id :
attempt_201303011913_0001_r_000000_2, Status : FAILED
Error: java.io.IOException: No space left on device
at java.io.FileOutputStream.writeBytes(Native Method)
at java.io.FileOutputStream.write(FileOutputStream.java:297)
at
org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.write(RawLocalFileSystem.java:190)
at java.io.BufferedOutputStream.write(BufferedOutputStream.java:122)
at
org.apache.hadoop.fs.FSDataOutputStream$PositionCache.write(FSDataOutputStream.java:49)
at java.io.DataOutputStream.write(DataOutputStream.java:107)
at
org.apache.hadoop.mapred.IFileOutputStream.write(IFileOutputStream.java:84)
at
org.apache.hadoop.fs.FSDataOutputStream$PositionCache.write(FSDataOutputStream.java:49)
at java.io.DataOutputStream.write(DataOutputStream.java:107)
at org.apache.hadoop.mapred.IFile$Writer.append(IFile.java:218)
at org.apache.hadoop.mapred.Merger.writeFile(Merger.java:157)
at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$InMemFSMergeThread.doInMemMerge(ReduceTask.java:2560)
at
org.apache.hadoop.mapred.ReduceTask$ReduceCopier$InMemFSMergeThread.run(ReduceTask.java:2501)
I still have disk space in my home folder, and I don't have problem with
the traininginput. Do you know what could be wrong with this?
Thank you for your help.
Binh