On Mar 10, 2011, at 1:44pm, bhawna singh wrote:

Hi All,
I am crawling a URL list of 300K and after fetching around 200K I see
IOException: Spill Failed error.
Below is the stack trace.

Would anyone have some insight as to what am I running into and how I can
overcome this issue.

I believe that can happen when you run out of free local disk space to use during the shuffle phrase of a Hadoop job.

-- Ken


Thanks in advance,
Bhawna

Stack Trace:

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - java.io.IOException: Spill
failed

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.hadoop.mapred.MapTask $MapOutputBuffer.collect(MapTask.java:860)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.hadoop.mapred.MapTask $OldOutputCollector.collect(MapTask.java:466)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java: 899)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:647)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - Caused by:
org.apache.hadoop.util.DiskChecker$DiskErrorException: Could not find any
valid local directory for
taskTracker/jobcache/job_local_0001/attempt_local_0001_m_000000_0/ output/spill26.out

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.hadoop.fs.LocalDirAllocator $AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:343)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org .apache .hadoop .fs.LocalDirAllocator.getLocalPathForWrite(LocalDirAllocator.java:124)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org .apache .hadoop.mapred.MapOutputFile.getSpillFileForWrite(MapOutputFile.java: 107)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.hadoop.mapred.MapTask $MapOutputBuffer.sortAndSpill(MapTask.java:1221)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.access $1800(MapTask.java:686)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - at
org.apache.hadoop.mapred.MapTask$MapOutputBuffer $SpillThread.run(MapTask.java:1173)

2011-03-09 23:59:59,752 ERROR fetcher.Fetcher - fetcher
caught:java.io.IOException: Spill failed

--------------------------
Ken Krugler
+1 530-210-6378
http://bixolabs.com
e l a s t i c   w e b   m i n i n g





Reply via email to