Hi there,

I am trying to index my local file system with nutch. After crawling through several hundred files, I get the following exception. does any one know why should this happen?

I am using a version (1.0-dev) of nightly-build downloaded on 26th-feb-2009.

Aborting with 10 hung threads.
[exec] [java] fetch of file:/home/data/work/2008-02-08_LODCH_chat-minutes.pdf failed with: java.lang.NullPointerException [exec] [java] fetch of file:/home/data/work/papers/2007 - Takayama - Collaborative Activity Human Workflow in eResearch.pdf failed with: java.lang.NullPointerException [exec] [java] fetch of file:/home/data/work/proceedings/ciravegna.pdf failed with: java.lang.NullPointerException
    [exec]      [java] java.lang.NullPointerException
    [exec]      [java] java.lang.NullPointerException
    [exec]      [java] java.lang.NullPointerException
    [exec]      [java] at java.lang.System.arraycopy(Native Method)
    [exec]      [java] at java.lang.System.arraycopy(Native Method)
[exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:812) [exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:729) [exec] [java] at java.io.DataOutputStream.writeByte(DataOutputStream.java:136) [exec] [java] at org.apache.hadoop.io.WritableUtils.writeVLong(WritableUtils.java:290) [exec] [java] at org.apache.hadoop.io.WritableUtils.writeVInt(WritableUtils.java:270)
    [exec]      [java] at org.apache.hadoop.io.Text.write(Text.java:281)
[exec] [java] at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90) [exec] [java] at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77) [exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:595) [exec] [java] at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:357) [exec] [java] at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:249)
    [exec]      [java] fetcher caught:java.lang.NullPointerException
[exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:812) [exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:729) [exec] [java] at java.io.DataOutputStream.writeByte(DataOutputStream.java:136) [exec] [java] at org.apache.hadoop.io.WritableUtils.writeVLong(WritableUtils.java:306) [exec] [java] at org.apache.hadoop.io.WritableUtils.writeVInt(WritableUtils.java:270)
    [exec]      [java] at org.apache.hadoop.io.Text.write(Text.java:281)
[exec] [java] at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90) [exec] [java] at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77) [exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:595) [exec] [java] at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:357) [exec] [java] at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:249)
    [exec]      [java] fetcher caught:java.lang.NullPointerException
    [exec]      [java] at java.lang.System.arraycopy(Native Method)
[exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:812) [exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer$Buffer.write(MapTask.java:729) [exec] [java] at java.io.DataOutputStream.writeByte(DataOutputStream.java:136) [exec] [java] at org.apache.hadoop.io.WritableUtils.writeVLong(WritableUtils.java:290) [exec] [java] at org.apache.hadoop.io.WritableUtils.writeVInt(WritableUtils.java:270)
    [exec]      [java] at org.apache.hadoop.io.Text.write(Text.java:281)
[exec] [java] at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:90) [exec] [java] at org.apache.hadoop.io.serializer.WritableSerialization$WritableSerializer.serialize(WritableSerialization.java:77) [exec] [java] at org.apache.hadoop.mapred.MapTask$MapOutputBuffer.collect(MapTask.java:595) [exec] [java] at org.apache.nutch.fetcher.Fetcher$FetcherThread.output(Fetcher.java:357) [exec] [java] at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:249)
    [exec]      [java] fetcher caught:java.lang.NullPointerException
    [exec]      [java] java.io.IOException: Job failed!
[exec] [java] at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1217) [exec] [java] at org.apache.nutch.fetcher.Fetcher.fetch(Fetcher.java:531) [exec] [java] at org.apache.nutch.crawl.Crawl.main(Crawl.java:121)
    [exec]      [java] Java Result: 1

Many thanks,
Niraj

Reply via email to