Hi,

I'm doing some fairly gutsy playing around with Nutch to get a
multi-index app working nicely and I'm starting to get the following
error.

WARN crawl.MapWritable: Unable to load meta data entry, ignoring.. :
java.io.IOException: unable to load class for id: 36

Which, although only being reported as an error, seems to prevent any
documents apearing in the Lucene index (when I search through nutch or
examine directly with luke). Modding nutch to show the full stack
trace give me this.

07/07/23 12:41:11 WARN crawl.MapWritable: Unable to load meta data
entry, ignoring.. :
java.io.IOException: unable to load class for id: 36
        at org.apache.nutch.crawl.MapWritable.getClass(MapWritable.java:463)
        at 
org.apache.nutch.crawl.MapWritable.getKeyValueEntry(MapWritable.java:439)
        at org.apache.nutch.crawl.MapWritable.readFields(MapWritable.java:354)
        at org.apache.nutch.crawl.CrawlDatum.readFields(CrawlDatum.java:239)
        at 
org.apache.nutch.crawl.Generator$SelectorEntry.readFields(Generator.java:70)
        at 
org.apache.hadoop.mapred.ReduceTask$ValuesIterator.getNext(ReduceTask.java:188)
        at 
org.apache.hadoop.mapred.ReduceTask$ValuesIterator.next(ReduceTask.java:149)
        at org.apache.nutch.crawl.Generator$Selector.reduce(Generator.java:182)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:313)
        at 
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:155)

Can someone who has a better understandin of the crawl internals shed
some light on this for me please?

Many Thanks
Rob

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >>  http://get.splunk.com/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to