I have been running 0.7.1 without problems.
Today i build a 0.8-dev version from trunk. and ported my settings from
0.7.1
When doing a crawl crawling starts, but it ends with a nullpointer
exception.
Any hints what is going wrong / how I should resolve this?
commandline used:
sh bin/nutch crawl urls -dir /nutch-0.8-dev/crawl -depth 1
Tail of the stacktrace:
060314 231802 Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
060314 231802 parsing
jar:file:/C:/nutch-0.8-dev/lib/hadoop-0.1-dev.jar!/hadoop-default.xml
060314 231803 parsing
jar:file:/C:/nutch-0.8-dev/lib/hadoop-0.1-dev.jar!/mapred-default.xml
060314 231803 parsing \tmp\hadoop\mapred\local\job_u1rb8.xml\localRunner
060314 231803 parsing
jar:file:/C:/nutch-0.8-dev/lib/hadoop-0.1-dev.jar!/mapred-default.xml
060314 231803 parsing file:/C:/nutch-0.8-dev/conf/hadoop-site.xml
060314 231803 found resource common-terms.utf8 at
file:/C:/nutch-0.8-dev/conf/common-terms.utf8
060314 231803 found resource common-terms.utf8 at
file:/C:/nutch-0.8-dev/conf/common-terms.utf8
060314 231803 Optimizing index.
java.lang.NullPointerException
at
org.apache.nutch.indexer.Indexer$OutputFormat$1.write(Indexer.java:109)
at
org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:270)
at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:242)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:283)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:106)
060314 231803 map 100% reduce 0%
Exception in thread "main" java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:310)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:275)
at org.apache.nutch.crawl.Crawl.main(Crawl.java:120)
Thanks,
Arent-Jan
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general