It looks like when you write your own indexer plugin and forget to add
the field url in a doc then you get this kind of error :/
Regards,
MyD
On Apr 10, 2009, at 8:11 PM, MyD wrote:
Hi @ all,
I am using the newest trunk source code. I get every time this error
msg:
2009-04-10 20:08:23,816 INFO indexer.Indexer - Indexer: done
2009-04-10 20:08:23,817 INFO indexer.DeleteDuplicates - Dedup:
starting
2009-04-10 20:08:23,818 INFO indexer.DeleteDuplicates - Dedup:
adding indexes in: crawl.dirs/crawl.wikicfp.test/indexes
2009-04-10 20:08:23,828 WARN mapred.JobClient - Use
GenericOptionsParser for parsing the arguments. Applications should
implement Tool for the
same.
2009-04-10 20:08:24,987 WARN mapred.LocalJobRunner - job_local_0014
java.lang.NullPointerException
at org.apache.hadoop.io.Text.encode(Text.java:388)
at org.apache.hadoop.io.Text.set(Text.java:178)
at org.apache.nutch.indexer.DeleteDuplicates$InputFormat
$DDRecordReader.next(DeleteDuplicates.java:191)
at org.apache.nutch.indexer.DeleteDuplicates$InputFormat
$DDRecordReader.next(DeleteDuplicates.java:157)
at org.apache.hadoop.mapred.MapTask
$TrackedRecordReader.moveToNext(MapTask.java:192)
at org.apache.hadoop.mapred.MapTask
$TrackedRecordReader.next(MapTask.java:176)
at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
at org.apache.hadoop.mapred.LocalJobRunner
$Job.run(LocalJobRunner.java:138)
Any idea? Thanks in advance.
Regards,
MyD