We were discussing this a few days ago, though I don't think we came up with any solution. Dedup seems to be having problems:

bin/nutch solrdedup  http://solr3:8983/solr/core8

results in this:

2012-01-27 17:19:13,667 INFO solr.SolrDeleteDuplicates - SolrDeleteDuplicates: starting at 2012-01-27 17:19:13
2012-01-27 17:19:13,667 INFO solr.SolrDeleteDuplicates - SolrDeleteDuplicates: Solr url: http://solr3:8983/solr/core8
2012-01-27 17:19:14,397 WARN util.NativeCodeLoader - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2012-01-27 17:19:15,402 WARN mapred.FileOutputCommitter - Output path is null in cleanup
2012-01-27 17:19:15,403 WARN  mapred.LocalJobRunner - job_local_0001
java.lang.NullPointerException
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrRecord.readSolrDocument(SolrDeleteDuplicates.java:131)
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat$1.next(SolrDeleteDuplicates.java:271)
        at org.apache.nutch.indexer.solr.SolrDeleteDuplicates$SolrInputFormat$1.next(SolrDeleteDuplicates.java:241)
        at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.moveToNext(MapTask.java:236)
        at org.apache.hadoop.mapred.MapTask$TrackedRecordReader.next(MapTask.java:216)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:48)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:436)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)


I don't know which output path it is complaining about. Does anybody know?

--
Kaveh Minooie

www.plutoz.com
