[ http://issues.apache.org/jira/browse/NUTCH-284?page=comments#action_12413240 ]
Stefan Neufeind commented on NUTCH-284:
---------------------------------------

Yes, I was missing index-basic. My apologies. I needed the extra fields of index-more and thought it would index the basic fields as well. The same thing occurred in NUTCH-51.

Would it be possible to require that index-basic is loaded (much like "well, you need a scoring-plugin" etc.)? If somebody writes his own index-basic2 plugin, he would have to be able to put a "provides index-basic" into his plugin to declare that it indexes the basic fields.

Maybe something like this could save people like me some trouble and searching :-)

> NullPointerException during index
> ---------------------------------
>
>          Key: NUTCH-284
>          URL: http://issues.apache.org/jira/browse/NUTCH-284
>      Project: Nutch
>         Type: Bug
>   Components: indexer
>     Versions: 0.8-dev
>     Reporter: Stefan Neufeind
>
> For quite a few this "reduce > sort" has been going on. Then it fails. What
> could be wrong with this?
> 060524 212613 reduce > sort
> 060524 212614 reduce > sort
> 060524 212615 reduce > sort
> 060524 212615 found resource common-terms.utf8 at file:/home/mm/nutch-nightly-prod/conf/common-terms.utf8
> 060524 212615 found resource common-terms.utf8 at file:/home/mm/nutch-nightly-prod/conf/common-terms.utf8
> 060524 212619 Optimizing index.
> 060524 212619 job_jlbhhm
> java.lang.NullPointerException
>     at org.apache.nutch.indexer.Indexer$OutputFormat$1.write(Indexer.java:111)
>     at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:269)
>     at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:253)
>     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:282)
>     at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:114)
> Exception in thread "main" java.io.IOException: Job failed!
>     at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:341)
>     at org.apache.nutch.indexer.Indexer.index(Indexer.java:287)
>     at org.apache.nutch.indexer.Indexer.main(Indexer.java:304)
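
To make the "provides index-basic" suggestion from the comment more concrete, here is a rough sketch of what such a startup check might look like. Everything in it is hypothetical: the class RequiredCapabilityCheck, the PluginInfo descriptor and the missingCapabilities method are made up for illustration and are not part of Nutch, and the real plugin system may be better served by a different mechanism. The idea is only that each enabled plugin counts as providing its own id plus anything it declares, and the indexer refuses to start if a required capability is not covered.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch only: none of these names exist in Nutch.
class RequiredCapabilityCheck {

    /** Illustrative plugin descriptor: its id plus whatever it declares as "provides". */
    static final class PluginInfo {
        final String id;
        final Set<String> provides;

        PluginInfo(String id, Set<String> provides) {
            this.id = id;
            this.provides = provides;
        }
    }

    /** Returns the required capabilities that no enabled plugin offers, in the order given. */
    static List<String> missingCapabilities(Collection<PluginInfo> enabled,
                                            Collection<String> required) {
        Set<String> offered = new HashSet<>();
        for (PluginInfo plugin : enabled) {
            offered.add(plugin.id);           // a plugin always counts as providing its own id
            offered.addAll(plugin.provides);  // plus anything it explicitly declares
        }
        List<String> missing = new ArrayList<>();
        for (String capability : required) {
            if (!offered.contains(capability)) {
                missing.add(capability);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        List<String> required = Collections.singletonList("index-basic");

        // Only index-more enabled: the situation described in this issue.
        List<PluginInfo> onlyMore = Arrays.asList(
                new PluginInfo("index-more", Collections.<String>emptySet()));
        System.out.println("missing: " + missingCapabilities(onlyMore, required));

        // A custom plugin that declares "provides index-basic" satisfies the check.
        List<PluginInfo> custom = Arrays.asList(
                new PluginInfo("index-basic2", Collections.singleton("index-basic")),
                new PluginInfo("index-more", Collections.<String>emptySet()));
        System.out.println("missing: " + missingCapabilities(custom, required));
    }
}
```

With a check along these lines, a configuration that enables index-more without index-basic could fail early with a readable message naming the missing capability, instead of surfacing later as a NullPointerException in the reduce step.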
