On 6/24/07, Emmanuel JOKE <[EMAIL PROTECTED]> wrote: > Thank you so much. > > Now I will recompile and restart a complete crawl.
btw, you don't have to restart a complete crawl. You can just reindex (which will work, but you will have slightly inconsistent data), or reparse(which should work fine) then reindex. > > > On 6/24/07, Emmanuel JOKE <[EMAIL PROTECTED]> wrote: > >> Hi Guys, > >> > >> Ive started a crawl and everything was working fine until it try to > >> index > >> the data. I had a NPE as shown below: > >> 2007-06-24 05:05:39,453 WARN mapred.LocalJobRunner - job_agncso > >> java.lang.NullPointerException: value cannot be null > >> at org.apache.lucene.document.Field.<init>(Field.java:195) > >> at org.apache.lucene.document.Field.<init>(Field.java:171) > >> at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:198) > >> at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:326) > >> at org.apache.hadoop.mapred.LocalJobRunner$Job.run( > >> LocalJobRunner.java:155) > >> 2007-06-24 05:05:39,592 FATAL indexer.Indexer - Indexer: > >> java.io.IOException: > >> Job failed! > >> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604) > >> at org.apache.nutch.indexer.Indexer.index(Indexer.java:275) > >> at org.apache.nutch.indexer.Indexer.run(Indexer.java:297) > >> at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189) > >> at org.apache.nutch.indexer.Indexer.main(Indexer.java:280) > >> > >> I don't understand what is the pb. Do you have any idea ? > > > > I just committed a fix for this one(NUTCH-504, rev 550196). See discussion > > here: > > > > http://www.nabble.com/Indexing-problems-in-nutch-nightly-tf3923427.html > > > > for why the problem occurs. > > > >> > >> Conf: > >> 1 single machine > >> Linux 2.6, Java 1.6 > >> nutch nigthly + hadoop 0.12.3 > >> > >> Thanks in advance for ur help > >> > > > > > > -- > > DoÄŸacan Güney > > > -- Doğacan Güney ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
