On 6/24/07, Emmanuel JOKE <[EMAIL PROTECTED]> wrote:
> Thank you so much.
>
> Now I will recompile and restart a complete crawl.

btw, you don't have to restart a complete crawl. You can just reindex
(which will work, but you will have slightly inconsistent data), or
reparse(which should work fine) then reindex.

>
> > On 6/24/07, Emmanuel JOKE <[EMAIL PROTECTED]> wrote:
> >> Hi Guys,
> >>
> >> Ive started a crawl and everything was working fine until it try to
> >> index
> >> the data. I had a NPE as shown below:
> >> 2007-06-24 05:05:39,453 WARN  mapred.LocalJobRunner - job_agncso
> >> java.lang.NullPointerException: value cannot be null
> >>         at org.apache.lucene.document.Field.<init>(Field.java:195)
> >>         at org.apache.lucene.document.Field.<init>(Field.java:171)
> >>         at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:198)
> >>         at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:326)
> >>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(
> >> LocalJobRunner.java:155)
> >> 2007-06-24 05:05:39,592 FATAL indexer.Indexer - Indexer:
> >> java.io.IOException:
> >> Job failed!
> >>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:604)
> >>         at org.apache.nutch.indexer.Indexer.index(Indexer.java:275)
> >>         at org.apache.nutch.indexer.Indexer.run(Indexer.java:297)
> >>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
> >>         at org.apache.nutch.indexer.Indexer.main(Indexer.java:280)
> >>
> >> I don't understand what is the pb. Do you have any idea ?
> >
> > I just committed a fix for this one(NUTCH-504, rev 550196). See discussion
> > here:
> >
> > http://www.nabble.com/Indexing-problems-in-nutch-nightly-tf3923427.html
> >
> > for why the problem occurs.
> >
> >>
> >> Conf:
> >> 1 single machine
> >> Linux 2.6, Java 1.6
> >> nutch nigthly + hadoop 0.12.3
> >>
> >> Thanks in advance for ur help
> >>
> >
> >
> > --
> > Doğacan Güney
> >
>


-- 
Doğacan Güney
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to