I gave it a next try this night and I still have troubles. This is the very end of my log (full version is attached) and you can see another nasty exception:
... 060104 213644 map 100% 060104 213645 Optimizing index. java.lang.NullPointerException: value cannot be null at org.apache.lucene.document.Field.<init>(Field.java:469) at org.apache.lucene.document.Field.<init>(Field.java:412) at org.apache.lucene.document.Field.UnIndexed(Field.java:195) at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:199) at org.apache.nutch.mapred.ReduceTask.run(ReduceTask.java:260) at org.apache.nutch.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:90) Exception in thread "main" java.io.IOException: Job failed! at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:308) at org.apache.nutch.indexer.Indexer.index(Indexer.java:259) at org.apache.nutch.crawl.Crawl.main(Crawl.java:121) I tried to turn off most of parsing pluggins but it didn't help so there is probably some general issue. Any ideas? Regards, Lukas On 1/4/06, Lukas Vlcek <[EMAIL PROTECTED]> wrote: > Thanks guys! > I really didn't have the latest copy... > L. > On 1/4/06, Byron Miller <[EMAIL PROTECTED]> wrote: > > Fixed in the copy i run as i've been able to get my > > 100k pages indexed without getting that error. > > > > -byron > > > > --- Andrzej Bialecki <[EMAIL PROTECTED]> wrote: > > > > > Lukas Vlcek wrote: > > > > > > >Hi, > > > > > > > >I am trying to use the latest nutch-trunk version > > > but I am facing > > > >unexpected "Job failed!" exception. It seems that > > > all crawling work > > > >has been already done but some threads are hunged > > > which results into > > > >exception after some timeout. > > > > > > > > > > > > > > > > > > This was fixed (or should be fixed :) in the > > > revision r365576. Please > > > report if it doesn't fix it for you. > > > > > > -- > > > Best regards, > > > Andrzej Bialecki <>< > > > ___. ___ ___ ___ _ _ > > > __________________________________ > > > [__ || __|__/|__||\/| Information Retrieval, > > > Semantic Web > > > ___|||__|| \| || | Embedded Unix, System > > > Integration > > > http://www.sigram.com Contact: info at sigram dot > > > com > > > > > > > > > > > > > >