This is what I got after tiny modification of log4j.properties:
java.lang.RuntimeException: Unexpected status: 67
at org.apache.nutch.indexer.Indexer.reduce(Indexer.java:198)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:307)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(
LocalJobRunner.java:137)
Indexer: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:399)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:297)
at org.apache.nutch.indexer.Indexer.run(Indexer.java:319)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.indexer.Indexer.main(Indexer.java:302)
On 1/11/07, Lukas Vlcek <[EMAIL PROTECTED]> wrote:
Hi again,
Still there is some problem.
I was able to crawl 2 cycles without any issues but an exception arose
during the third crawl.
Optimizing index.
merging segments _8i (1 docs) into _8j (1 docs)
merging segments _1e (50 docs) _2t (50 docs) _48 (50 docs) _5n (50 docs)
_72 (50 docs) _8h (50 docs) _8j (1 docs) into _8k (301 docs)
Indexer: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob (JobClient.java:399)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:297)
at org.apache.nutch.indexer.Indexer.run(Indexer.java:319)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java :189)
at org.apache.nutch.indexer.Indexer.main(Indexer.java:302)
:-(
Lukas
On 1/11/07, Lukas Vlcek < [EMAIL PROTECTED]> wrote:
>
> You are right Andrzej!
> I really appreciate your response.
>
> Regards,
> Lukas
>
> On 1/11/07, Andrzej Bialecki < [EMAIL PROTECTED] > wrote:
> >
> > Sean Dean wrote:
> > > I know that a bug was reported on that version of trunk (the latest)
> > and a patch is currently in the works. If that isn't the cause of this, then
> > the fact that your getting checksum errors signals your missing part of the
> > file or corruption has occurred.
> > >
> > >
> >
> > This may also indicate that mapred.speculative.execution is true in
> > your
> > config - make sure it's explicitly set to false.
> >
> > --
> > Best regards,
> > Andrzej Bialecki <><
> > ___. ___ ___ ___ _ _ __________________________________
> > [__ || __|__/|__||\/| Information Retrieval, Semantic Web
> > ___|||__|| \| || | Embedded Unix, System Integration
> > http://www.sigram.com Contact: info at sigram dot com
> >
> >
> >
>
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general