Hi again,
Still there is some problem.
I was able to crawl 2 cycles without any issues but an exception arose
during the third crawl.
Optimizing index.
merging segments _8i (1 docs) into _8j (1 docs)
merging segments _1e (50 docs) _2t (50 docs) _48 (50 docs) _5n (50 docs) _72
(50 docs) _8h (50 docs) _8j (1 docs) into _8k (301 docs)
Indexer: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:399)
at org.apache.nutch.indexer.Indexer.index(Indexer.java:297)
at org.apache.nutch.indexer.Indexer.run(Indexer.java:319)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.indexer.Indexer.main(Indexer.java:302)
:-(
Lukas
On 1/11/07, Lukas Vlcek <[EMAIL PROTECTED]> wrote:
You are right Andrzej!
I really appreciate your response.
Regards,
Lukas
On 1/11/07, Andrzej Bialecki <[EMAIL PROTECTED] > wrote:
>
> Sean Dean wrote:
> > I know that a bug was reported on that version of trunk (the latest)
> and a patch is currently in the works. If that isn't the cause of this, then
> the fact that your getting checksum errors signals your missing part of the
> file or corruption has occurred.
> >
> >
>
> This may also indicate that mapred.speculative.execution is true in your
> config - make sure it's explicitly set to false.
>
> --
> Best regards,
> Andrzej Bialecki <><
> ___. ___ ___ ___ _ _ __________________________________
> [__ || __|__/|__||\/| Information Retrieval, Semantic Web
> ___|||__|| \| || | Embedded Unix, System Integration
> http://www.sigram.com Contact: info at sigram dot com
>
>
>
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general