Hi again,

Still there is some problem.
I was able to crawl 2 cycles without any issues but an exception arose
during the third crawl.

Optimizing index.
merging segments _8i (1 docs) into _8j (1 docs)
merging segments _1e (50 docs) _2t (50 docs) _48 (50 docs) _5n (50 docs) _72
(50 docs) _8h (50 docs) _8j (1 docs) into _8k (301 docs)
Indexer: java.io.IOException: Job failed!
       at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:399)
       at org.apache.nutch.indexer.Indexer.index(Indexer.java:297)
       at org.apache.nutch.indexer.Indexer.run(Indexer.java:319)
       at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
       at org.apache.nutch.indexer.Indexer.main(Indexer.java:302)

:-(

Lukas

On 1/11/07, Lukas Vlcek <[EMAIL PROTECTED]> wrote:

You are right Andrzej!
I really appreciate your response.

Regards,
Lukas

On 1/11/07, Andrzej Bialecki <[EMAIL PROTECTED] > wrote:
>
> Sean Dean wrote:
> > I know that a bug was reported on that version of trunk (the latest)
> and a patch is currently in the works. If that isn't the cause of this, then
> the fact that your getting checksum errors signals your missing part of the
> file or corruption has occurred.
> >
> >
>
> This may also indicate that mapred.speculative.execution is true in your
> config - make sure it's explicitly set to false.
>
> --
> Best regards,
> Andrzej Bialecki     <><
> ___. ___ ___ ___ _ _   __________________________________
> [__ || __|__/|__||\/|  Information Retrieval, Semantic Web
> ___|||__||  \|  ||  |  Embedded Unix, System Integration
> http://www.sigram.com  Contact: info at sigram dot com
>
>
>

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to