Sebastian Nagel created NUTCH-2518: -------------------------------------- Summary: Must check return value of job.waitForCompletion() Key: NUTCH-2518 URL: https://issues.apache.org/jira/browse/NUTCH-2518 Project: Nutch Issue Type: Bug Components: crawldb, fetcher, generator, hostdb, linkdb Affects Versions: 1.15 Reporter: Sebastian Nagel Fix For: 1.15
The return value of job.waitForCompletion() of the new MapReduce API (NUTCH-2375) must always be checked. If it's not true, the job has been failed or killed. Accordingly, the program - should not proceed with further jobs/steps - must clean-up temporary data, unlock CrawlDB, etc. - exit with non-zero exit value, so that scripts running the crawl workflow can handle the failure Cf. NUTCH-2076, NUTCH-2442, [NUTCH-2375 PR #221|https://github.com/apache/nutch/pull/221#issuecomment-332941883]. -- This message was sent by Atlassian JIRA (v7.6.3#76005)