[ https://issues.apache.org/jira/browse/NUTCH-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16415426#comment-16415426 ]
Omkar Reddy commented on NUTCH-2518:
------------------------------------

I might have missed this ticket.

Hi [~wastl-nagel], this was not covered in my PR for [NUTCH-2375|https://github.com/apache/nutch/pull/221].

[~kpm1985], [~wastl-nagel], [~lewismc] I see there is a PR with just a minor change for this issue. I can take it up if it is not a problem. Please let me know either way. Thanks.

> Must check return value of job.waitForCompletion()
> --------------------------------------------------
>
>                 Key: NUTCH-2518
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2518
>             Project: Nutch
>          Issue Type: Bug
>          Components: crawldb, fetcher, generator, hostdb, linkdb
>    Affects Versions: 1.15
>            Reporter: Sebastian Nagel
>            Assignee: Kenneth McFarland
>            Priority: Blocker
>             Fix For: 1.15
>
>
> The return value of job.waitForCompletion() of the new MapReduce API
> (NUTCH-2375) must always be checked. If it is not true, the job has
> failed or been killed. Accordingly, the program
> - should not proceed with further jobs/steps
> - must clean up temporary data, unlock the CrawlDB, etc.
> - must exit with a non-zero exit value, so that scripts running the crawl
>   workflow can handle the failure
> Cf. NUTCH-2076, NUTCH-2442, [NUTCH-2375 PR
> #221|https://github.com/apache/nutch/pull/221#issuecomment-332941883].
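
For illustration, the handling asked for in the issue description could look roughly like the sketch below. This is not the Nutch patch. `WaitableJob` is a hypothetical stand-in for `org.apache.hadoop.mapreduce.Job`, reduced to the one method that matters so the example compiles without Hadoop on the classpath. `runStep`, `runWorkflow`, and the cleanup callback are assumed names as well. In real code the cleanup would remove temporary output and release the CrawlDB lock.

```java
import java.util.List;

public class JobResultCheck {

    /** Hypothetical stand-in for org.apache.hadoop.mapreduce.Job (assumption, not the Hadoop class). */
    public interface WaitableJob {
        boolean waitForCompletion(boolean verbose) throws Exception;
    }

    /**
     * Runs one job and checks the boolean result of waitForCompletion().
     * Returns 0 on success; on failure runs the cleanup callback and returns 1.
     */
    public static int runStep(String name, WaitableJob job, Runnable cleanup) {
        boolean success;
        try {
            success = job.waitForCompletion(true);
        } catch (Exception e) {
            // Treat an exception like a failed job so cleanup still happens.
            System.err.println(name + " job threw: " + e);
            success = false;
        }
        if (!success) {
            System.err.println(name + " job did not succeed");
            cleanup.run(); // e.g. delete temporary data, unlock the CrawlDB
            return 1;
        }
        return 0;
    }

    /** Runs the steps in order and stops at the first failure. */
    public static int runWorkflow(List<WaitableJob> jobs, Runnable cleanup) {
        for (int i = 0; i < jobs.size(); i++) {
            int rc = runStep("step-" + i, jobs.get(i), cleanup);
            if (rc != 0) {
                return rc; // do not proceed with further jobs/steps
            }
        }
        return 0;
    }

    public static void main(String[] args) {
        WaitableJob ok = verbose -> true;
        WaitableJob failing = verbose -> false;
        int rc = runWorkflow(List.of(ok, failing, ok),
                () -> System.err.println("cleaning up"));
        // A real tool would pass this value to System.exit() so that
        // the calling crawl script can react to the failure.
        System.out.println("exit code: " + rc);
    }
}
```

A tool's `main` would then hand the returned value to `System.exit()`, so a shell script driving the crawl can stop on a non-zero status.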