[
https://issues.apache.org/jira/browse/MAPREDUCE-4831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jason Lowe resolved MAPREDUCE-4831.
-----------------------------------
Resolution: Not A Problem
After discussing this at length with Bobby, we came to the conclusion that this
isn't a problem. Task commit can already occur multiple times due to things
like fetch failure, etc. The OutputCommitter is responsible for guarding
against task double-commit if that indeed is a problem.
> Task commit can occur more than once due to AM retries
> ------------------------------------------------------
>
> Key: MAPREDUCE-4831
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-4831
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: mr-am
> Affects Versions: 0.23.0, 2.0.0-alpha
> Reporter: Jason Lowe
> Priority: Critical
>
> If a task attempt begins committing but the AM crashes before the task
> attempt completes then we could end up having the task commit again when the
> AM is relaunched. The subsequent AM attempt will not see the task having
> completed, so it will re-run the task and it will commit again. The output
> committer is user code, and the task commit may not be something repeatable.
> Therefore we should treat an AM crash during a task attempt commit the same
> as we do for a commit failure by the task attempt, i.e.: the task should fail
> since we do not know how to recover from a commit failure.
> This is similar to MAPREDUCE-4819, as this involves commit at the task level
> and that involves commit at the job-level.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira