[ https://issues.apache.org/jira/browse/HADOOP-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated HADOOP-1077:
----------------------------------
Status: Patch Available (was: Open)
> Race condition in fetching map outputs (might lead to hung reduces)
> -------------------------------------------------------------------
>
> Key: HADOOP-1077
> URL: https://issues.apache.org/jira/browse/HADOOP-1077
> Project: Hadoop
> Issue Type: Bug
> Components: mapred
> Reporter: Devaraj Das
> Assigned To: Devaraj Das
> Priority: Blocker
> Fix For: 0.12.1
>
> Attachments: 1077.2.patch, 1077.patch
>
>
> Sometimes a map task is lost while its map output is being fetched from
> that task's TaskTracker (TT), and the lost map has since executed
> successfully on some other node. In that case the event for the successful
> execution can be lost at the fetching TT. The fetching TT eventually fails to
> fetch the output of the lost task, but because the event for the new run of
> the map was also lost, it has no record of the new run and might hang.
> This "hung" problem was discovered while working on HADOOP-1060.
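
A minimal sketch of the race described above, for illustration only. The issue
text does not say how the event gets lost, so this model assumes one possible
cause: a completion event is ignored while a fetch for the same map is already
in flight. The class FetchRaceSketch and all of its methods and host names are
hypothetical and are not Hadoop's actual classes or the attached patch.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical model of the fetching side's bookkeeping; not Hadoop code.
class FetchRaceSketch {
    // mapId -> host currently believed to hold that map's output
    private final Map<Integer, String> knownLocations = new HashMap<>();
    // mapIds that have a fetch in flight
    private final Set<Integer> inFlight = new HashSet<>();
    // false models the assumed bug; true models keeping late events
    private final boolean keepEventsDuringFetch;

    FetchRaceSketch(boolean keepEventsDuringFetch) {
        this.keepEventsDuringFetch = keepEventsDuringFetch;
    }

    // A "map completed" event arrives for mapId at the given host.
    void onMapCompleted(int mapId, String host) {
        if (inFlight.contains(mapId) && !keepEventsDuringFetch) {
            return; // assumed bug: event dropped because a fetch is running
        }
        knownLocations.put(mapId, host);
    }

    void startFetch(int mapId) {
        inFlight.add(mapId);
    }

    // The in-flight fetch from failedHost has failed.
    void onFetchFailed(int mapId, String failedHost) {
        inFlight.remove(mapId);
        // Forget the location only if it is still the host that failed.
        knownLocations.remove(mapId, failedHost);
    }

    // With no known location, the fetcher has nothing left to try.
    boolean canMakeProgress(int mapId) {
        return knownLocations.containsKey(mapId);
    }

    // Replays the sequence from the description for a single map.
    static boolean simulate(boolean keepEventsDuringFetch) {
        FetchRaceSketch fetcher = new FetchRaceSketch(keepEventsDuringFetch);
        fetcher.onMapCompleted(0, "tt-a"); // first run of the map
        fetcher.startFetch(0);             // fetch from tt-a begins
        fetcher.onMapCompleted(0, "tt-b"); // map lost, re-run succeeds on tt-b
        fetcher.onFetchFailed(0, "tt-a");  // fetch from the lost task fails
        return fetcher.canMakeProgress(0);
    }

    public static void main(String[] args) {
        System.out.println("events dropped: can progress = " + simulate(false));
        System.out.println("events kept:    can progress = " + simulate(true));
    }
}
```

When the late event is dropped, the fetcher ends up with no location for the
map and simulate(false) returns false, which corresponds to the hang. When the
event is kept, simulate(true) returns true because the location of the new run
survives the failed fetch.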