[ 
https://issues.apache.org/jira/browse/HADOOP-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Arun C Murthy updated HADOOP-1077:
----------------------------------

    Status: Patch Available  (was: Open)

> Race condition in fetching map outputs (might lead to hung reduces)
> -------------------------------------------------------------------
>
>                 Key: HADOOP-1077
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1077
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Devaraj Das
>         Assigned To: Devaraj Das
>            Priority: Blocker
>             Fix For: 0.12.1
>
>         Attachments: 1077.2.patch, 1077.patch
>
>
> Sometimes a map task is lost while its output is being fetched from the
> TaskTracker (TT) that ran it. If the lost map has already re-executed
> successfully on another node, the event for that successful execution can be
> lost at the fetching TT. The fetching TT might eventually fail to fetch the
> output of the lost task, but because the event for the new run of the map
> might also have been lost, the fetching TT might hang.
> This "hung" problem was discovered while working on HADOOP-1060.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
