[
https://issues.apache.org/jira/browse/HADOOP-5280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12674587#action_12674587
]
Vinod K V commented on HADOOP-5280:
-----------------------------------
On one of the clusters, a map attempt was expired as a lost task in
ExpireLaunchingTasks thread, but it was not removed from taskidToTIPMap. All
the reducers were informed that the map has failed. In the next heartbeat the
TT came back reporting the attempt as a success, thereby preventing launch of
any new map attempts for this task.
Subsequently, all the reduces just got stalled waiting for the output from this
map task and the whole job got stock with no progress.
> When expiring a lost launched task, JT doesn't remove the attempt from the
> taskidToTIPMap.
> ------------------------------------------------------------------------------------------
>
> Key: HADOOP-5280
> URL: https://issues.apache.org/jira/browse/HADOOP-5280
> Project: Hadoop Core
> Issue Type: Bug
> Reporter: Vinod K V
>
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.