[ 
https://issues.apache.org/jira/browse/HADOOP-4869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Amareshwari Sriramadasu updated HADOOP-4869:
--------------------------------------------

    Attachment: patch-4869.txt

Attaching a patch that restores the heartbeat code as it was prior to 
HADOOP-4305. 
Manually tested the patch with lost trackers that rebind to a different port. 
Also repeated the manual tests mentioned in HADOOP-4305.

Tried to write a test case for a lost tracker bouncing back, but that looks 
difficult.

> Lost Trackers may not be able to join back
> ------------------------------------------
>
>                 Key: HADOOP-4869
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4869
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: mapred
>    Affects Versions: 0.20.0
>            Reporter: Devaraj Das
>            Assignee: Amareshwari Sriramadasu
>            Priority: Blocker
>             Fix For: 0.20.0
>
>         Attachments: patch-4869.txt
>
>
> There is a bug in the heartbeat processing that shows up when TaskTrackers 
> are lost. Because of this bug, lost TTs may not be able to rejoin the JT after 
> reinitializing (and binding to an RPC port different from the previous one). 
> This bug was introduced in HADOOP-4305.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
