(RESEND, MENT TO ATTACH THE COMMENT BELOW TO THIS POSTING)

Why don't we include documenting this as part of the the "map-reduce walk-through" sprint item?

-----
On reintegrating lost task trackers...

It does seem like we should do this to me, but we need to make sure we reason through how this effects corner cases, what invariants the system does maintain and so on.

I suggest we work this through, and then go forward with this patch (modified if we find any corner cases) and post the reasoning, so we can review it as this logic evolves. (And update any existing documentation in this area of course...)


On Aug 10, 2006, at 12:38 PM, Devaraj Das (JIRA) wrote:

[ http://issues.apache.org/jira/browse/HADOOP-181? page=comments#action_12427327 ]

Devaraj Das commented on HADOOP-181:
------------------------------------

Doug, does it make sense to do what is done in this patch only when speculative execution is on?

task trackers should not restart for having a late heartbeat
------------------------------------------------------------

                Key: HADOOP-181
                URL: http://issues.apache.org/jira/browse/HADOOP-181
            Project: Hadoop
         Issue Type: Bug
         Components: mapred
           Reporter: Owen O'Malley
        Assigned To: Devaraj Das
            Fix For: 0.6.0

        Attachments: lost-heartbeat.patch


TaskTrackers should not close and restart themselves for having a late heartbeat. The JobTracker should just accept their current status.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/ Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/ software/jira



Reply via email to