MapEventFetcherThread should not iterate over jobs that are not localized
-------------------------------------------------------------------------

                 Key: MAPREDUCE-1895
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1895
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: tasktracker
            Reporter: Amareshwari Sriramadasu


We have seen trackers being marked lost on our clusters because of the
following chain of lock waits:
- TaskLauncher holds the lock on a TaskTracker$RunningJob while it runs
  localizeJob, which involves DFS operations.
- The map-event fetcher holds the lock on the TaskTracker.runningJobs map
  and is waiting to lock that RunningJob object.
- TaskTracker offerService is waiting to lock the TaskTracker.runningJobs
  map, so it fails to send a heartbeat within 10 minutes.

So, I think the map-event fetcher should skip jobs that are not localized.
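
A minimal sketch of the idea, not the actual TaskTracker code. It assumes
each job carries a flag that can be read without taking the job's lock.
The class name, the "localized" field and the jobsToFetch method are
hypothetical and exist only for illustration. The fetcher checks the flag
first and never tries to lock a job that is still being localized, so it
does not sit on the runningJobs lock while waiting for localizeJob.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class LocalizedJobFilterSketch {

    /** Hypothetical stand-in for TaskTracker$RunningJob. */
    static class RunningJob {
        final String jobId;
        // Assumption: volatile, so the fetcher can read it without
        // holding this job's monitor.
        volatile boolean localized = false;

        RunningJob(String jobId) {
            this.jobId = jobId;
        }
    }

    /**
     * Returns the ids of the jobs the fetcher would process. Jobs that
     * are not yet localized are skipped before any attempt to lock them.
     */
    static List<String> jobsToFetch(Map<String, RunningJob> runningJobs) {
        List<String> result = new ArrayList<>();
        synchronized (runningJobs) {
            for (RunningJob rjob : runningJobs.values()) {
                if (!rjob.localized) {
                    // Localization may still hold rjob's lock; do not wait.
                    continue;
                }
                synchronized (rjob) {
                    result.add(rjob.jobId);
                }
            }
        }
        return result;
    }
}
```

With this check the fetcher holds the runningJobs lock only briefly, so a
thread such as offerService that needs the same lock is not held up by a
job that is still localizing.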



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.