[ 
https://issues.apache.org/jira/browse/HADOOP-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod K V updated HADOOP-4523:
------------------------------

    Attachment: HADOOP-4523-200811-05.txt

Attaching a patch. This
 - makes TaskMemoryManagerThread to observe total memory usage across all 
tasks. If total usage crosses overall limit, TT tries and kills any tasks which 
cross individual task limits. If it cannot find such tasks, it kills the task 
with the least progress found via TaskTracker.findTaskToKill() which has 
already been used in case of overflowing disk. This method first tries to find 
the reduce task with least progress, otherwise it returns the map task with 
least progress.
 - marks tasks killed because of transgressing individual limits as failed, 
otherwise they are marked as killed.
 - includes testTasksWithinTTLimits, 
testTaskBeyondIndividualLimitsAndTotalUsageBeyondTTLimits and 
testTaskBeyondIndividualLimitsButTotalUsageWithinTTLimits. Couldn't write a 
test to check killing of a task with least progress; simulating this situation 
proved very difficult.

> Enhance how memory-intensive user tasks are handled
> ---------------------------------------------------
>
>                 Key: HADOOP-4523
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4523
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.19.0
>            Reporter: Vivek Ratan
>            Assignee: Vinod K V
>         Attachments: HADOOP-4523-200811-05.txt
>
>
> HADOOP-3581 monitors each Hadoop task to see if its memory usage (which 
> includes usage of any tasks spawned by it and so on) is within a per-task 
> limit. If the task's memory usage goes over its limit, the task is killed. 
> This, by itself, is not enough to prevent badly behaving jobs from bringing 
> down nodes. What is also needed is the ability to make sure that the sum 
> total of VM usage of all Hadoop tasks does not exceed a certain limit.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to