Chang Li created YARN-4105: ------------------------------ Summary: Capacity Scheduler headroom for DRF is wrong Key: YARN-4105 URL: https://issues.apache.org/jira/browse/YARN-4105 Project: Hadoop YARN Issue Type: Bug Reporter: Chang Li Assignee: Chang Li
relate to the problem discussed in YARN-1857. But the min method is flawed when we are using DRC. Have run into a real scenario in production where queueCapacity: <memory:1056256, vCores:3750>, qconsumed: <memory:1054720, vCores:361>, consumed: <memory:125952, vCores:170> limit: <memory:214016, vCores:755>. headRoom calculation returns 88064 where there is only 1536 left in the queue because DRC effectively compare by vcores. It then caused deadlock because RMcontainer allocator thought there is still space for mapper and won't preempt a reducer in a full queue to schedule a mapper. Propose fix with componentwiseMin. -- This message was sent by Atlassian JIRA (v6.3.4#6332)