[ https://issues.apache.org/jira/browse/YARN-80?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13450423#comment-13450423 ]
Hudson commented on YARN-80: ---------------------------- Integrated in Hadoop-Mapreduce-trunk-Commit #2728 (See [https://builds.apache.org/job/Hadoop-Mapreduce-trunk-Commit/2728/]) YARN-80. Add support for delaying rack-local containers in CapacityScheduler. Contributed by Arun C. Murthy. (Revision 1381872) Result = FAILURE acmurthy : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1381872 Files : * /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacitySchedulerConfiguration.java * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/LeafQueue.java * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/resources/capacity-scheduler.xml * /hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestLeafQueue.java > Support delay scheduling for node locality in MR2's capacity scheduler > ---------------------------------------------------------------------- > > Key: YARN-80 > URL: https://issues.apache.org/jira/browse/YARN-80 > Project: Hadoop YARN > Issue Type: Improvement > Components: capacityscheduler > Reporter: Todd Lipcon > Assignee: Arun C Murthy > Fix For: 2.1.0-alpha > > Attachments: YARN-80.patch, YARN-80.patch > > > The capacity scheduler in MR2 doesn't support delay scheduling for achieving > node-level locality. So, jobs exhibit poor data locality even if they have > good rack locality. Especially on clusters where disk throughput is much > better than network capacity, this hurts overall job performance. We should > optionally support node-level delay scheduling heuristics similar to what the > fair scheduler implements in MR1. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira