[jira] [Updated] (YARN-4230) Increasing container resource while there is no headroom left will cause ResourceManager to crash
[ https://issues.apache.org/jira/browse/YARN-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wangda Tan updated YARN-4230: - Issue Type: Sub-task (was: Bug) Parent: YARN-1197 > Increasing container resource while there is no headroom left will cause > ResourceManager to crash > - > > Key: YARN-4230 > URL: https://issues.apache.org/jira/browse/YARN-4230 > Project: Hadoop YARN > Issue Type: Sub-task > Components: resourcemanager >Reporter: MENG DING >Assignee: MENG DING >Priority: Critical > Attachments: YARN-4230.1.patch > > > This issue was found while doing end-to-end test of YARN-1197 in YARN-4175. > When increasing resource of a container, if there is no headroom left for the > user, the ResourceManager crashes with NPE. > The following is the stack trace: > {code} > 15/10/05 20:35:21 INFO capacity.ParentQueue: assignedContainer queue=root > usedCapacity=0.9375 absoluteUsedCapacity=0.9375 used= > cluster= > 15/10/05 20:35:49 FATAL resourcemanager.ResourceManager: Error in handling > event type NODE_UPDATE to the scheduler > java.lang.NullPointerException > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.IncreaseContainerAllocator.assignContainers(IncreaseContainerAllocator.java:327) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.ContainerAllocator.assignContainers(ContainerAllocator.java:66) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.common.fica.FiCaSchedulerApp.assignContainers(FiCaSchedulerApp.java:474) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.assignContainers(LeafQueue.java:819) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.assignContainersToChildQueues(ParentQueue.java:572) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.assignContainers(ParentQueue.java:423) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.allocateContainersToNode(CapacityScheduler.java:1177) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:1274) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:134) > at > org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$SchedulerEventDispatcher$EventProcessor.run(ResourceManager.java:691) > at java.lang.Thread.run(Thread.java:745) > 15/10/05 20:35:49 INFO resourcemanager.ResourceManager: Exiting, bbye.. > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (YARN-4230) Increasing container resource while there is no headroom left will cause ResourceManager to crash
[ https://issues.apache.org/jira/browse/YARN-4230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] MENG DING updated YARN-4230: Attachment: YARN-4230.1.patch The fix is simple. Attaching the patch with an added test case. > Increasing container resource while there is no headroom left will cause > ResourceManager to crash > - > > Key: YARN-4230 > URL: https://issues.apache.org/jira/browse/YARN-4230 > Project: Hadoop YARN > Issue Type: Bug > Components: resourcemanager >Reporter: MENG DING >Assignee: MENG DING >Priority: Critical > Attachments: YARN-4230.1.patch > > > This issue was found while doing end-to-end test of YARN-1197 in YARN-4175. > When increasing resource of a container, if there is no headroom left for the > user, the ResourceManager crashes with NPE. > The following is the stack trace: > {code} > 15/10/05 20:35:21 INFO capacity.ParentQueue: assignedContainer queue=root > usedCapacity=0.9375 absoluteUsedCapacity=0.9375 used= > cluster= > 15/10/05 20:35:49 FATAL resourcemanager.ResourceManager: Error in handling > event type NODE_UPDATE to the scheduler > java.lang.NullPointerException > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.IncreaseContainerAllocator.assignContainers(IncreaseContainerAllocator.java:327) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.allocator.ContainerAllocator.assignContainers(ContainerAllocator.java:66) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.common.fica.FiCaSchedulerApp.assignContainers(FiCaSchedulerApp.java:474) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.assignContainers(LeafQueue.java:819) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.assignContainersToChildQueues(ParentQueue.java:572) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.assignContainers(ParentQueue.java:423) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.allocateContainersToNode(CapacityScheduler.java:1177) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:1274) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:134) > at > org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$SchedulerEventDispatcher$EventProcessor.run(ResourceManager.java:691) > at java.lang.Thread.run(Thread.java:745) > 15/10/05 20:35:49 INFO resourcemanager.ResourceManager: Exiting, bbye.. > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)