[ https://issues.apache.org/jira/browse/YARN-3762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14571272#comment-14571272 ]
Karthik Kambatla commented on YARN-3762: ---------------------------------------- The patch just adds read-write locks to address any races. Haven't added any tests since it is hard to test race conditions. The code changed is accessed directly or indirectly by existing tests. Running jcarder should catch any deadlocks introduced by this change. > FairScheduler: CME on FSParentQueue#getQueueUserAclInfo > ------------------------------------------------------- > > Key: YARN-3762 > URL: https://issues.apache.org/jira/browse/YARN-3762 > Project: Hadoop YARN > Issue Type: Bug > Components: fairscheduler > Affects Versions: 2.7.0 > Reporter: Karthik Kambatla > Assignee: Karthik Kambatla > Priority: Critical > Attachments: yarn-3762-1.patch, yarn-3762-1.patch > > > In our testing, we ran into the following ConcurrentModificationException: > {noformat} > halxg.cloudera.com:8042, nodeRackName/rackvb07, nodeNumContainers0 > 15/05/22 13:02:22 INFO distributedshell.Client: Queue info, > queueName=root.testyarnpool3, queueCurrentCapacity=0.0, > queueMaxCapacity=-1.0, queueApplicationCount=0, queueChildQueueCount=0 > 15/05/22 13:02:22 FATAL distributedshell.Client: Error running Client > java.util.ConcurrentModificationException: > java.util.ConcurrentModificationException > at java.util.ArrayList$Itr.checkForComodification(ArrayList.java:901) > at java.util.ArrayList$Itr.next(ArrayList.java:851) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FSParentQueue.getQueueUserAclInfo(FSParentQueue.java:155) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.getQueueUserAclInfo(FairScheduler.java:1395) > at > org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.getQueueUserAcls(ClientRMService.java:880) > {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332)