[ 
https://issues.apache.org/jira/browse/YARN-8764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16610566#comment-16610566
 ] 

Hu Ziqian commented on YARN-8764:
---------------------------------

hi [~wangda], i find this error in our cluster which based on hadoop-2.8.x. And 
i found issue https://issues.apache.org/jira/browse/YARN-5864 has already 
reconstruction this part. In YARN-5864, you use threadlocal in 
PriorityUtilizationQueueOrderingPolicy. I think we should also use threadlocal 
in PartitionedQueueComparator in 2.8.x.

That is just my initial thought. Can you give me some advice?

Thank you.

> Comparison method violates its general contract in Async Schedule
> -----------------------------------------------------------------
>
>                 Key: YARN-8764
>                 URL: https://issues.apache.org/jira/browse/YARN-8764
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: scheduler
>    Affects Versions: 2.8.0
>            Reporter: Hu Ziqian
>            Priority: Major
>              Labels: async-capacity-scheduler
>         Attachments: YARN-8764-branch-2.8.4.001.patch
>
>
> We use node label and async schedule in our cluster, and found our rm throw 
> many {{IllegalArgumentException which stack is blow:}}
>  
> {code:java}
> 2018-09-11 11:59:23,585 ERROR 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler:
>  AysncSchedulerThread encounter runtime exception!
> java.lang.IllegalArgumentException: Comparison method violates its general 
> contract!
>         at java.util.TimSort.mergeHi(TimSort.java:899)
>         at java.util.TimSort.mergeAt(TimSort.java:516)
>         at java.util.TimSort.mergeForceCollapse(TimSort.java:457)
>         at java.util.TimSort.sort(TimSort.java:254)
>         at java.util.Arrays.sort(Arrays.java:1512)
>         at java.util.ArrayList.sort(ArrayList.java:1454)
>         at java.util.Collections.sort(Collections.java:175)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.sortAndGetChildrenAllocationIterator(ParentQueue.java:685)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.assignContainersToChildQueues(ParentQueue.java:698)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.ParentQueue.assignContainers(ParentQueue.java:521)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.allocateOrReserveNewContainers(CapacityScheduler.java:1720)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.allocateContainerOnSingleNode(CapacityScheduler.java:1715)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.allocateContainersToNode(CapacityScheduler.java:1806)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.allocateContainersToNode(CapacityScheduler.java:1446)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.schedule(CapacityScheduler.java:487)
>         at 
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler$AsyncScheduleThread.run(CapacityScheduler.java:516)
> {code}
> the reason is partitionToLookAt in PartitionedQueueComparator is a string and 
> each Async Schedule thread will set it. It cause the compare function in 
> PartitionedQueueComparator incorrect and finally casues the 
> IllegalArgumentException.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to