[ 
https://issues.apache.org/jira/browse/YARN-3091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14289552#comment-14289552
 ] 

Wangda Tan commented on YARN-3091:
----------------------------------

Thanks for jumping in and providing your thoughts, [~gtCarrera], [~sunilg], 
[~ozawa], [~rohithsharma], [~varun_saxena].

I've just updated the title of this JIRA a little, following suggestions from 
[~gtCarrera]. I think it's better to put improvements and fixes together in 
this ticket, since they share a lot of background work. And +1 to fixing bugs 
before improvements, though in some places we may be able to address both at 
once.

I agree we should run JCarder first to pinpoint problems; that will give us 
some valid input. But I'm not sure what the plan for HADOOP-9213 is. If it 
needs more time, we can do some work on our side in parallel.

> [Umbrella] Improve and fix locks of RM scheduler
> ------------------------------------------------
>
>                 Key: YARN-3091
>                 URL: https://issues.apache.org/jira/browse/YARN-3091
>             Project: Hadoop YARN
>          Issue Type: Task
>          Components: capacityscheduler, fairscheduler, resourcemanager, 
> scheduler
>            Reporter: Wangda Tan
>
> In the existing YARN RM scheduler, there are some issues with how locks are 
> used. For example:
> - Many unnecessary synchronized locks. We have seen several cases recently 
> where too-frequent access to the scheduler makes it hang. This could be 
> addressed by using read/write locks. Affected components include the 
> scheduler, CS queues, and apps.
> - Some fields are not properly locked (like clusterResource).
> We can address them together in this ticket.
> (For more details, see the comments below.)
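
To illustrate the read/write lock idea from the description, here is a minimal sketch, not code from the YARN code base. The class ClusterResourceTracker, its memoryMb field, and both method names are hypothetical stand-ins for scheduler state such as clusterResource. It assumes reads are much more frequent than updates, which is the case where replacing synchronized with a read/write lock tends to help.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Hypothetical stand-in for scheduler state such as clusterResource;
// this is an illustration, not a class from the YARN code base.
class ClusterResourceTracker {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private long memoryMb; // guarded by lock

    // Readers share the read lock, so concurrent queries do not block
    // each other the way they would under a synchronized method.
    long getMemoryMb() {
        lock.readLock().lock();
        try {
            return memoryMb;
        } finally {
            lock.readLock().unlock();
        }
    }

    // Updates take the exclusive write lock, which waits for active
    // readers to finish and blocks new ones until the update is done.
    void addNodeMemory(long deltaMb) {
        lock.writeLock().lock();
        try {
            memoryMb += deltaMb;
        } finally {
            lock.writeLock().unlock();
        }
    }
}
```

Guarding every read and write of the field through the same lock also covers the second point in the description, fields that are accessed without proper locking.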



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
