[ https://issues.apache.org/jira/browse/YARN-5964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Junping Du updated YARN-5964: ----------------------------- Fix Version/s: (was: 2.7.1) > Lower the granularity of locks in FairScheduler > ----------------------------------------------- > > Key: YARN-5964 > URL: https://issues.apache.org/jira/browse/YARN-5964 > Project: Hadoop YARN > Issue Type: Improvement > Components: fairscheduler > Affects Versions: 2.7.1 > Environment: CentOS-7.1 > Reporter: zhengchenyu > Priority: Critical > Original Estimate: 2m > Remaining Estimate: 2m > > When too many applications are running, we found that client couldn't submit > the application, and a high callqueuelength of port 8032. I catch the jstack > of resourcemanager when callqueuelength is too high. I found that the thread > "IPC Server handler xxx on 8032" are waitting for the object lock of > FairScheduler, nodeupdate holds the lock of the FairScheduler. Maybe high > process time leads to the phenomenon that client can't submit the > application. > Here I don't consider the problem that client can't submit the application, > only estimates the performance of the fairscheduler. We can see too many > function which needs object lock are used, the granularity of object lock is > too big. For example, nodeUpdate and getAppWeight wanna hold the same object > lock. It is unresonable and inefficiency. I recommand that the low > granularity lock replaces the current lock. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org