Hi Naga, Ravi, We have lots of small applications running on the cluster. We use Java 8 and Hadoop version 2.7.3. Resourcemanager is running on 40GB heap with NewRatio set to 3. We store 100,000 completed apps in memory (max-completed-apps). Tenured space occupies ~28 GB after a full GC. Is this footprint expected for 100,000 apps?
We did try CMS before with 70% occupancy fraction, there were ‘promotion failures’ that ended up in stop the world. Regards, Puneet From: Naganarasimha Garla <naganarasimha...@apache.org> Date: Wednesday, August 23, 2017 at 5:23 PM To: Ravi Prakash <ravihad...@gmail.com> Cc: "Ravuri, Venkata Puneet" <vrav...@ea.com>, "common-u...@hadoop.apache.org" <user@hadoop.apache.org> Subject: Re: Recommendation for Resourcemanager GC configuration Hi Puneet, Along with the heap dump details, I would also like to know the version of the Hadoop-Yarn being used, size of the cluster, all Memory configurations, and JRE version. Also if possible can you share the rational behind the choice for Parallel GC collector over others (CMS or G1) ? Regards, + Naga On Thu, Aug 24, 2017 at 2:54 AM, Ravi Prakash <ravihad...@gmail.com<mailto:ravihad...@gmail.com>> wrote: Hi Puneet Can you take a heap dump and see where most of the churn is? Is it lots of small applications / few really large applications with small containers etc. ? Cheers Ravi On Wed, Aug 23, 2017 at 9:23 AM, Ravuri, Venkata Puneet <vrav...@ea.com<mailto:vrav...@ea.com>> wrote: Hello, I wanted to know if there is any recommendation for ResourceManager GC settings. Full GC (with Parallel GC, 8 threads) is sometimes taking more than 30 sec due to which state store sessions to Zookeeper time out resulting in FATAL errors. The YARN cluster is heavily used with 1000’s of applications launched per hour. Could you please share any documentation related to best practices for tuning resourcemanager GC? Thanks, Puneet