Re: YARN cluster underutilization

2016-06-22 Thread Sunil Govind
in cluster configuration JSON). >> >> >> >> Thanks again to Sunil and Shubh (and my colleague, York) for the helpful >> guidance! >> >> >> >> Take care, >> >> -Jeff >> >> >> >> *From:* Shubh hadoopExp [m

Re: YARN cluster underutilization

2016-06-21 Thread Deepak Goel
idance! > > > > Take care, > > -Jeff > > > > *From:* Shubh hadoopExp [mailto:shubhhadoop...@gmail.com] > *Sent:* Wednesday, May 25, 2016 11:08 PM > *To:* Guttadauro, Jeff <jeff.guttada...@here.com> > *Cc:* Sunil Govind <sunil.gov...@gmail.com>; user@hadoo

Re: YARN cluster underutilization

2016-06-19 Thread Shubh hadoopExp
e, York) for the helpful >> guidance! >> >> Take care, >> -Jeff >> >> From: Shubh hadoopExp [mailto:shubhhadoop...@gmail.com >> <mailto:shubhhadoop...@gmail.com>] >> Sent: Wednesday, May 25, 2016 11:08 PM >> To: Guttadauro,

RE: YARN cluster underutilization

2016-05-31 Thread Guttadauro, Jeff
Yes, that’s correct, Shubh. Thanks again… From: Shubh hadoopExp [mailto:shubhhadoop...@gmail.com] Sent: Saturday, May 28, 2016 3:20 AM To: Guttadauro, Jeff <jeff.guttada...@here.com> Cc: user@hadoop.apache.org Subject: Re: YARN cluster underutilization Hey, that’s pretty good. So by ch

Re: YARN cluster underutilization

2016-05-28 Thread Shubh hadoopExp
l.com] > Sent: Wednesday, May 25, 2016 11:08 PM > To: Guttadauro, Jeff <jeff.guttada...@here.com> > Cc: Sunil Govind <sunil.gov...@gmail.com>; user@hadoop.apache.org > Subject: Re: YARN cluster underutilization > > Hey, > > OFFSWITCH allocation means if the data localit

RE: YARN cluster underutilization

2016-05-27 Thread Guttadauro, Jeff
<jeff.guttada...@here.com> Cc: Sunil Govind <sunil.gov...@gmail.com>; user@hadoop.apache.org Subject: Re: YARN cluster underutilization Hey, OFFSWITCH allocation indicates whether data locality is maintained or not. It has no relation to the heartbeat! The heartbeat is just used to clear the pipelining of

Re: YARN cluster underutilization

2016-05-25 Thread Shubh hadoopExp
ou suggest any other knobs to turn to help RM > handle it? > > Thanks again for all your help, Sunil! > > From: Sunil Govind [mailto:sunil.gov...@gmail.com] > Sent: Wednesday, May 25, 2016 1:07 PM > To: Guttadauro, Jeff <jeff.guttada...@here.com>; user@hadoop.

Re: YARN cluster underutilization

2016-05-25 Thread Sunil Govind
dnesday, May 25, 2016 1:07 PM > > > *To:* Guttadauro, Jeff <jeff.guttada...@here.com>; user@hadoop.apache.org > *Subject:* Re: YARN cluster underutilization > > > > Hi Jeff, > > > > I do see the yarn.resourcemanager.nodemanagers.heartbeat-int

RE: YARN cluster underutilization

2016-05-25 Thread Guttadauro, Jeff
RM handle it? Thanks again for all your help, Sunil! From: Sunil Govind [mailto:sunil.gov...@gmail.com] Sent: Wednesday, May 25, 2016 1:07 PM To: Guttadauro, Jeff <jeff.guttada...@here.com>; user@hadoop.apache.org Subject: Re: YARN cluster underutilization Hi Jeff, I

Re: YARN cluster underutilization

2016-05-25 Thread Sunil Govind
ldn’t > you generally expect fairly stable utilization over the course of the job? > (This is the only job running.) > > > > Thanks, > > -Jeff > > > > *From:* Sunil Govind [mailto:sunil.gov...@gmail.com] > *Sent:* Wednesday, May 25, 2016 11:55 AM > > > *To:* G

RE: YARN cluster underutilization

2016-05-25 Thread Guttadauro, Jeff
25, 2016 11:55 AM To: Guttadauro, Jeff <jeff.guttada...@here.com>; user@hadoop.apache.org Subject: Re: YARN cluster underutilization Hi Jeff. Thanks for sharing this information. I have some observations from these logs. - I think the node heartbeat is around 2/3 seconds here. Is it chang

Re: YARN cluster underutilization

2016-05-25 Thread Sunil Govind
are pretty big, and I thought that might be sufficient. > > > > Any guidance is much appreciated! > > -Jeff > > > > *From:* Sunil Govind [mailto:sunil.gov...@gmail.com] > *Sent:* Wednesday, May 25, 2016 10:55 AM > *To:* Guttadauro, Jeff <jeff.guttada...@he

Re: YARN cluster underutilization

2016-05-25 Thread Sunil Govind
Hi Jeff, It looks like you are allocating more memory than necessary for the AM container. You most likely do not need 6 GB (as per the log). Could you please provide some more information? 1. What type of MapReduce application (wordcount, etc.) are you running? Some AMs may be CPU intensive and some may

YARN cluster underutilization

2016-05-25 Thread Guttadauro, Jeff
Hi, all. I have an M/R (map-only) job that I'm running on a Hadoop 2.7.1 YARN cluster that is being quite underutilized (utilization of around 25-30%). The EMR cluster is 1 master + 20 core m3.xlarge nodes, which have 8 cores each and 15G total memory (with 11.25G of that available to YARN).
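For a rough sense of scale, the figures in the message above can be turned into a back-of-the-envelope estimate. This is only a sketch under two assumptions that the message does not confirm: that the 25-30% utilization refers to memory, and that all 20 core nodes offer the full 11.25G to YARN. The function and variable names are illustrative only.

```python
# Back-of-the-envelope estimate of YARN memory in use, using the figures
# quoted in the original message. Assumption: "utilization" means memory.

CORE_NODES = 20               # core nodes in the cluster (master excluded)
YARN_MEM_PER_NODE_GB = 11.25  # memory available to YARN on each node


def cluster_yarn_memory_gb(nodes: int, mem_per_node_gb: float) -> float:
    """Total memory YARN can allocate across all nodes."""
    return nodes * mem_per_node_gb


def memory_in_use_gb(total_gb: float, utilization: float) -> float:
    """Memory in use for a utilization given as a fraction (0.25 = 25%)."""
    return total_gb * utilization


total = cluster_yarn_memory_gb(CORE_NODES, YARN_MEM_PER_NODE_GB)
low = memory_in_use_gb(total, 0.25)
high = memory_in_use_gb(total, 0.30)

print(f"Total YARN memory: {total:.2f}G")
print(f"In use at 25-30%: {low:.2f}G to {high:.2f}G")
```

Under those assumptions, roughly a quarter to a third of the cluster's YARN memory is in use at any time, which leaves well over half of the capacity idle.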