Re: [Ganglia-general] High load on gmetad machine

2015-06-12 Thread Adam Compton
I got a substantial performance improvement out of a) using a ramdisk and then b) recompiling the rrdtool libraries to disable the use of mmap (they have a configure flag for this), and then recompiling gmetad with those modified libraries. This keeps gmetad from bothering to read and mmap() th

Re: [Ganglia-general] Down hosts remain on disk and flood RAM disk

2014-08-25 Thread Adam Compton
On 8/25/14 6:53 AM, Cristovao Jose Domingues Cordeiro wrote: Hi all, I have a Ganglia setup running with a 4Gb ramdisk. I'm monitoring several opportunistic clusters with short-lifetimed VM's which are renewed a lot, causing a huge number of down hosts in Ganglia. >From a web interface poi

Re: [Ganglia-general] Two questions: (1) default rrd settings, (2) gmond vs gmetad collection rates

2014-08-06 Thread Adam Compton
On 8/6/14, 10:14 AM, Mark Selby wrote: My company has just stated using Ganglia for production metrics gathering and as I like to really understand "what is happening" in my environment I have a few questions that I can not seem to truly figure out on my own. All and any help is greatly appreci

Re: [Ganglia-general] Ganglia 4.x architecture planning

2014-03-27 Thread Adam Compton
I'm in favor of teaching gmetad how to send the metrics it collects to a wider variety of things, particularly if there's a plugin interface for writing them. - Adam On 3/27/14 1:07 PM, Daniel Pocock wrote: > > I made up a rough diagram about how Ganglia 4.x could look: > > https://raw.githubu

Re: [Ganglia-general] Ganglia 4.x architecture planning

2014-03-27 Thread Adam Compton
On 3/27/14 1:20 PM, Daniel Pocock wrote: > > On 27/03/14 21:16, Adam Compton wrote: >> I'm in favor of teaching gmetad how to send the metrics it collects to a >> wider variety of things, particularly if there's a plugin interface for >> writing them. >>

Re: [Ganglia-general] multiple clusters with just one collector

2014-01-28 Thread Adam Compton
On 1/28/14 11:18 AM, Sergio Ballestrero wrote: > On 28 Jan 2014, at 20:10, Adam Compton wrote: > >> The gmond "globals" configuration option "host_tmax" controls how long a >> host can go without a heartbeat before being seen as "down"; it's

Re: [Ganglia-general] multiple clusters with just one collector

2014-01-28 Thread Adam Compton
On 1/28/14 6:21 AM, Sergio Ballestrero wrote: > Hello Adrian, > > On 25 Jan 2014, at 21:34, Adrian Sevcenco wrote: >> On 01/25/2014 09:37 PM, Sergio Ballestrero wrote: >>> Hello Adrian, if the host for which you send gmetrics is not a gmond >>> "client", you need to also spoof a "heartbeat" metric