Re: [Ganglia-general] Does Ganglia measure itself?

2010-09-21 Thread Jim Rowan
On Sep 21, 2010, at 1:33 PM, Stevens, Weston J wrote: > Obviously, if Ganglia is using up considerable resources, one would > want to know. However, the purpose of Ganglia is to measure the > performance of everything else, Ganglia can skew this data. Ideally > and naturally, one would like

Re: [Ganglia-general] notification and logs

2008-07-01 Thread Jim Rowan
On Jul 1, 2008, at 12:28 PM, David Ritch wrote: > I need to monitor a cluster. I hear very good things about Ganglia, > but it seems to be missing a couple of key components. I'd like to > find out if I'm missing something, or how other ganglia users handle > this. The two components tha

Re: [Ganglia-general] Big clusters

2007-11-08 Thread Jim Rowan
Douglas Nordwall wrote: > thanks for all the input. Perhaps unsurprisingly, we are already > looking at most of this for dealing with a large cluster, but I'm > happy to see that we're not the biggest :) > > The RRD issues is good to know. We have a _very_ large disk array that > we can write t

Re: [Ganglia-general] Gmond assuming other gmonds are in same cluster

2007-11-01 Thread Jim Rowan
Andy Brody wrote: >Ah, so I can't use just one head node gmond to receive data from >different clusters. Oh well... > > Actually, you can. We have an arrangement where we have several clusters arranged in several grids all reporting into one machine. We use unicast from each machine to a

Re: [Ganglia-general] summary data not up to date

2007-09-07 Thread Jim Rowan
Bernard Li wrote: >Instead of running gmetad on multiple ports of one server, have you >tried running gmetad on the cluster headnodes and then have them >aggregate the data to the current gmetad server? You can do this by >having a data_source entry like the following: > ># data_source "my grid"

Re: [Ganglia-general] summary data not up to date

2007-09-07 Thread Jim Rowan
Bernard Li wrote: >HI Jim: > >Can you describe the network connections you have for the hosts >(bandwidth and latency) and which servers gmetad are running -- in >order words I need to figure out whether you are doing federation or >not (i.e. multiple gmetads aggregate data back to one main gmetad

[Ganglia-general] 4T limit on memory?

2007-09-07 Thread Jim Rowan
We have a cluster with more than 4T of memory. Ganglia (3.0.4) won't show that on the graphs, although if you visit the physical view it seems to have the correct number. If you restart all the gmonds, the summary memory graph looks like a sawtooth; ramping up to 4T and dropping suddenly to z

[Ganglia-general] summary data not up to date

2007-09-07 Thread Jim Rowan
I have 3.0.4 installed and mostly working properly. We have lots of hosts, so the rrds are on tmpfs. I have a grid-of-grids configuration, with separate gmetad's running for each grid, all on the same host. The server in question is pretty beefy; it's a Sun T2000; typically has a load averag

[Ganglia-general] php error

2003-04-04 Thread Jim Rowan
.3 on solaris 9, apache 1.3.27, php 4.3.1. I've verified that you can indeed connect to 127.0.0.1:8651 and get appropriate data...PHP works for other apps, although it's not used very extensively here. Thanks, Jim Rowan [EMAIL PROTECTED]

[Ganglia-general] solaris not reporting running processes

2003-01-30 Thread Jim Rowan
e any hints on what might be wrong? Thanks, Jim Rowan [EMAIL PROTECTED]