Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
Hi Mark, I assume cnode340 is the head node that all ~340 other gmond's send their data to. If so, you could reduce the amount of redundant metadata flying around by increasing send_metadata_interval to 120 seconds or higher. That is correct, cnode340 is the head node for ganglia. I have

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
Well things blew up ~184 hosts. The web interface shows a random number of hosts down each refresh, although sometimes there are all up. It reports just ~1 second to download and process the XML: Downloading and parsing ganglia's XML tree took 0.9751s. So I don't think timeouts are the problem.

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
. In fact even telneting to the port only lights up Recv-Q for 2 seconds flat. From: Nicholas Satterly [nfsatte...@gmail.com] Sent: Thursday, October 25, 2012 15:19 To: Potter,Mark L Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general

Re: [Ganglia-general] Question about scaling

2012-10-25 Thread Potter,Mark L
...@veus.hr] Sent: Thursday, October 25, 2012 16:20 To: Potter,Mark L; Nicholas Satterly Cc: ganglia-general@lists.sourceforge.net Subject: Re: [Ganglia-general] Question about scaling 60 seconds is likely the problem. I would leave it at default ie 15. I can explain later. Potter,Mark L mlpot

[Ganglia-general] Question about scaling

2012-10-23 Thread Potter,Mark L
I am using what I think to be a fairly standard gmond.conf: globals { daemonize = yes setuid = yes user = nobody debug_level = 0 max_udp_msg_len = 1472 mute = no deaf = no allow_extra_data = yes host_dmax = 86400 /*secs. Expires (removes from web interface) hosts in 1 day */

Re: [Ganglia-general] Question about scaling

2012-10-23 Thread Potter,Mark L
data_source MDACC 60 cnode340:8649 Everything else is default at this point. http://pastebin.com/UAQYxcX3 is a full copy. From: Nicholas Satterly [nfsatte...@gmail.com] Sent: Tuesday, October 23, 2012 13:33 To: Potter,Mark L Cc: ganglia-general

Re: [Ganglia-general] Ganglia not collecting any metrics

2012-10-10 Thread Potter,Mark L
at this am rolling it out to the rest but still have no idea why it started working. From: Nicholas Satterly [nfsatte...@gmail.com] Sent: Tuesday, October 09, 2012 15:37 To: Potter,Mark L Cc: Douglas Wagner; ganglia-general@lists.sourceforge.net Subject: Re

Re: [Ganglia-general] Ganglia not collecting any metrics

2012-10-09 Thread Potter,Mark L
    I've got one more idea/concept that I ran into this morning that MIGHT be your issue, but likely isn't. RHEL 5 (not sure if 4 does this also) seems to like, upon initial installation, in the /etc/hosts file, to put the current system name in the line for localhost. For instance:

[Ganglia-general] Ganglia not collecting any metrics

2012-10-08 Thread Potter,Mark L
I'll take another shot at this since I didn't get it working last time I posted. I am running (installed from rpm): Ganglia Web Frontend version 3.5.2 Ganglia Web Backend (gmetad) version 3.4.0 Gmond version 3.4.0 RHEL 5.5 RPM list: rpm -qa | grep ganglia ganglia-gmond-3.4.0-1

Re: [Ganglia-general] Ganglia not collecting any metrics

2012-10-08 Thread Potter,Mark L
Couple comments: 1) Fill in the URL in the cluster section, should be something like http://cnode340/ganglia 2) In your GMETAD.conf file your data_source should contain the port number:  data_source MDACC 10 cnode340:8649 Note:  In previous versions of GMETAD.conf I used 8651 for the data

Re: [Ganglia-general] Ganglia not collecting any metrics

2012-10-08 Thread Potter,Mark L
I found something else. I remade my RPMS from the source using: rpmbuild -ta --target x86_64 ganglia-3.4.0.tar.gz. Pulled out the originals and re-installed. Copied y old configs back. Then just to see I ran gmond -m and got an error which ended the output: (cut off for brevity, there are no

[Ganglia-general] Nodes never show up in web interface

2012-09-28 Thread Potter,Mark L
I am running the following: Ganglia Web Frontend version 3.5.2 Ganglia Web Backend (gmetad) version 3.4.0 Gmond version 3.4.0 RHEL 5.5 I currently have gmetad and gmond running on a single system (cnode340) And a single cluster node being monitored (cnode100) Here are the config files: Cnode340

Re: [Ganglia-general] Nodes never show up in web interface

2012-09-28 Thread Potter,Mark L
to talk just give me stats on itself and will go from there. From: Aaron Nichols [mailto:anich...@trumped.org] Sent: Friday, September 28, 2012 2:27 PM To: Potter,Mark L Subject: Re: [Ganglia-general] Nodes never show up in web interface On Fri, Sep 28, 2012 at 1:20 PM, Potter,Mark L mlpot