Re: [Ganglia-general] Help! I have a petabyte/s network (Martin Knoblauch)

2007-03-30 Thread Andreas Schoenfeld
Hi David and Martin, I suppose the network code is still the code I wrote, so there are two problems I know of: 1. yes there is a problem with owerflows 2. the shown network traffic is the sum of all network interfaces including local loopback devices (lo0...). Both Problems could lead to

[Ganglia-general] Not getting something

2007-03-30 Thread Michael Steeevs
I'm trying to set up what I hope/think is a pretty straight forward configuration -- I'm looking to monitor Oracle RAC via ganglia, and I've got three clusters (Prod, Dev and Test). I've got a machine I'm using right now for both gmetad and the web front end piece, and I can only get one host

Re: [Ganglia-general] Not getting something

2007-03-30 Thread Richard.Grevis
Michael, Use different multicast addresses for each cluster, unless you are sure the multicast can't leak from 1 cluster to another. Remember that when you list hosts after the data_source for gmetad.conf that is for resilience only. You do not have to mention all nodes in the cluster there.

[Ganglia-general] Hosts marked down although current data is being shown

2007-03-30 Thread Lewis E. Randerson
Hi, For four of the machines on one of our clusters, the Ganglia Host web page is showing 'This host is down' even though the host is up and the web page is displaying current data from it. These four machines are part of a group of fourteen that had their IP addresses changed and this

Re: [Ganglia-general] Help! I have a petabyte/s network (Martin Knoblauch)

2007-03-30 Thread Michael Perzl
Andreas, thank you for taking the blame but you are off the hook here. ;-) If I understood David correctly, he is using my AIX Ganglia RPM packages with POWER5 extensions. Here most if not all implementation of how the metrics are collected under AIX have been changed. Everything is

[Ganglia-general] Changing IP addresses sometimes causes heartbeat to not be seen even though TN is an acceptable value.

2007-03-30 Thread Lewis E. Randerson
Hi, I have changed the ip address of some members of one of our cluster and have seen that in some cases this causes the heartbeat to not be recognized even though TN is an acceptable value and current data is still displayed. What happens is that a This host is down' message is displayed and the

Re: [Ganglia-general] Changing IP addresses sometimes causes heartbeat to not be seen even though TN is an acceptable value.

2007-03-30 Thread Lewis E. Randerson
Ian, Thanks! That resolved it. I had restarted gmetad but not gmond on the head node. --Lew -Original Message- From: Ian Cunningham [mailto:[EMAIL PROTECTED] Sent: Friday, March 30, 2007 3:12 PM To: Lewis E. Randerson Cc: ganglia-general@lists.sourceforge.net; Kevin Ying Subject: