Not sure if RHEL-3 has this, but on RH9 there is a command called sysctl
that lets you force the kernel to use IGMP v2 at runtime (so no
patching is involved).
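
For what it's worth, here is a minimal sketch of the idea. It assumes the
kernel exposes the force_igmp_version key under /proc/sys/net/ipv4/conf,
which mainline Linux documents but which I have not confirmed on RH9 or
RHEL-3. The helper names are hypothetical. The shell equivalent would be
something like "sysctl -w net.ipv4.conf.all.force_igmp_version=2".

```python
from pathlib import Path

# Assumption: the running kernel provides the force_igmp_version sysctl.
# If it does not, the file below simply will not exist.
PROC_ROOT = Path("/proc/sys/net/ipv4/conf")


def igmp_sysctl_path(interface="all", root=PROC_ROOT):
    """Build the /proc path backing net.ipv4.conf.<interface>.force_igmp_version."""
    return Path(root) / interface / "force_igmp_version"


def read_forced_igmp_version(interface="all", root=PROC_ROOT):
    """Return the forced IGMP version (0 means no forcing).

    Returns None when the kernel does not expose the key at all.
    """
    try:
        return int(igmp_sysctl_path(interface, root).read_text().strip())
    except FileNotFoundError:
        return None


def force_igmp_version(version=2, interface="all", root=PROC_ROOT):
    """Ask the kernel to use the given IGMP version (needs root privileges)."""
    if version not in (0, 1, 2, 3):
        raise ValueError("IGMP version must be 0 (no forcing), 1, 2 or 3")
    igmp_sysctl_path(interface, root).write_text(f"{version}\n")
```

Calling read_forced_igmp_version() first is a cheap way to find out whether
the key is there at all; if it returns None, this approach is not available
on that kernel. The setting does not survive a reboot unless it is also
added to /etc/sysctl.conf.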

Cheers,

Bernard 

> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On 
> Behalf Of Jason A. Smith
> Sent: Wednesday, June 23, 2004 15:15
> To: Peter Schmid
> Cc: Steve Gilbert; Ganglia General
> Subject: Re: [Ganglia-general] Problem with multicast (I think)
> 
> I have also had igmp v3 problems here.  In one multicast 
> group I tried having some nodes running RHEL-3 and others 
> running RH-7.3.  The Enterprise kernel does igmp v3 by 
> default and is supposed to fall back to igmp v2 if it doesn't 
> get a response.  That doesn't appear to be happening here 
> with our network setup, which doesn't support igmp v3, see 
> this bugzilla report:
> 
> https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=125570
> 
> There is a kernel patch that can allow you to force the 
> kernel to use igmp v2, but RedHat doesn't want to apply it to RHEL-3.
> 
> 
> A while ago we also had another igmp problem that we were 
> never able to solve.  We purchased some new nodes for our 
> clusters that were also attached to a new switch that had 
> trunks connecting it to the other switches in the same 
> subnets.  For some unknown reason the igmp packets were not 
> being sent between the two switches.  This would cause a 
> split in the multicast groups if a node on one side of the 
> trunks was rebooted or gmond was restarted.  Basically the 
> nodes on each side of the switch would see each other's 
> multicast, but not see across the trunks.  Our network guy 
> tried fixing this with help from our Cisco tech rep, but 
> eventually gave up and we ended up disabling igmp on our 
> switches.  It wasn't really doing much for us anyway since 
> only our cluster nodes are connected to them and we obviously 
> want each node to be in the ganglia multicast group, which is 
> the only thing that is using multicast.
> 
> ~Jason
> 
> 
> On Wed, 2004-06-23 at 17:55, Peter Schmid wrote:
> > Are these new machines running a different OS?  I've been battling
> > the IGMP v3 issues here.  If the OS kernel is set for IGMP v3 and the
> > switch only does v2, or if it does v3 and the include/exclude groups
> > are not set up right, then the behavior you describe is just what
> > you will see.
> > 
> > HTH,
> > 
> > Pete.
> > 
> > On Wed, 23 Jun 2004, Steve Gilbert wrote:
> > 
> > > Howdy Gangliati,
> > > 
> > > I'm having a strange problem that seems to be with multicast, but
> > > I'm not really sure.  I had a very similar problem in the past and
> > > posted here about it...that turned out to be a problem on one of
> > > the network switches, but my network team insists that this is not
> > > the same issue.
> > > 
> > > I'm running Ganglia 2.5.4 (need to upgrade, I know) on about 16
> > > different clusters/subnets of ~200 hosts each.  Each subnet has a
> > > "control" host that also runs gmond as well as named, ypserv, dhcpd,
> > > etc.  I have a central monitoring host that is dedicated to running
> > > gmetad and the webfrontend that talks to the 16 different control
> > > nodes.  Hope that makes sense.  We've been running this way with no
> > > major trouble for quite a while.
> > > 
> > > I recently brought a new subnet/cluster online, and now I'm having
> > > trouble.  The control box on this subnet seems to be isolated from
> > > the rest.  gstat --all only shows itself, not the rest of the
> > > subnet.  The rest of the subnet sees everything except this control
> > > box.
> > > 
> > > I've rebooted all the machines as well as restarted all gmonds
> > > several times.  When you first start up gmond on the control box, it
> > > only sees itself...then some random amount of time later, it will
> > > list the other nodes in the subnet as being dead.  Similarly, the
> > > other hosts report the control box as being dead.  I can point my
> > > gmetad to a random node in the subnet, and that works fine...I just
> > > can't get the control box to be part of the cluster.  So it seems to
> > > me that they do communicate at some point to at least populate the
> > > dead list.  I've done tcpdumps looking for multicast traffic between
> > > the control box and the rest, but nothing ever shows up.
> > > 
> > > The control box is on a different physical network segment...the
> > > nodes are plugged into 48-port Cisco switches (100 Mb), and those
> > > have a GigE connection back to a big Cisco 6500.  The control box
> > > has a direct GigE connection to the 6500.  Same deal as with all our
> > > other subnets.
> > > 
> > > I'm no network whiz, but I've had our network team beating their
> > > heads against this, and they insist there is nothing wrong on their
> > > end.  Anyone else have any ideas?  Thanks!
> > > 
> > > 
> > > Steve Gilbert
> > > Unix Systems Administrator
> > > [EMAIL PROTECTED]
> > > 
> > > 
> > > -------------------------------------------------------
> > > This SF.Net email sponsored by Black Hat Briefings & Training.
> > > Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
> > > digital self defense, top technical experts, no vendor pitches, 
> > > unmatched networking opportunities. Visit www.blackhat.com 
> > > _______________________________________________
> > > Ganglia-general mailing list
> > > Ganglia-general@lists.sourceforge.net
> > > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > 
> --
> /------------------------------------------------------------------\
> |  Jason A. Smith                          Email:  [EMAIL PROTECTED] |
> |  Atlas Computing Facility, Bldg. 510M    Phone:  (631)344-4226   |
> |  Brookhaven National Lab, P.O. Box 5000  Fax:    (631)344-7616   |
> |  Upton, NY 11973-5000                                            |
> \------------------------------------------------------------------/
> 
