Michael,

Use different multicast addresses for each cluster,
unless you are sure the multicast can't leak
from one cluster to another.

Remember that the hosts you list after data_source in
gmetad.conf are there for resilience only. You do not have to
mention all nodes in the cluster there.

Given your symptoms it might be something else. I suggest you
consider using unicast initially rather than multicast until
you get everything going. (A udp_send_channel in gmond.conf pointing to
a nominated headnode on each cluster, then a data_source for that headnode.)
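
As a rough sketch of that unicast setup for your Dev cluster, assuming
lsora1003 is the nominated headnode (my pick purely for illustration, any
node will do) and keeping the port 8649 you already use. I have not tested
this against your gmond version, so treat it as a starting point:

```
/* gmond.conf on every Dev node, headnode included: send metrics
 * straight to the headnode instead of joining a multicast group.
 * Using lsora1003 as the headnode is an assumption for illustration. */
udp_send_channel {
  host = lsora1003
  port = 8649
}

/* gmond.conf on the headnode (lsora1003) only: receive the unicast
 * packets from the other nodes ... */
udp_recv_channel {
  port = 8649
}

/* ... and serve the collected cluster state as XML to gmetad (or nc). */
tcp_accept_channel {
  port = 8649
}
```

With that in place, gmetad.conf would need only one host for the cluster:

  data_source "Oracle RAC Dev nodes" lsora1003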

Netcatting hosts can also be very instructive (e.g. nc lsora1006 8649).
Are all expected hosts listed in the nc output?
Are there unexpected hostnames? (gmond does a reverse DNS lookup to make hostnames.)
Is the cluster name returned by nc different for every cluster?
(The cluster name in gmetad.conf is not used.)

Good luck,

Richard Grevis
Production Architecture
Barclays Capital, Canary Wharf, London, E14 4BB



> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On 
> Behalf Of Michael Steeves
> Sent: 30 March 2007 15:29
> To: ganglia-general@lists.sourceforge.net
> Subject: [Ganglia-general] Not getting something
> 
> 
> 
> I'm trying to set up what I hope/think is a pretty 
> straightforward configuration -- I'm looking to monitor Oracle RAC 
> via ganglia, and I've got three clusters (Prod, Dev and 
> Test).  I've got a machine I'm using right now for both 
> gmetad and the web front end piece, and I can only get one 
> host from each cluster to show up as 'up' in the web front end.
> 
> Right now, I've got the following setup:
> 
> lsora1003 and lsora1006 -- Dev RAC, running just gmond.
> 
> cluster { 
>   name = "Oracle RAC Dev nodes" 
>   owner = "myorg" 
>   latlong = "unspecified" 
>   url = "unspecified" 
> } 
> 
> /* Feel free to specify as many udp_send_channels as you
>  * like.  Gmond used to only support having a single channel */ 
> udp_send_channel { 
>   mcast_join = 239.2.100.71 
>   port = 8649 
> } 
> 
> /* You can specify as many udp_recv_channels as you
>  * like as well. */ 
> udp_recv_channel { 
>   mcast_join = 239.2.100.71 
>   port = 8649 
>   bind = 239.2.100.71 
> } 
> 
> lsora1001, lsora1002 and lsora1005 -- Test RAC, running just gmond.
> 
> cluster { 
>   name = "Oracle RAC Test nodes" 
>   owner = "myorg" 
>   latlong = "unspecified" 
>   url = "unspecified" 
> } 
> 
> /* Feel free to specify as many udp_send_channels as you
>  * like.  Gmond used to only support having a single channel */ 
> udp_send_channel { 
>   mcast_join = 239.2.101.71 
>   port = 8649 
> } 
> 
> /* You can specify as many udp_recv_channels as you
>  * like as well. */ 
> udp_recv_channel { 
>   mcast_join = 239.2.101.71 
>   port = 8649 
>   bind = 239.2.101.71 
> } 
> 
> lsora1004, lsora1007 and lsora1008 -- Prod RAC, running just gmond.
> 
> cluster { 
>   name = "Oracle RAC Prod nodes" 
>   owner = "myorg" 
>   latlong = "unspecified" 
>   url = "unspecified" 
> } 
> 
> /* Feel free to specify as many udp_send_channels as you
>  * like.  Gmond used to only support having a single channel */ 
> udp_send_channel { 
>   mcast_join = 239.2.102.71 
>   port = 8649 
> } 
> 
> /* You can specify as many udp_recv_channels as you
>  * like as well. */ 
> udp_recv_channel { 
>   mcast_join = 239.2.102.71 
>   port = 8649 
>   bind = 239.2.102.71 
> } 
> 
> My gmetad server has gmond running as cluster 'localhost', 
> and the following in the gmetad file:
> 
> data_source "localhost" localhost
> data_source "Oracle RAC Dev nodes" lsora1003 lsora1006
> data_source "Oracle RAC Test nodes" lsora1001 lsora1002 lsora1005
> data_source "Oracle RAC Prod nodes" lsora1004 lsora1007 lsora1008
> 
> Initially, none of the other hosts would show up in their 
> clusters, just the first node listed, but over time the other 
> nodes do appear, and are active briefly, and then show as 
> being down and unable to contact.
> 
> 
> -Mike
> -- 
> Michael Steeves ([EMAIL PROTECTED])
> 