[Ganglia-general] Ganglia XML error: required
Hi,

In the Ganglia web interface I often see XML errors like these:

There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 7776
There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 14268
There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 9872

How can I track down this error?

--
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general
[Ganglia-general] Cluster setup
Hi,

I am quite new to Ganglia, so I am having trouble setting it up the way I want. We are running an HPC cluster and I want Ganglia to monitor three sub-clusters:

Admin
Workstation
Computational

I have found some hints on how to do this, but nothing is working. I am able to monitor a few machines individually, but not to collect them into clusters. Right now I am doing this:

gmetad.conf

data_source "machine1" machine1
data_source "machine2" machine2

gmond.conf

cluster {
  name = machine1
  ..
}
udp_send_channel {
  host = machine1
  port = 8649
}

If I change host in udp_send_channel to something else and then telnet in to look at the XML, no metrics are sent.

Does anybody have any tips?

Regards,
Peter
Re: [Ganglia-general] Ganglia XML error: required
On Aug 29, 2011, at 4:23 AM, S 2011 wrote:

> Hi,
>
> In the Ganglia web interface I often see XML errors like these:
>
> There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 7776
> There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 14268
> There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 9872
>
> How can I track down this error?

telnet localhost 8658 > ganglia-output.xml
xmllint ganglia-output.xml

That, plus some visual inspection of the XML, should give you an idea of what's wrong.

alex
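The two commands above can also be combined into one script. This is only a minimal sketch, assuming the daemon sends its XML to any client that connects to the TCP port and then closes the connection, as the telnet capture implies. The function names `fetch_xml` and `check_well_formed` are hypothetical, and the check covers well-formedness only, not validation against the DTD.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host: str, port: int, timeout: float = 10.0) -> bytes:
    """Read everything the daemon sends until it closes the connection."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(16384)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks)


def check_well_formed(xml_bytes: bytes):
    """Return None if the XML parses, else a (line, column, message) tuple."""
    try:
        ET.fromstring(xml_bytes)
    except ET.ParseError as err:
        line, column = err.position
        return (line, column, str(err))
    return None


# Example use (host and port are taken from the telnet command above):
#   problem = check_well_formed(fetch_xml("localhost", 8658))
#   print("well-formed" if problem is None else problem)
```

Running the check repeatedly may help with intermittent errors, since a single successful dump does not rule out an occasional bad one.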
Re: [Ganglia-general] Cluster setup
Have you configured any udp_recv_channel sections?

http://linux.die.net/man/5/gmond.conf
http://sourceforge.net/apps/trac/ganglia/wiki/Gmond%203.1.x%20General%20Configuration

On Aug 29, 2011, at 8:23 AM, Peter Ellevseth wrote:

> Hi,
>
> I am quite new to Ganglia, so I am having trouble setting it up the way I want. We are running an HPC cluster and I want Ganglia to monitor three sub-clusters:
>
> Admin
> Workstation
> Computational
>
> I have found some hints on how to do this, but nothing is working. I am able to monitor a few machines individually, but not to collect them into clusters. Right now I am doing this:
>
> gmetad.conf
>
> data_source "machine1" machine1
> data_source "machine2" machine2
>
> gmond.conf
>
> cluster {
>   name = machine1
>   ..
> }
> udp_send_channel {
>   host = machine1
>   port = 8649
> }
>
> If I change host in udp_send_channel to something else and then telnet in to look at the XML, no metrics are sent.
>
> Does anybody have any tips?
>
> Regards,
> Peter
[Ganglia-general] vlan traffic counting
I've noticed on our VLAN interfaces that gmond's default network metrics seem to be miscalculating network traffic. Is anyone else seeing this? We don't use many VLANs.

It seems the Linux OS counters for a VLAN interface are also added onto the parent interface, and gmond reads both, so the traffic reported by gmond is ~2x greater than it really is.

E.g. for eth4 (no IP set) with an eth4.99 VLAN, /proc/net/dev shows

Inter-|   Receive                                                |  Transmit
 face |bytes    packets errs drop fifo frame compressed multicast|bytes    packets errs drop fifo colls carrier compressed
...
   eth4: 1453106293850 1688242724 0 0 0 0 0 11874 3090232715347 2518006182 0 0 0 0 0 0
eth4.99: 1429470895714 1688242724 0 0 0 0 0 11874 2988353281655 912706240 0 0 0 0 0 0

Maybe we have set up our interfaces oddly or something. I don't know why Tx packets differs between the 2 interfaces ... maybe an upstream MTU.

Aliased interfaces don't have the same problem, as Linux doesn't list them in /proc/net/dev.

Our setup is ganglia 3.2.0, x86_64, CentOS 5.6 userland, 2.6.32 vanilla kernels, ixgbe 10GigE.

The patch below fixes/hacks around the problem by simply skipping all VLAN interfaces, i.e. anything with a '.' in the name. It doesn't seem right somehow, but it works for me.

cheers,
robin
--
Dr Robin Humble, HPC Systems Analyst, NCI National Facility

--- ganglia-3.2.0.orig/libmetrics/linux/metrics.c 2010-05-11 00:39:54.0 +1000
+++ ganglia-3.2.0/libmetrics/linux/metrics.c 2011-08-29 16:19:55.0 +1000
@@ -181,8 +181,10 @@ void update_ifdata ( char *caller )
          p = index(p, ':');
 
          /* Ignore 'lo' and 'bond*' interfaces (but sanely) */
+         /* Ignore VLAN interfaces (eg. eth4.99) as stats are already included in parent */
          if (p && strncmp (src, "lo", 2) &&
-             strncmp (src, "bond", 4))
+             strncmp (src, "bond", 4) &&
+             (index(src,'.') == NULL || index(src, '.') > p))
             {
                p++;
                /* Check for data from the last read for this */
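The effect of the filter can be checked outside gmond. What follows is a minimal sketch in Python, not gmond's actual code: the function name `sum_interface_bytes` is hypothetical, and it assumes the usual /proc/net/dev layout in which receive bytes is the first counter after the colon and transmit bytes is the ninth.

```python
def sum_interface_bytes(proc_net_dev: str):
    """Sum rx/tx bytes over interfaces, skipping lo*, bond* and VLAN names."""
    rx_total = tx_total = 0
    for line in proc_net_dev.splitlines():
        name, sep, counters = line.partition(":")
        if not sep:
            continue  # the two header lines contain no ':'
        name = name.strip()
        # Same idea as the patch: a '.' in the name marks a VLAN interface
        # whose traffic is assumed to be counted on the parent already.
        if name.startswith("lo") or name.startswith("bond") or "." in name:
            continue
        fields = counters.split()
        rx_total += int(fields[0])  # receive bytes
        tx_total += int(fields[8])  # transmit bytes
    return rx_total, tx_total


# Example use on a live system:
#   with open("/proc/net/dev") as f:
#       print(sum_interface_bytes(f.read()))
```

Comparing the totals with and without the `"." in name` test on a host like the one above should show roughly the 2x difference described.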
Re: [Ganglia-general] Problem displaying Virtual Machine data with hsflowd and ganglia 3.2.0 in an Openstack Compute node.
Hi,

I downloaded and installed hsflowd trunk revision 226, but with it I keep seeing virStorageLookupByPath errors, and VM disk statistics aren't displayed.

Do I need to tell hsflowd explicitly to use the target=vda call? If yes, how?

Thanks in advance,
Emanuele

2011/8/25 Emanuele Verga verga.emanu...@gmail.com

> Hi Neil,
>
> Yes, that's possible. The problem is that Nova places each image in a separate folder (/var/lib/nova/instance/INSTANCENAME/), so we would have to create a new pool with the corresponding path each time a new instance is created, and if we start to add more servers it quickly becomes impractical.
>
> I've not yet been able to try the hsflowd version you suggested. I'll test it tomorrow and let you know.
>
> Thanks for the help!
> Emanuele
Re: [Ganglia-general] Problem displaying Virtual Machine data with hsflowd and ganglia 3.2.0 in an Openstack Compute node.
Sorry, the failure of virStorageLookupByPath() was preventing virDomainBlockStats() from being attempted. I checked in a fix for this, and also code to try the newer virDomainGetBlockInfo() call as a fallback should virStorageLookupByPath() fail. This call only came in with libvirt version 0.8.1. Are you running something newer than that? (See /usr/include/libvirt/libvirt.h.)

If this works, we should make a new release of hsflowd, so please let me know how it goes.

Regards,
Neil

On Aug 29, 2011, at 7:24 AM, Emanuele Verga wrote:

> Hi,
>
> I downloaded and installed hsflowd trunk revision 226, but with it I keep seeing virStorageLookupByPath errors, and VM disk statistics aren't displayed.
>
> Do I need to tell hsflowd explicitly to use the target=vda call? If yes, how?
>
> Thanks in advance,
> Emanuele
>
> 2011/8/25 Emanuele Verga verga.emanu...@gmail.com
>
> > Hi Neil,
> >
> > Yes, that's possible. The problem is that Nova places each image in a separate folder (/var/lib/nova/instance/INSTANCENAME/), so we would have to create a new pool with the corresponding path each time a new instance is created, and if we start to add more servers it quickly becomes impractical.
> >
> > I've not yet been able to try the hsflowd version you suggested. I'll test it tomorrow and let you know.
> >
> > Thanks for the help!
> > Emanuele
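The control flow of the fix described above can be sketched as follows. This is an illustration of the pattern only, not hsflowd's code and not the libvirt API: the three callables are hypothetical stand-ins for the three libvirt calls named in the message, and the assumption that the first two supply size information while the third supplies I/O counters is mine.

```python
def collect_disk_stats(lookup_by_path, get_block_info, get_block_stats):
    """Gather what is available; a failed size lookup must not skip I/O stats."""
    result = {}
    # Try the storage lookup first, then the newer call as a fallback.
    for source in (lookup_by_path, get_block_info):
        try:
            result["size"] = source()
            break  # first successful size lookup wins
        except RuntimeError:
            continue
    # Attempted regardless of how the size lookups went.
    try:
        result["io"] = get_block_stats()
    except RuntimeError:
        pass
    return result
```

The point of the structure is that each lookup fails independently, so one error no longer hides the statistics that are still obtainable.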
[Ganglia-general] Ganglia Web 2.1.6 - Host Display Errors
I updated to the 2.1.6 web frontend, but I am still experiencing the same issue reported against the 2.1.5 release (problems showing Host Overview for more than a second). I noticed in a previous thread that a new host_view.tpl was created for this issue, but in the 2.1.6 release the date on the file host_view.tpl is 2011-08-19.

What steps would be needed to correct this issue?

Thanks,

Brandon Jones
UNIX/SAN Administrator
Sanford Health
Fargo, ND
brandon.jo...@sanfordhealth.org
Re: [Ganglia-general] Ganglia XML error: required
Hi Alex,

Telnet and xmllint both succeed. I am not sure, but it could be because of the xml_parse logic in ganglia.php (one of the web interface files). It reads the XML in chunks of 16384 bytes and parses each chunk until the end of the file. In one post about the same issue, the suggestion was to read the entire XML into a variable and parse it all at once.

Thanks,
Sunil

On Mon, Aug 29, 2011 at 6:52 PM, Alex Dean a...@crackpot.org wrote:

> On Aug 29, 2011, at 4:23 AM, S 2011 wrote:
>
> > Hi,
> >
> > In the Ganglia web interface I often see XML errors like these:
> >
> > There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 7776
> > There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 14268
> > There was an error collecting ganglia data (127.0.0.1:8656): XML error: required at 9872
> >
> > How can I track down this error?
>
> telnet localhost 8658 > ganglia-output.xml
> xmllint ganglia-output.xml
>
> That, plus some visual inspection of the XML, should give you an idea of what's wrong.
>
> alex
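For comparison, here is a minimal sketch of chunked parsing in Python with the standard expat binding. It is not the ganglia.php code; the function name `count_elements_chunked` is hypothetical. It shows that 16384-byte chunks are fine in themselves, provided every chunk goes to the same parser and only the last one is flagged as final. Whether the PHP loop satisfies both conditions is not established in this thread.

```python
import xml.parsers.expat


def count_elements_chunked(xml_bytes: bytes, chunk_size: int = 16384) -> int:
    """Feed the document to one parser in fixed-size chunks; count start tags."""
    count = 0

    def on_start(name, attrs):
        nonlocal count
        count += 1

    parser = xml.parsers.expat.ParserCreate()
    parser.StartElementHandler = on_start
    for offset in range(0, len(xml_bytes), chunk_size):
        chunk = xml_bytes[offset:offset + chunk_size]
        # Only the chunk that reaches the end of the data is marked final.
        is_final = offset + chunk_size >= len(xml_bytes)
        parser.Parse(chunk, is_final)
    return count
```

A tag that straddles a chunk boundary is handled by the parser's internal state, so an error that appears only with chunked reads points at how the chunks are obtained or flagged rather than at the XML itself.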
[Ganglia-general] Set the default value for metric (RRD datasource value)
Hi,

RRD expects a value for each metric (data source) at a fixed interval. If the metric value is not generated in time, RRD by default stores it as NaN.

Is there any way to handle this case so that the value is initialised to 0 when the metric doesn't get a value in the required time slot?

Thanks,
Sunil
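One possible workaround is to substitute the default when the data is read rather than when it is stored. The sketch below assumes the samples have already been fetched into a Python list of floats; the function name `fill_unknown` is hypothetical, and this does not change what RRD itself stores. At graph time, the RPN `UN` operator in an rrdtool CDEF is the usual way to make the same substitution.

```python
import math


def fill_unknown(samples, default=0.0):
    """Return a copy of samples with NaN (unknown) entries replaced by default."""
    return [default if math.isnan(v) else v for v in samples]
```

Note that replacing unknown values with 0 makes a missed update indistinguishable from a real reading of 0, which may or may not be acceptable for a given metric.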