Re: [Nagios-users] Virtual Machines - define as parent or as host dependency...
I think you should be fine with setting the VMWare host to be the parent of your guests. It's the simplest configuration, it meets your goals (host down suppresses guests, but host's services do not affect guests' notifications) and it has the added benefit of looking good in the Status Map :-) IMHO using parents for VMWare hosts/guests is within the intended use of the directive, even though the docs only talk about network equipment, because your guests are as directly dependent upon their host as a physical host is of its switch. On Mon, Jan 25, 2010 at 9:14 PM, Andrew Davis ncc...@gmail.com wrote: I'm trying to figure out the best way to do this, yet keep things as simple as possible. Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VM's on Saturn: Jupiter, Mars, and Pluto I want to suppress all host and service alerts on Jupiter, Mars, Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VM's. Am I better off doing this with a host dependency? Something like: *define hostdependency { host_name Host B dependent_host_name Host C notification_failure_criteria d,u }* Or am I better off defining Saturn as the parent of the VM's in the host config? Something like: *define host { host_name jupiter use VMs alias jupiter address 172.26.251.60 parents saturn, tpdmzsw1 }* I've successfully used the parents directive to define network topology, so I would think this would work. What might be the risks of defining both? -- A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Martin Melin op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Virtual Machines - define as parent or as host dependency...
This is the way we do it, with Parents (not host dependencies). First we create a virtual object for the VMWare farm. This has a status of UP if any of the farm servers are up (using check_summary). This virtual 'host' has several services, using the v0.9 check_vmware, relating to the farm's alarms, storage volumes, etc. These services have service dependencies on the VirtualCentre service running on the Virtual Centre host. The Farm object has ALL of the ESX Servers as Parents. All the VMs in the farm have the Farm object as a parent. Some of them also use check_esx3 to alert on Alarms, CPU, and Memory usage within VMWare. This might seem a bit complex if you've only the one server, but as soon as oyu have multiple servers in the farm, and use DRS, you have to use a farm object for parents/dependencies. It might make more sense for these relationships to be host dependencies rather than parents i nmost cases, but we have a SAN mirrored environment to a seocnd ESX farm so that the VMs can be brought up ther ein the event of a complete farm outage, hence the use of Parents rather than dependencies. If you have VSphere4 (ESX4.0) with a SNMP-enabled Cisco virtual switch in the farm, you could probably make the virtual switch the parent device rather than having to use a farm object. The VMWare monitoring plugin we're using is v0.9 of check_vmware, from here: http://www.steveshipway.org/forum/viewtopic.php?f=28t=1648 check_summary is available from nagiosexchange.org (as is check_esx3 which is the forerunner of check_vmware) Steve From: Andrew Davis [ncc...@gmail.com] Sent: Tuesday, 26 January 2010 9:14 a.m. To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Virtual Machines - define as parent or as host dependency... I'm trying to figure out the best way to do this, yet keep things as simple as possible. Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VM's on Saturn: Jupiter, Mars, and Pluto I want to suppress all host and service alerts on Jupiter, Mars, Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VM's. Am I better off doing this with a host dependency? Something like: define hostdependency { host_name Host B dependent_host_name Host C notification_failure_criteria d,u } Or am I better off defining Saturn as the parent of the VM's in the host config? Something like: define host { host_name jupiter use VMs alias jupiter address 172.26.251.60 parents saturn, tpdmzsw1 } I've successfully used the parents directive to define network topology, so I would think this would work. What might be the risks of defining both? -- A. Davis Email: ncc...@gmail.commailto:ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Overloaded master
I would stop doing the time-consuming ticket integration (or whatever is taking up the time) from within the notification command itself. I don't think you need to build a separate notification server - after all, notification logic is one of the things Nagios does best - but I do think you could build a quick program to do the heavy lifting for notifications, per Nagios' instructions. Without knowing what kind of info you need, I would probably have the Nagios notification command write to a spool directory, then send a signal to a separate program to have it read the spool directory and create/search for tickets etc. On Tue, Jan 26, 2010 at 2:02 AM, Mike Lindsey mike-nag...@5dninja.netwrote: A typical first tier notification goes to 20 people. One of those will be a pager, and is very simple. The rest are fairly complex. Notifications include a link to existing and recent tickets in our ticketing system (this also allows me to not send a ticket opening notification if a ticket already exists).. I populate the notification with links to cacti graphs, links to wiki documentation for the event as well as fire off a secondary notification handler that adds in additional information based on the host, service, and state. The first notification of the cycles does all the heavy lifting and takes about 6 seconds. The other 19 finish relatively quickly. I've been thinking of building a notification server - so I could have separate and discrete notification escalations for different service states - which would also let me fire off one notification with just the contents of $ENV{NAGIOS_*}.. Perhaps that's my best option? Martin Melin wrote: What kind of notifications are you doing and how many are you sending out? Why does a notification cycle take 9 seconds to complete? On Sat, Jan 23, 2010 at 12:13 AM, Mike Lindsey mike-nag...@5dninja.netmailto: mike-nag...@5dninja.net wrote: What kind of options does one have, if your master nagios server is getting overloaded? I have half a dozen slaves doing polling, submitting passive check results back via send_nsca. The master does no active polling, just event processing, notifications, and web ui. Under normal circumstances, it works alright. But after a restart it can take up to half an hour before the master catches up; and if there are a lot of events, the act of sending out notifications can cause it to fall behind. I'm pre-caching my object file, I'm skipping circular dependency checks, and I've gotten a notification cycle down to 9 seconds. I tried modifying nagios to fork before notifications, but that failed pretty spectacularly; so that 9 seconds is a time where 900 or so passive check submissions block until the notifications are done. Are there any options for running a dual-master setup, or other ways to spread the load across multiple machines? Has anyone patched nsca to submit check results into the checkresults directory, instead of via the nagios.cmd pipe? What kind of improvement can one expect from that? Any other advice? -- Mike Lindsey -- Martin Melin op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Defining service dependency
On Tue, Jan 26, 2010 at 10:33 AM, Asrai khn asrai...@gmail.com wrote: Later I'll switch my dependency configuration to something simple by using Time-saving Tricks --- http://nagios.sourceforge.net/docs/3_0/objecttricks.html#same_host_dependency Tell me one thing, is it ok to get ... Checking service dependencies... Checked 10 service dependencies. For the 5 dependent_service_description above? I have just configured service dependency using *All Services In Multiple Servicegroups *nagios docs. 1. First of all i have created (4) servicegroups for services which using nrpe for checking remote hosts including a servicegroup for service 'nrpe' on 8 different hosts The other 3 servicegroups contains total 95 services 2. Then defined service dependency using following config define servicedependency{ dependent_servicegroup_nameSol-CPU-Load,Sol-Partitions,Sol-Processes servicegroup_nameSol-nrpe execution_failure_criteriaw,u,c notification_failure_criteriaw,u,c } Everything is working just fine I have tested it by stopping nrpe on one of the server and nagios only send notification for master process 'nrpe' by suppressing all all other checks and notification for the host. However I am confusing with the nagios -v check its giving me too much counts for service dependency and i wonders if this have any effect on monitoring host (nagios) performance, working etc. *sudo nagios -v /etc/nagios/nagios.cfg Checking service dependencies... Checked 1530 service dependencies.* 1530 service dependencies. and this also include the previous 10 for another server, but I wonders what this counts mean? I'll greatly appreciate if someone shed some light on it. Thanks. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Using Unix groupnames in contact definitions or cgi authorization schemes
Nagios 3.x - I was wondering wether one can use UNIX groupnames in contact defintions , or in cgi auth. directives such as 'authorized_for_all_service_commands=... ; instead of having to use individual usernames ? I need to make a simple distinction between a couple of divisions, regarding nagios access and server monitoring in our company. Using individual usernames would be cumbersome w.r.t the simple allowed views I want to use in the NAGIOS (console). Tx, M. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] popen timeout received, but no child process
[1264500750] SERVICE ALERT: CCC151;PING;CRITICAL;SOFT;2;CRITICAL - popen timeout received, but no child process İ am getting this error. İ googled and read a few forum but there is no solutions for me. İ use FreeBSD 8.0 amd64 last pid: 24484; load averages: 0.05, 0.16, 0.21 up 0+04:37:59 12:21:44 281 processes: 1 running, 280 sleeping CPU: 14.9% user, 0.0% nice, 11.9% system, 0.0% interrupt, 73.2% idle Mem: 80M Active, 54M Inact, 111M Wired, 53M Buf, 739M Free Swap: 1500M Total, 1500M Free What can i do to solve this? Thanks.. I have a system like this. Checked 1231 hosts. Checking host groups... Checked 49 host groups. Checking service groups... Checked 0 service groups. Checking contacts... Checked 44 contacts. Checking contact groups... Checked 44 contact groups. Checking service escalations... Checked 0 service escalations. Checking service dependencies... Checked 0 service dependencies. Checking host escalations... Checked 0 host escalations. Checking host dependencies... Checked 0 host dependencies. Checking commands... Checked 24 commands. Checking time periods... Checked 2 time periods. Checking for circular paths between hosts... Checking for circular host and service dependencies... Checking global event handlers... Checking obsessive compulsive processor commands... Checking misc settings... Total Warnings: 288 Total Errors: 0 All of my hosy definitions are like that: #'CCC151' host definition define host{ use generic-host host_name CCC151 alias CCC151 address 172.20.55.55 parents CCC_15 #check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_optionsd,u,r contact_groups CCC1_admin, CCC15_admin,CCC151-admin } And allof my service definitions are like: define service{ use generic-service host_name CCC151,CCC152,CCC153. etc. service_description PING is_volatile 0 check_period24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval1 contact_groups CCCs notification_interval 120 notification_period 24x7 notification_optionsc,r check_command check-host-alive } -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Virtual Machines - define as parent or ashost dependency...
-Original Message- From: Steve Shipway [mailto:s.ship...@auckland.ac.nz] Sent: Tuesday, January 26, 2010 3:03 AM To: ncc...@gmail.com; nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Virtual Machines - define as parent or ashost dependency... This is the way we do it, with Parents (not host dependencies). Hi Steve, been following this with great interest. Care to share how you do host dep's too?, this is all very helpful, thank you! :) First we create a virtual object for the VMWare farm. This has a status of UP if any of the farm servers are up (using check_summary). This virtual 'host' has several services, using the v0.9 check_vmware, relating to the farm's alarms, storage volumes, etc. These services have service dependencies on the VirtualCentre service running on the Virtual Centre host. The Farm object has ALL of the ESX Servers as Parents. So ultimately, the Farm object definition would be: the vcenter server (with esx hosts as parents?), combined with a service check on the vcenter service? Not familiar with check_summary, sorry, that's next up to google. ;) All the VMs in the farm have the Farm object as a parent. Some of them also use check_esx3 to alert on Alarms, CPU, and Memory usage within VMWare. This might seem a bit complex if you've only the one server, but as soon as oyu have multiple servers in the farm, and use DRS, you have to use a farm object for parents/dependencies. Yes, I feel that pain well. How can/do you keep track of what ESX hosts are parents to what vm's when DRS is in fully-automated mode, that seems to be the key... ? It might make more sense for these relationships to be host dependencies rather than parents i nmost cases, but we have a SAN mirrored environment to a seocnd ESX farm so that the VMs can be brought up ther ein the event of a complete farm outage, hence the use of Parents rather than dependencies. If you have VSphere4 (ESX4.0) with a SNMP-enabled Cisco virtual switch in the farm, you could probably make the virtual switch the parent device rather than having to use a farm object. The VMWare monitoring plugin we're using is v0.9 of check_vmware, from here: http://www.steveshipway.org/forum/viewtopic.php?f=28t=1648 check_summary is available from nagiosexchange.org (as is check_esx3 which is the forerunner of check_vmware) Steve Excellent - off to have a look. We use a combination of things at the moment, including check_esx3 and also use python/WBEM snmp on HP hardware. Thanks much! James -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_ifoperstatus Query
I maybe expecting something that is not designed, but I was hoping to exclude checking of my loopback interface #snmpwalk -v1 -c public localhost ifType IF-MIB::ifType.1 = INTEGER: softwareLoopback(24) IF-MIB::ifType.2 = INTEGER: ethernetCsmacd(6) IF-MIB::ifType.3 = INTEGER: ethernetCsmacd(6) # ./check_ifoperstatus -H localhost -T 6 OK: Interface eth0 (index 2) is up. Should I not be seeing two interfaces in the check_ifoperstatus output ? I have check_ifoperstatus v1.4.14 (nagios-plugins 1.4.14) on AS4 U6. Regards Nigel Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas ___ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Using Unix groupnames in contact definitions or cgi authorization schemes
On Jan 26, 2010, at 4:18 AM, Mark Elsen wrote: Nagios 3.x - I was wondering wether one can use UNIX groupnames in contact defintions , or in cgi auth. directives such as 'authorized_for_all_service_commands=... ; instead of having to use individual usernames ? There's no Apache auth module that works like that, that I know of. All nagios does is look at the web server environment variable 'REMOTE_USER' to determine the username of the logged in user. If you can find (or write) an Apache auth module that sets REMOTE_USER to what you need then it will work. The difficult part is that you want to change REMOTE_USER from the username passed during login to something else after authentication. I'm not sure that Apache permits/supports that. I need to make a simple distinction between a couple of divisions, regarding nagios access and server monitoring in our company. Using individual usernames would be cumbersome w.r.t the simple allowed views I want to use in the NAGIOS (console). You could create contacts and associated .htpasswd entries that are role accounts and provide the divisions with the single username/password that is theirs. That's worked well for our different NOCs. -- Marc -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Virtual Machines - define as parent or as host dependency...
Thanks much for the info. Curious... you mention ESX and VSphere. Do you know if your check_vmware test also works against VMWare Server 2.x (free edition). We are running ESX internally, but have a single external server running VMWare Server 2.x with three VMs in it. Hence why I ask... A. Davis Email: ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan On 1/26/10 3:03 AM, Steve Shipway wrote: This is the way we do it, with Parents (not host dependencies). First we create a virtual object for the VMWare farm. This has a status of UP if any of the farm servers are up (using check_summary). This virtual 'host' has several services, using the v0.9 check_vmware, relating to the farm's alarms, storage volumes, etc. These services have service dependencies on the VirtualCentre service running on the Virtual Centre host. The Farm object has ALL of the ESX Servers as Parents. All the VMs in the farm have the Farm object as a parent. Some of them also use check_esx3 to alert on Alarms, CPU, and Memory usage within VMWare. This might seem a bit complex if you've only the one server, but as soon as oyu have multiple servers in the farm, and use DRS, you have to use a farm object for parents/dependencies. It might make more sense for these relationships to be host dependencies rather than parents i nmost cases, but we have a SAN mirrored environment to a seocnd ESX farm so that the VMs can be brought up ther ein the event of a complete farm outage, hence the use of Parents rather than dependencies. If you have VSphere4 (ESX4.0) with a SNMP-enabled Cisco virtual switch in the farm, you could probably make the virtual switch the parent device rather than having to use a farm object. The VMWare monitoring plugin we're using is v0.9 of check_vmware, from here: http://www.steveshipway.org/forum/viewtopic.php?f=28t=1648 http://www.steveshipway.org/forum/viewtopic.php?f=28t=1648 check_summary is available from nagiosexchange.org (as is check_esx3 which is the forerunner of check_vmware) Steve *From:* Andrew Davis [ncc...@gmail.com] *Sent:* Tuesday, 26 January 2010 9:14 a.m. *To:* nagios-users@lists.sourceforge.net *Subject:* [Nagios-users] Virtual Machines - define as parent or as host dependency... I'm trying to figure out the best way to do this, yet keep things as simple as possible. Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VM's on Saturn: Jupiter, Mars, and Pluto I want to suppress all host and service alerts on Jupiter, Mars, Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VM's. Am I better off doing this with a host dependency? Something like: *define hostdependency { host_name Host B dependent_host_name Host C notification_failure_criteria d,u } * Or am I better off defining Saturn as the parent of the VM's in the host config? Something like: *define host { host_name jupiter use VMs alias jupiter address 172.26.251.60 parents saturn, tpdmzsw1 }* I've successfully used the parents directive to define network topology, so I would think this would work. What might be the risks of defining both? -- A. Davis Email:ncc...@gmail.com There is no limit to what a man can accomplish if he doesn't care who gets the credit. - Ronald Reagan -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Using Unix groupnames in contact definitions or cgi authorization schemes
Mark Elsen wrote: - I was wondering wether one can use UNIX groupnames in contact defintions , or in cgi auth. directives such as 'authorized_for_all_service_commands=... ; instead of having to use individual usernames ? I need to make a simple distinction between a couple of divisions, regarding nagios access and server monitoring in our company. Using individual usernames would be cumbersome w.r.t the simple allowed views I want to use in the NAGIOS (console). The only way to pull this off, really, is to define contactgroups in Nagios that match your Unix groups. Since Nagios does not deal with Unix users and groups directly, you'd need something like a script that generated Nagios configs from your account information. In our case here, that's relatively trivial. A simple script can go out and pull a group and its members from LDAP and reformat that information into a Nagios config. I suppose it wouldn't be much harder to do it with local accounts, if we had a script to parse the group information from the passwd and group files. So I guess the short answer is no, Nagios doesn't know how to deal with Unix account info directly, but that doesn't mean you can't do what you're trying to do. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Unexplainable disk space usage in nagios/var/spool/checkresults
Hi! I have Nagios installed on a 1GB partition, with everything installed under a nagios directory on that partition. I did not expect Nagios to start consuming a large amount of space, and now that it is, I cannot determine why or in what files, which is baffling. I've found the biggest culprit is the var/spool/checkresults directory within the Nagios installation. a `du` command on the directory shows that it is using 287MB. However, when I `ls -la` within that directory, I can't see any file using more than a single kilobyte, and there aren't that many files. However, an `ls -ld` on the directory itself shows that the directory itself is using 286MB. That doesn't make sense to me, and so I'm looking for input! I wonder if the following approach would help correct things (but I'd like to understand why they're in this state first): Stop nagios, move the checkresults directory to checkresults.old, make a new checkresults directory, copy all the old files into it, remove the empty .old directory, start nagios. The filesystem is JFS, if that makes any difference. Here is raw output of what I'm seeing: r...@indhlcwww1:/mnt/data/share/nagios-3.2.0/var/spool/checkresults # du -h 287M. r...@indhlcwww1:/mnt/data/share/nagios-3.2.0/var/spool/checkresults # ls -laSrh total 287M -rw--- 1 nagios nagios0 2010-01-26 15:23 cZar4eF.ok -rw--- 1 nagios nagios0 2010-01-04 19:15 cy8clT1.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cXaj1gN.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cvWmN4R.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cVvx4M8.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cvtm0Hd.ok -rw--- 1 nagios nagios0 2009-12-06 01:27 cvqSGDK -rw--- 1 nagios nagios0 2010-01-26 15:23 cv6pJ1B.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 ctwW4SD.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cTpIKAl.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cTm3Vzw.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 ctCCihr.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 csRsM4M.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cryC1Dq.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cRR4Ubm.ok -rw--- 1 nagios nagios0 2009-11-27 23:24 cqqacdj -rw--- 1 nagios nagios0 2010-01-26 15:23 cq75lkN.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cnxxq4z.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cnDr6c0.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cmxnujo.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cMPxHgo.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cMf26lf.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cLksJml.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 ckVnJcE.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cjjgyfh.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cjCc8ov.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 chokVz4.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cHN3b3M.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cHlsWuh.ok -rw--- 1 nagios nagios0 2009-12-11 19:19 checkyTpLfU -rw--- 1 nagios nagios0 2009-11-05 04:12 checkXbP1al -rw--- 1 nagios nagios0 2009-11-04 12:53 checkWx0m6z -rw--- 1 nagios nagios0 2009-11-05 04:12 checkwpqhVE -rw--- 1 nagios nagios0 2009-12-29 17:23 checkvMraRW -rw--- 1 nagios nagios0 2009-12-11 19:20 checkqWrl04 -rw--- 1 nagios nagios0 2009-11-04 12:53 checkQq7MX0 -rw--- 1 nagios nagios0 2009-12-29 17:23 checknzRtjN -rw--- 1 nagios nagios0 2009-11-05 04:12 checkM6htUZ -rw--- 1 nagios nagios0 2009-11-27 23:24 checkhVbOzY -rw--- 1 nagios nagios0 2009-11-17 12:43 checkEVDugp -rw--- 1 nagios nagios0 2009-11-04 12:53 checkd4A1MS -rw--- 1 nagios nagios0 2009-11-05 04:12 checkBXX71Z -rw--- 1 nagios nagios0 2009-11-27 23:24 checkBLCU9i -rw--- 1 nagios nagios0 2009-11-04 12:53 checkAaA6Qr -rw--- 1 nagios nagios0 2009-12-06 01:27 check8avYJJ -rw--- 1 nagios nagios0 2009-12-29 17:23 cGZFyHr -rw--- 1 nagios nagios0 2010-01-26 15:23 cgBr4Y1.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cFxFlm3.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cD2T5eV.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 caw2Ihv.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c9iVtCP.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c7QYuXc.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c75znoT.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c6NV5yW.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c59UhZL.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c1m1KIr.ok drwxr-xr-x 3 root root 8 2009-10-20 01:44 ../ -rw--- 1 nagios nagios 249 2009-11-17 15:03 check47CLdy -rw--- 1 nagios nagios 252 2009-11-17 15:03 checkAE7iMM -rw--- 1 nagios nagios 277 2009-11-17 15:03 checkaNpheC -rw--- 1 nagios nagios 278 2010-01-26 15:23 checkWDzCTc -rw--- 1 nagios nagios
Re: [Nagios-users] Command execution failed
On 1/26/2010 12:01 AM, Marc Powell wrote: Can you run check_bacula.pl as the nagios user on the remote machine? If not, read any documentation you received with the plugin to see what other changes you need to make. I'm guessing that other programs it calls are not executable by the nagios user. I can run the command but get this message after: Can't locate DBI.pm in @INC (@INC contains: /etc/perl /usr/local/lib/perl/5.10.0 /usr/local/share/perl/5.10.0 /usr/lib/perl5 /usr/share/perl5 /usr/lib/perl/5.10 /usr/share/perl/5.10 /usr/local/lib/site_perl .) at /usr/local/nagios/libexec/check_bacula.pl line 39. BEGIN failed--compilation aborted at /usr/local/nagios/libexec/check_bacula.pl line 39. Maybe I need to install some dependencies but anyway I think that previous message error (Remote command execution failed: Permission denied, please try again) has nothing to do. -- Cheers ReynierPM -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Unexplainable disk space usage in nagios/var/spool/checkresults
2010/1/26 Wolfe, Robert robert.wo...@robertwolfe.org When you do an ‘ls –lash’ on this directory, what do you see? I already included that in my last post. The only difference was that I added -Sr to do a reverse sort by size (largest items at the bottom), and -F is included in my alias for ls, so '/' is shown at the end of directory names. However, this actually seems to be an issue with JFS and high turnover within a directory. http://www.mail-archive.com/jfs-discuss...@lists.sourceforge.net/msg00748.HTML I've forwarded this along to the JFS list already. I don't suppose there's any way to make Nagios not use so many constantly changing files in this directory is there? # ls -laSrh total 287M -rw--- 1 nagios nagios0 2010-01-26 15:23 cZar4eF.ok -rw--- 1 nagios nagios0 2010-01-04 19:15 cy8clT1.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cXaj1gN.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cvWmN4R.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cVvx4M8.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cvtm0Hd.ok -rw--- 1 nagios nagios0 2009-12-06 01:27 cvqSGDK -rw--- 1 nagios nagios0 2010-01-26 15:23 cv6pJ1B.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 ctwW4SD.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cTpIKAl.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cTm3Vzw.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 ctCCihr.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 csRsM4M.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cryC1Dq.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cRR4Ubm.ok -rw--- 1 nagios nagios0 2009-11-27 23:24 cqqacdj -rw--- 1 nagios nagios0 2010-01-26 15:23 cq75lkN.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cnxxq4z.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cnDr6c0.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cmxnujo.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cMPxHgo.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cMf26lf.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cLksJml.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 ckVnJcE.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cjjgyfh.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cjCc8ov.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 chokVz4.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cHN3b3M.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cHlsWuh.ok -rw--- 1 nagios nagios0 2009-12-11 19:19 checkyTpLfU -rw--- 1 nagios nagios0 2009-11-05 04:12 checkXbP1al -rw--- 1 nagios nagios0 2009-11-04 12:53 checkWx0m6z -rw--- 1 nagios nagios0 2009-11-05 04:12 checkwpqhVE -rw--- 1 nagios nagios0 2009-12-29 17:23 checkvMraRW -rw--- 1 nagios nagios0 2009-12-11 19:20 checkqWrl04 -rw--- 1 nagios nagios0 2009-11-04 12:53 checkQq7MX0 -rw--- 1 nagios nagios0 2009-12-29 17:23 checknzRtjN -rw--- 1 nagios nagios0 2009-11-05 04:12 checkM6htUZ -rw--- 1 nagios nagios0 2009-11-27 23:24 checkhVbOzY -rw--- 1 nagios nagios0 2009-11-17 12:43 checkEVDugp -rw--- 1 nagios nagios0 2009-11-04 12:53 checkd4A1MS -rw--- 1 nagios nagios0 2009-11-05 04:12 checkBXX71Z -rw--- 1 nagios nagios0 2009-11-27 23:24 checkBLCU9i -rw--- 1 nagios nagios0 2009-11-04 12:53 checkAaA6Qr -rw--- 1 nagios nagios0 2009-12-06 01:27 check8avYJJ -rw--- 1 nagios nagios0 2009-12-29 17:23 cGZFyHr -rw--- 1 nagios nagios0 2010-01-26 15:23 cgBr4Y1.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cFxFlm3.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 cD2T5eV.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 caw2Ihv.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c9iVtCP.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c7QYuXc.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c75znoT.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c6NV5yW.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c59UhZL.ok -rw--- 1 nagios nagios0 2010-01-26 15:23 c1m1KIr.ok drwxr-xr-x 3 root root 8 2009-10-20 01:44 ../ -rw--- 1 nagios nagios 249 2009-11-17 15:03 check47CLdy -rw--- 1 nagios nagios 252 2009-11-17 15:03 checkAE7iMM -rw--- 1 nagios nagios 277 2009-11-17 15:03 checkaNpheC -rw--- 1 nagios nagios 278 2010-01-26 15:23 checkWDzCTc -rw--- 1 nagios nagios 279 2010-01-26 15:23 checkpLyZS3 -rw--- 1 nagios nagios 280 2010-01-26 15:23 checkVSgE8U -rw--- 1 nagios nagios 281 2010-01-26 15:23 check6txgYX -rw--- 1 nagios nagios 285 2009-11-17 15:03 checkQXTDa5 -rw--- 1 nagios nagios 394 2010-01-26 15:23 cD2T5eV -rw--- 1 nagios nagios 395 2010-01-26 15:23 cgBr4Y1 -rw--- 1 nagios nagios 397 2010-01-26 15:23 cZar4eF -rw--- 1 nagios nagios 397 2010-01-26 15:23 ctwW4SD -rw--- 1 nagios nagios 401 2010-01-26 15:23 ctCCihr -rw--- 1 nagios
Re: [Nagios-users] Unexplainable disk space usage in nagios/var/spool/checkresults
On Jan 26, 2010, at 9:44 AM, Casey Allen Shobe wrote: Hi! I have Nagios installed on a 1GB partition, with everything installed under a nagios directory on that partition. I did not expect Nagios to start consuming a large amount of space, and now that it is, I cannot determine why or in what files, which is baffling. I've found the biggest culprit is the var/spool/checkresults directory within the Nagios installation. a `du` command on the directory shows that it is using 287MB. However, when I `ls -la` within that directory, I can't see any file using more than a single kilobyte, and there aren't that many files. However, an `ls -ld` on the directory itself shows that the directory itself is using 286MB. That doesn't make sense to me, and so I'm looking for input! My gut instinct is that this is jfs doing something unexpected, in which case I can't offer much advice. The first thing I'd verify though is that nagios isn't actually still holding a bunch of files open in that directory, even after they've been removed. The disk space would still be seen as used even though you can't see the files any longer. Use 'lsof | grep checkresults' and see if you have any output for files that can't be seen in the directory any longer. I expect that you would see thousands or more if that is the case. I wonder if the following approach would help correct things (but I'd like to understand why they're in this state first): Stop nagios, move the checkresults directory to checkresults.old, make a new checkresults directory, copy all the old files into it, remove the empty .old directory, start nagios. That would certainly work in either case... -- marc -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] CGI authentication problems
I originally set up my nagios environment with an administrative user named nagiosadmin and basic authentication. At that time everything worked fine. I tried to set up Active Directory authentication, but that caused numerous problems, including the fact that my Active Directory users couldn't do things like re-schedule service checks. At the time, I thought this was an AD issue. I have since decided to abandon AD authentication altogether, although I'm now using Digest authentication (per the official documentation) instead of Basic. Now, however, I'm still finding that I can't re-schedule service checks (or schedule downtime) as any user other than nagiosadmin. I have created passwords for both users in my .digest_pw file. My cgi.cfg file shows: physical_html_path=/usr/share/nagios3/htdocs url_html_path=/nagios3 use_authentication=1 default_user_name= authorized_for_system_information=michellea,nagiosadmin authorized_for_system_commands=michellea,nagiosadmin authorized_for_configuration_information=michellea,nagiosadmin authorized_for_all_hosts=michellea,nagiosadmin authorized_for_all_host_commands=michellea,nagiosadmin authorized_for_all_services=michellea,nagiosadmin authorized_for_all_service_commands=michellea,nagiosadmin default_statusmap_layout=5 default_statuswrl_layout=4 refresh_rate=90 ping_syntax=/bin/ping -n -U -c 5 $HOSTADDRESS$ main_config_file=/etc/nagios3/nagios.cfg The user michellea is also set as a contact for all of the hosts. When I log in to the web interface, it shows Logged in as *michellea*. I can see all information for all hosts and services, but I still can't issue commands. What am I missing? Michelle -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] CGI authentication problems
I do use Likewise-Open for authentication to the system, but I was trying to make the Apache authentication completely independent. For some reason, until you asked me that, it didn't even occur that there could be a conflict between the two. I just tried adding a completely different user (michellea is the active directory user name), and that works perfectly fine. Apparently the two of them were conflicting. Thank you! On Tue, Jan 26, 2010 at 2:05 PM, Matt Simmons standalone.sysad...@gmail.com wrote: You aren't using Likewise-Open for your AD authentication, are you? --Matt On Tue, Jan 26, 2010 at 1:54 PM, Michelle Avery michelleave...@gmail.com wrote: I originally set up my nagios environment with an administrative user named nagiosadmin and basic authentication. At that time everything worked fine. I tried to set up Active Directory authentication, but that caused numerous problems, including the fact that my Active Directory users couldn't do things like re-schedule service checks. At the time, I thought this was an AD issue. I have since decided to abandon AD authentication altogether, although I'm now using Digest authentication (per the official documentation) instead of Basic. Now, however, I'm still finding that I can't re-schedule service checks (or schedule downtime) as any user other than nagiosadmin. I have created passwords for both users in my .digest_pw file. My cgi.cfg file shows: physical_html_path=/usr/share/nagios3/htdocs url_html_path=/nagios3 use_authentication=1 default_user_name= authorized_for_system_information=michellea,nagiosadmin authorized_for_system_commands=michellea,nagiosadmin authorized_for_configuration_information=michellea,nagiosadmin authorized_for_all_hosts=michellea,nagiosadmin authorized_for_all_host_commands=michellea,nagiosadmin authorized_for_all_services=michellea,nagiosadmin authorized_for_all_service_commands=michellea,nagiosadmin default_statusmap_layout=5 default_statuswrl_layout=4 refresh_rate=90 ping_syntax=/bin/ping -n -U -c 5 $HOSTADDRESS$ main_config_file=/etc/nagios3/nagios.cfg The user michellea is also set as a contact for all of the hosts. When I log in to the web interface, it shows Logged in as michellea. I can see all information for all hosts and services, but I still can't issue commands. What am I missing? Michelle -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] CGI authentication problems
On Jan 26, 2010, at 12:54 PM, Michelle Avery wrote: The user michellea is also set as a contact for all of the hosts. When I log in to the web interface, it shows Logged in as michellea. I can see all information for all hosts and services, but I still can't issue commands. What am I missing? What's the error that you see? Did you enable external commands in nagios.cfg and specify a valid path for the command pipe? Does it exist and is it writable by the user your web server runs as? -- Marc -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Defining service dependency
Anyone please? I wonders why 1500+ dependencies? Thanks in advance. On Tue, Jan 26, 2010 at 1:36 PM, Asrai khn asrai...@gmail.com wrote: On Tue, Jan 26, 2010 at 10:33 AM, Asrai khn asrai...@gmail.com wrote: Later I'll switch my dependency configuration to something simple by using Time-saving Tricks --- http://nagios.sourceforge.net/docs/3_0/objecttricks.html#same_host_dependency Tell me one thing, is it ok to get ... Checking service dependencies... Checked 10 service dependencies. For the 5 dependent_service_description above? I have just configured service dependency using *All Services In Multiple Servicegroups *nagios docs. 1. First of all i have created (4) servicegroups for services which using nrpe for checking remote hosts including a servicegroup for service 'nrpe' on 8 different hosts The other 3 servicegroups contains total 95 services 2. Then defined service dependency using following config define servicedependency{ dependent_servicegroup_name Sol-CPU-Load,Sol-Partitions,Sol-Processes servicegroup_nameSol-nrpe execution_failure_criteriaw,u,c notification_failure_criteriaw,u,c } Everything is working just fine I have tested it by stopping nrpe on one of the server and nagios only send notification for master process 'nrpe' by suppressing all all other checks and notification for the host. However I am confusing with the nagios -v check its giving me too much counts for service dependency and i wonders if this have any effect on monitoring host (nagios) performance, working etc. *sudo nagios -v /etc/nagios/nagios.cfg Checking service dependencies... Checked 1530 service dependencies.* 1530 service dependencies. and this also include the previous 10 for another server, but I wonders what this counts mean? I'll greatly appreciate if someone shed some light on it. Thanks. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Unexplainable disk space usage in nagios/var/spool/checkresults
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 26/01/10 12:29 PM, Casey Allen Shobe wrote: 2010/1/26 Wolfe, Robert robert.wo...@robertwolfe.org mailto:robert.wo...@robertwolfe.org When you do an ‘ls –lash’ on this directory, what do you see? I already included that in my last post. The only difference was that I added -Sr to do a reverse sort by size (largest items at the bottom), and -F is included in my alias for ls, so '/' is shown at the end of directory names. However, this actually seems to be an issue with JFS and high turnover within a directory. http://www.mail-archive.com/jfs-discuss...@lists.sourceforge.net/msg00748.HTML I would highly recommend using tmpfs for the checkresults directory. This is especially important on logging file systems like JFS, XFS, Ext3+ and ReiserFS since every meta-data operation will cause a log write+flush (at the very least) which will cause a constant stream of unneeded IO. 1. Add this line to fstab: checkresults /usr/local/nagios/var/checkresults tmpfs uid=1001,gid=103,mode=0755 0 0 (in one line, nevermind auto line-warp... make sure the uid/gid are that of the nagios user). 2. Make sure the dir exists (it should if nagios uses it!) and that nagios is stopped. Delete any stale file and/or re-create the directory if you need to free space used by it (i.e. Jfs bug), then mount it (it should also get mounted at boot time before Nagios starts since it's in the fstab): # mount -a 3. Start nagios - it should now write all files in a memory fs (tmpfs). Note: you can also add the size=nbytes mount option... it limits the amount of RAM that can be used by the tmpfs, however unless you hit a nasty bug it shouldn't be a problem (plus you can monitor it with check_disk...) Note 2: I also use one for temp/status.dat files... - -- Thomas -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLX9/w6dZ+Kt5BchYRAkgXAKDrkRWtY2O1sRg2eD8h1FbkWlvMNgCgmtq2 fFZKTDXXrNaPgVf6CLZlD7g= =WfLm -END PGP SIGNATURE- -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Unexplainable disk space usage in nagios/var/spool/checkresults
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 27/01/10 01:40 AM, Thomas Guyot-Sionnest wrote: This is especially important on logging file systems like JFS, XFS, I meant journalled file systems. I need sleep... :) - -- Thomas -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLX+I86dZ+Kt5BchYRAj9MAJ4xeVasEGU0+IIaInK+eDHGIwE/xACgvPS0 DE9kkm4+IBWe9GSNpb1g9/M= =BXu8 -END PGP SIGNATURE- -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Need to monitor Nagios Service Automatically
Hi everyone, I'm using Nagios to monitor some servers. All the things are good. But these days I'm thinking about one question - What if Nagios went down?? If my Nagios went down, I would never get any notification. That would be really bad. *So is there a good solution to monitor the Nagios service??* Advise will be appreciated. Thanks and regards, Sunny, -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null