[Nagios-users] hosts/services exposed to contact through groups
Hello, I have a contact (nagios_user1) listed in a host (host1) and service (service1) definition. As expected, that contact can only view that host and service in the Nagios web interface (nagios_user1 can view host1 and service1); nagios_user1 cannot view other hosts/services - again, this is expected. What's not expected is nagios_user1 can view other hosts/services when viewing the hosts groups/services groups pages for which nagios_user1 is not listed as a contact for. Selecting these hosts/services displays an unauthorized page but I'm looking to not expose those other hosts/services through the host groups/services groups pages. Thoughts, suggestions? -- Jacob -- Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_sfd2d_oct___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] hosts change from soft to hard state after first attempt
I am looking for some help tracking down what I think is potentially a timing bug or something of the sort. I show entries in my logs which look like this: [1327419369] HOST ALERT: testhost;DOWN;SOFT;1;CRITICAL - 10.11.12.13: rta nan, lost 100% [1327419369] HOST ALERT: testhost;DOWN;HARD;1;CRITICAL - 10.11.12.13: rta nan, lost 100% The host entry is configured as active on-demand (not scheduled) with a check attempts of 5. I see the above quite frequently where within the same second, it changes state to DOWN/SOFT 1/5 and then to DOWN/HARD 1/5. After performing some debugging, it appears that there are two service checks failing back-to-back and both scheduling a on-demand host check within fractions of a second. Looking through the codebase, it looks like there are provisions in place to abort a check if there is one already executing, but based on the output of the debug, there are two on-demand host checks being executed simultaneously. Any help would be appreciated. System is running Nagios 3.3.1 on RedHat ES5 32-bit. [1327419359.134875] [016.0] [pid=12950] Attempting to run scheduled check of service 'check_ssh' on host 'testhost': check options=0, latency=0.134000 [1327419359.134899] [016.0] [pid=12950] Checking service 'check_ssh' on host 'testhost'... [1327419359.222060] [016.0] [pid=12950] Attempting to run scheduled check of service 'check_icmp_latency' on host 'testhost': check options=0, latency=0.222000 [1327419359.222085] [016.0] [pid=12950] Checking service 'check_icmp_latency' on host 'testhost'... [1327419365.212247] [016.1] [pid=12950] Handling check result for service 'check_ssh' on host 'testhost'... [1327419365.212254] [016.0] [pid=12950] ** Handling check result for service 'check_ssh' on host 'testhost'... [1327419365.212260] [016.1] [pid=12950] HOST: testhost, SERVICE: check_ssh, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: CRITICAL - Timed out connecting to remote host [1327419365.212290] [016.0] [pid=12950] ** Running async check of host 'testhost'... [1327419365.212306] [016.0] [pid=12950] Checking host 'testhost'... [1327419365.220665] [016.0] [pid=12950] Scheduling a non-forced, active check of service 'check_ssh' on host 'testhost' @ Tue Jan 24 15:38:29 2012 [1327419365.223831] [016.1] [pid=12950] Checking service 'check_ssh' on host 'testhost' for flapping... [1327419365.223854] [016.1] [pid=12950] Checking host 'testhost' for flapping... [1327419365.265025] [016.1] [pid=12950] Handling check result for service 'check_icmp_latency' on host 'testhost'... [1327419365.265033] [016.0] [pid=12950] ** Handling check result for service 'check_icmp_latency' on host 'testhost'... [1327419365.265040] [016.1] [pid=12950] HOST: testhost, SERVICE: check_icmp_latency, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: CRITICAL - 10.11.12.13: rta nan, lost 100%|rta=0.000ms;600.000;1000.000;0; pl=100%;33;66;; \n [1327419365.265106] [016.0] [pid=12950] ** Running async check of host 'testhost'... [1327419365.265364] [016.0] [pid=12950] Scheduling a non-forced, active check of service 'check_icmp_latency' on host 'testhost' @ Tue Jan 24 15:38:29 2012 [1327419365.266349] [016.1] [pid=12950] Checking service 'check_icmp_latency' on host 'testhost' for flapping... [1327419365.266360] [016.1] [pid=12950] Checking host 'testhost' for flapping... [1327419369.079254] [016.1] [pid=12950] Handling check result for host 'testhost'... [1327419369.079261] [016.1] [pid=12950] ** Handling async check result for host 'testhost'... [1327419369.080141] [016.1] [pid=12950] HOST: testhost, ATTEMPT=1/5, CHECK TYPE=ACTIVE, STATE TYPE=HARD, OLD STATE=0, NEW STATE=1 [1327419369.080175] [016.1] [pid=12950] Pre-handle_host_state() Host: testhost, Attempt=1/5, Type=SOFT, Final State=1 [1327419369.080290] [016.1] [pid=12950] Post-handle_host_state() Host: testhost, Attempt=1/5, Type=SOFT, Final State=1 [1327419369.080327] [016.1] [pid=12950] Checking host 'testhost' for flapping... [1327419369.080368] [016.0] [pid=12950] Scheduling a non-forced, active check of host 'testhost' @ Tue Jan 24 15:37:09 2012 [1327419369.084229] [016.1] [pid=12950] ** Async check result for host 'testhost' handled: new state=1 [1327419369.084266] [016.1] [pid=12950] Handling check result for host 'testhost'... [1327419369.084273] [016.1] [pid=12950] ** Handling async check result for host 'testhost'... [1327419369.084348] [016.1] [pid=12950] HOST: testhost, ATTEMPT=1/5, CHECK TYPE=ACTIVE, STATE TYPE=SOFT, OLD STATE=1, NEW STATE=1 [1327419369.084394] [016.1] [pid=12950] Pre-handle_host_state() Host: testhost, Attempt=1/5, Type=HARD, Final State=1 [1327419369.798706] [016.1] [pid=12950] Post-handle_host_state() Host: testhost, Attempt=1/5, Type=HARD, Final State=1 [1327419369.798734] [016.1] [pid=12950] Checking host 'testhost' for flapping... [1327419369.798781] [016.1]
[Nagios-users] Hosts without any contacts or contactgroups
What is the best practice for defining a host without any contacts or contactgroups? We use a custom application to monitor nagios which (among other things) serves as our notifier, so we have no need for nagios-based contacts, etc. Simply leaving out the contacts and contact_groups directives works, but generates a warning for every host we have, which I would like to avoid. Thanks, -- Brandon -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts without any contacts or contactgroups
I have a user called Nobody with no contact times and no contact information but it's enough to make the system happy. Dan -Original Message- From: Brandon Phelps [mailto:bphe...@gls.com] Sent: Thursday, November 10, 2011 1:02 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Hosts without any contacts or contactgroups What is the best practice for defining a host without any contacts or contactgroups? We use a custom application to monitor nagios which (among other things) serves as our notifier, so we have no need for nagios-based contacts, etc. Simply leaving out the contacts and contact_groups directives works, but generates a warning for every host we have, which I would like to avoid. Thanks, -- Brandon -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts without any contacts or contactgroups
That's what I've got now (below) but it just seems messy, maybe it's just me! define timeperiod { timeperiod_name int_tp_none alias Internal Timeperiod None } define contact { contact_name nobody alias Pseudo Contact for hosts with no contacts service_notification_period int_tp_none host_notification_period int_tp_none service_notification_commands notify-by-epager host_notification_commandshost-notify-by-epager } On 11/10/2011 02:08 PM, Daniel Wittenberg wrote: I have a user called Nobody with no contact times and no contact information but it's enough to make the system happy. Dan -Original Message- From: Brandon Phelps [mailto:bphe...@gls.com] Sent: Thursday, November 10, 2011 1:02 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Hosts without any contacts or contactgroups What is the best practice for defining a host without any contacts or contactgroups? We use a custom application to monitor nagios which (among other things) serves as our notifier, so we have no need for nagios-based contacts, etc. Simply leaving out the contacts and contact_groups directives works, but generates a warning for every host we have, which I would like to avoid. Thanks, -- RSA(R) Conference 2012 Save $700 by Nov 18 Register now http://p.sf.net/sfu/rsa-sfdev2dev1 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts showing up as down even though they are not being monitored
On Thu, Jan 28, 2010 at 1:58 AM, Kaplan, Andrew H. ahkap...@partners.orgwrote: Hi there -- I went through the steps you mentioned in your e-mail, and the problem continues to persist. The preflight check did mention the clients in question did not have any services associated with them. Aside from those warnings, the Nagios application did not have any complaints or problems starting up. You've commented out your service definitions for these hosts, but still have host definitions left in. These hosts are being checked with a check_command (most probably check-host-alive in a default Nagios), and since the machines are offline, they show up in Nagios as down. Go to your $NAGIOSHOME/etc directory and do a grep -R machinename * where machinename is the host you want to remove. It'll show you all the places where you mention that host, which will let you know which file you still need to edit. Then do the verify+reload dance again :-) -- *From:* Gary Every [mailto:gev...@gmail.com] *Sent:* Wednesday, January 27, 2010 6:52 PM *To:* Kaplan, Andrew H. *Subject:* Re: [Nagios-users] hosts showing up as down even though they are not being monitored You still need to restart nagios. I'm assuming you're running it on *nix of some kind. These are linux commands below: Check your config first tho via: /path/to/nagios/bindir/nagios -v /path/to/nagios.cfg If that checks out without errors, then restart nagios /path/to/nagios/[STARTUPSCRIPT|nagios] restart On Wed, Jan 27, 2010 at 4:13 PM, Kaplan, Andrew H. ahkap...@partners.orgwrote: Hi there -- We are running Nagios 3.1.2 on one of our servers, and the following anomaly is occurring: Several of our clients have been taken offline, during which time they were detected as being down. I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. What other steps do I need to take in order to correct this problem? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine athttp://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Gary Every Pay it Forward! -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Martin Melin op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being
[Nagios-users] hosts showing up as down even though they are not being monitored
Hi there -- We are running Nagios 3.1.2 on one of our servers, and the following anomaly is occurring: Several of our clients have been taken offline, during which time they were detected as being down. I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. What other steps do I need to take in order to correct this problem? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts showing up as down even though they are not being monitored
Hi there -- I went through the steps you mentioned in your e-mail, and the problem continues to persist. The preflight check did mention the clients in question did not have any services associated with them. Aside from those warnings, the Nagios application did not have any complaints or problems starting up. From: Gary Every [mailto:gev...@gmail.com] Sent: Wednesday, January 27, 2010 6:52 PM To: Kaplan, Andrew H. Subject: Re: [Nagios-users] hosts showing up as down even though they are not being monitored You still need to restart nagios. I'm assuming you're running it on *nix of some kind. These are linux commands below: Check your config first tho via: /path/to/nagios/bindir/nagios -v /path/to/nagios.cfg If that checks out without errors, then restart nagios /path/to/nagios/[STARTUPSCRIPT|nagios] restart On Wed, Jan 27, 2010 at 4:13 PM, Kaplan, Andrew H. ahkap...@partners.org wrote: Hi there -- We are running Nagios 3.1.2 on one of our servers, and the following anomaly is occurring: Several of our clients have been taken offline, during which time they were detected as being down. I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. What other steps do I need to take in order to correct this problem? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Gary Every Pay it Forward! -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts showing up as down even though they are not being monitored
Hi there -- Just for clarification, are you referring to the hosts.cfg, hostextinfo.cfg, hostgroups.cfg file, or all of the above? Thanks. From: L.Q. Marshall [mailto:qmarsh...@inetspace.net] Sent: Wednesday, January 27, 2010 8:04 PM To: 'nagios-users@lists.sourceforge.net' Subject: RE: [Nagios-users] hosts showing up as down even though they are not being monitored snip I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. restart (reload?) nagios service. Also verify that you have removed the host entries. - LQ The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts stuck on pending.
Ok so I have about 20 or so hosts which are stuck on PENDING and I am unable to get them to work properly. The definitions for a working and a stuck host are identical and can be seen below. However, nothing I do or delete (retendion.dat, cache files, spool files) gets the hosts unstuck... Any ideas? Stuck host: define host { host_name gab202kiosk.gab330 address 129.#.#.# max_check_attempts3 notification_interval 15 notification_options d,u,r active_checks_enabled 1 passive_checks_enabled0 notifications_enabled 1 check_period 24x7 notification_period 24x7 contact_groupskiosks,admins parents 129.#.#.# } define service { service_description check_ping check_command check_ping!20,80%!30,90% host_name gab202kiosk.gab330 check_period 24x7 notification_period 24x7 contact_groupskiosks,admins event_handler_enabled 0 max_check_attempts3 normal_check_interval 5 retry_check_interval 1 notification_interval 15 notification_options w,u,c,r active_checks_enabled 1 passive_checks_enabled0 notifications_enabled 1 check_freshness 0 freshness_threshold 86400 } Working host: define host { host_name gab302kiosk.gab330 address 129.#.#.# max_check_attempts3 notification_interval 15 notification_options d,u,r active_checks_enabled 1 passive_checks_enabled0 notifications_enabled 1 check_period 24x7 notification_period 24x7 contact_groupskiosks,admins parents 129.#.#.# } define service { service_description check_ping check_command check_ping!20,80%!30,90% host_name gab302kiosk.gab330 check_period 24x7 notification_period 24x7 contact_groupskiosks,admins event_handler_enabled 0 max_check_attempts3 normal_check_interval 5 retry_check_interval 1 notification_interval 15 notification_options w,u,c,r active_checks_enabled 1 passive_checks_enabled0 notifications_enabled 1 check_freshness 0 freshness_threshold 86400 } -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts with dynamic IP addresses
2009/5/5 Ken Tang kent...@berkeley.edu: I'd like to monitor a few user laptops in our environment. They use wireless so they often get dynamic IP addresses. Is there a way that Nagios can accept any connections, or, a range of addresses for a host? I don't have control of the DHCP server so I cannot assign a fixed IP address. I would have the laptops send passive checks back to Nagios. You can use nsca if you have your own plugins or use NSClient++ which is also capable of sending results to Nagios via nsca. hth, Jim -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts with dynamic IP addresses
Ken Tang wrote: I'd like to monitor a few user laptops in our environment. They use wireless so they often get dynamic IP addresses. Is there a way that Nagios can accept any connections, or, a range of addresses for a host? I don't have control of the DHCP server so I cannot assign a fixed IP address. There is no way nagios can accept a range of ip-addresses for a host, since it wouldn't know which of the presumably multiple hosts in the range is responding to what. Since you don't have control over the DHCP server, following Jim Avery's advice and letting the laptops send passive check-results to your nagios server is definitely the best solution. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Register now for Nordic Meet on Nagios, June 3-4 in Stockholm http://nordicmeetonnagios.op5.org/ Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts with dynamic IP addresses
Does your network use dynamic DNS by any chance? If you do, you could use the laptop's DNS name rather than its IP address. You may also be able to use a script that enumerates all the machines in the network. On Windows, look into the commands net view /Domain: and nbtstat -c as some starting points (Samba also has similar commands) Ken Tang wrote: I'd like to monitor a few user laptops in our environment. They use wireless so they often get dynamic IP addresses. Is there a way that Nagios can accept any connections, or, a range of addresses for a host? I don't have control of the DHCP server so I cannot assign a fixed IP address. -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts are Critical
I am running CentOS 5 and I installed nagios thru yum using the RPMForge repository. When I log into the web interface, it shows my server as down/critical under hosts. Obviously since I'm accessing the host remotely via HTTP it isn't down. Any thoughts on how to fix this? Also, this was a clean install of the OS that I installed Apache, MySQL, PHP, and Nagios onto (including dependencies). Any help would be great. Thanks, Andy Carlson IT Manager City of Princeton www.princeton-il.com Office (815) 875-2631 ext. 253 Cell (815) 719-0023 41° 22' 30 N 89° 27' 55 W --- The ubiquity of the Internet is more important than the technology of the Internet -- Jeff Bezos -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts are Critical
On Mar 23, 2009, at 10:15 AM, acarl...@princeton-il.com wrote: I am running CentOS 5 and I installed nagios thru yum using the RPMForge repository. When I log into the web interface, it shows my server as down/critical under hosts. I can't speak to the specifics of what RPMForge does for defaults but generally, your host's check_command doesn't work or doesn't accurately reflect the status of the host. What is the Status Information? What is the host{} definition? What is the command{} definition it references as it's check_command? Obviously since I'm accessing the host remotely via HTTP it isn't down. Any thoughts on how to fix this? Also, this was a clean install of the OS that I installed Apache, MySQL, PHP, and Nagios onto (including dependencies). An somewhat arbitrary guess is that the host check_command tries to ping localhost and that doesn't work. Can you ping localhost as the nagios user? If not, you may need to SUID /bin/ping so that it can be run by non-root users. -- Marc -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts are Critical
Marc, The status (of the host in Nagios) is down. The host definition is: define host{ use linux-server; Name of host template$ ; This host definition $ ; in (or inherited by) $ host_name localhost alias localhost address 127.0.0.1 } This is how it is out-of-the-box. How would I impliment a check_command? Thanks, Andy Quoting Marc Powell m...@ena.com: On Mar 23, 2009, at 10:15 AM, acarl...@princeton-il.com wrote: I am running CentOS 5 and I installed nagios thru yum using the RPMForge repository. When I log into the web interface, it shows my server as down/critical under hosts. I can't speak to the specifics of what RPMForge does for defaults but generally, your host's check_command doesn't work or doesn't accurately reflect the status of the host. What is the Status Information? What is the host{} definition? What is the command{} definition it references as it's check_command? Obviously since I'm accessing the host remotely via HTTP it isn't down. Any thoughts on how to fix this? Also, this was a clean install of the OS that I installed Apache, MySQL, PHP, and Nagios onto (including dependencies). An somewhat arbitrary guess is that the host check_command tries to ping localhost and that doesn't work. Can you ping localhost as the nagios user? If not, you may need to SUID /bin/ping so that it can be run by non-root users. -- Marc -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Andy Carlson IT Manager City of Princeton www.princeton-il.com Office (815) 875-2631 ext. 253 Cell (815) 719-0023 41° 22' 30 N 89° 27' 55 W --- The ubiquity of the Internet is more important than the technology of the Internet -- Jeff Bezos -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts are Critical
2009/3/23 acarl...@princeton-il.com: Marc, it also is giving me the following error: Return code of 127 is out of bounds - plugin may be missing Sorry for not mentioning that before. Thanks, Andy yeah, that's probably the problem - nagios can't find / can't run the plugin. Make sure that the plugins are installed, nagios can run them, and nagios knows where they are. presumably you already installed the plugins through the RPMForge repository, so they should be executable by the nagios user. As for telling nagios where they are, you should have a commands file that defines commands. It'll use one the $USER$, most likely $USER1$. Edit resources.cfg and point $USER1$ (or whatever) to the correct location of your nagios plugins, which will be where you installed them. Probably in /usr/lib/nagios/libexec on centos if memory servers me right. .r' -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts report 'DOWN, HARD' after first attempt.
I am running a distributed monitoring system using Nagios 2.11 on FreeBSD 6.3. I use NSCA to send host and services events to the central server from the slave servers and have always had the following problem: A distributed server notices a host service is non-Ok and fires off check-host-alive. I have it set up to do check_ICMP and so it fires off five ICMP packets. Since the network isn't always perfect those five packets get dropped. However, I have my max_retry_interval set to 3 so it fires off another check_ICMP which completes just fine. As a result I see the following events take place on the slave server: [01-16-2009 15:18:46] HOST ALERT: s3200.blah.net;UP;SOFT;2;OK - 10.XX.XX.XX: rta 100.294ms, lost 0% [01-16-2009 15:18:46] HOST ALERT: s3200.blah.net;DOWN;SOFT;1;CRITICAL - 10.XX.XX.XX: rta nan, lost 100% However on the central server I see the following: [01-16-2009 15:19:02] HOST NOTIFICATION: NOC-email;s3200.blah.net;UP;host-notify-by-email;OK - 10.XX.XX.XX: rta 100.294ms, lost 0% [01-16-2009 15:19:01] HOST ALERT: s3200.blah.net;UP;HARD;1;OK - 10.XX.XX.XX: rta 100.294ms, lost 0% [01-16-2009 15:19:01] HOST NOTIFICATION: NOC-email;s3200.blah.net;DOWN;host-notify-by-email;CRITICAL - 10.XX.XX.XX: rta nan, lost 100% [01-16-2009 15:19:01] HOST ALERT: s3200.blah.net;DOWN;HARD;1;CRITICAL - 10.XX.XX.XX: rta nan, lost 100% The central server is immediately flagging the host as DOWN, HARD in spite of having the same max_retry_interval = 3 setting. On some hosts this is generating a tone of false HOST DOWN notifications. Is there any way to fix it? Jonathan Call This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt.
On Fri, 16 Jan 2009, Jonathan Call wrote: I am running a distributed monitoring system using Nagios 2.11 on FreeBSD 6.3. I use NSCA to send host and services events to the central server from the slave servers and have always had the following problem: A distributed server notices a host service is non-Ok and fires off check-host-alive. I have it set up to do check_ICMP and so it fires off five ICMP packets. Since the network isn't always perfect those five packets get dropped. However, I have my max_retry_interval set to 3 so it fires off another check_ICMP which completes just fine. As a result I see the following events take place on the slave server: [01-16-2009 15:18:46] HOST ALERT: s3200.blah.net;UP;SOFT;2;OK - 10.XX.XX.XX: rta 100.294ms, lost 0% [01-16-2009 15:18:46] HOST ALERT: s3200.blah.net;DOWN;SOFT;1;CRITICAL - 10.XX.XX.XX: rta nan, lost 100% However on the central server I see the following: [01-16-2009 15:19:02] HOST NOTIFICATION: NOC-email;s3200.blah.net;UP;host-notify-by-email;OK - 10.XX.XX.XX: rta 100.294ms, lost 0% [01-16-2009 15:19:01] HOST ALERT: s3200.blah.net;UP;HARD;1;OK - 10.XX.XX.XX: rta 100.294ms, lost 0% [01-16-2009 15:19:01] HOST NOTIFICATION: NOC-email;s3200.blah.net;DOWN;host-notify-by-email;CRITICAL - 10.XX.XX.XX: rta nan, lost 100% [01-16-2009 15:19:01] HOST ALERT: s3200.blah.net;DOWN;HARD;1;CRITICAL - 10.XX.XX.XX: rta nan, lost 100% The central server is immediately flagging the host as DOWN, HARD in spite of having the same max_retry_interval = 3 setting. On some hosts this is generating a tone of false HOST DOWN notifications. Is there any way to fix it? The max_check_attempts only applies to active checks, not the passive ones you're sending the central server (at least I assume when you said max_retry_interval you meant max_check_attempts) -- and you may note that SOFT and HARD are only relative to the server doing the checking; they probably aren't passed as part of the passive check submission process. In short, passive host checks are a bit of a pain. I'm not sure exactly how you're passing check results to the central server, but you may want to consider modifying the process to only send host check results when they are in a hard state. -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt.
-Original Message- From: Patrick Morris [mailto:patrick.mor...@hp.com] Sent: Friday, January 16, 2009 11:40 AM To: Jonathan Call Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt. ... I'm not sure exactly how you're passing check results to the central server, but you may want to consider modifying the process to only send host check results when they are in a hard state. That sounds like an excellent recommendation. Here is my host check command: $USER1$/custom/submit_host_check_result.sh $HOSTNAME$ $HOSTSTATEID$ '$HOSTOUTPUT$' I'll need to modify it to be like this: $USER1$/custom/submit_host_check_result.sh $HOSTNAME$ $HOSTSTATEID$ '$HOSTOUTPUT$' '$HOSTSTATETYPE$' And then my NSCA host script would then become: -- #!/bin/sh # Arguments and corresponding NAGIOS API variable # $1 = $HOSTNAME$ # $2 = $HOSTSTATEID$ # $3 = $HOSTOUTPUT$ # $4 = $HOSTSTATETYPE$ # # The variables must be piped in as tab delimited variables # with a newline termination if [ $4 = HARD ]; then /usr/bin/printf %s\t%s\t%s\n $1 $2 $3 | /usr/local/sbin/send_nsca XXX.XXX.XXX.XXX -c /usr/local/etc/send_nsca.cfg fi # Do nothing for SOFT -- Thank you, Jonathan This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt.
On Jan 16, 2009, at 12:40 PM, Patrick Morris wrote: The max_check_attempts only applies to active checks, not the passive ones you're sending the central server (at least I assume when you said max_retry_interval you meant max_check_attempts) -- and you may note that SOFT and HARD are only relative to the server doing the checking; they probably aren't passed as part of the passive check submission process. Correct, all passive host checks are assumed to be HARD states. Note that this is addressed in nagios-3 -- http://nagios.sourceforge.net/docs/3_0/configmain.html#passive_host_checks_are_soft -- Marc -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt.
-Original Message- From: Marc Powell [mailto:m...@ena.com] Sent: Friday, January 16, 2009 1:20 PM To: nagios-users Mailinglist Subject: Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt. On Jan 16, 2009, at 12:40 PM, Patrick Morris wrote: The max_check_attempts only applies to active checks, not the passive ones you're sending the central server (at least I assume when you said max_retry_interval you meant max_check_attempts) -- and you may note that SOFT and HARD are only relative to the server doing the checking; they probably aren't passed as part of the passive check submission process. Correct, all passive host checks are assumed to be HARD states. Note that this is addressed in nagios-3 -- http://nagios.sourceforge.net/docs/3_0/configmain.html#passive_host_chec ks _are_soft -- Marc If they're all assumed to be SOFT, then a host failure would never trigger a notification? Another potential option, if you're not using NSCA (like those using the OCP_daemon) is to have the slave servers send out the notification emails instead of the central one. The slaves would be active monitors and would honor the host's max_check_attempts variable. This of course introduces other problems if the slave is behind a restrictive firewall. Jonathan This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts report 'DOWN, HARD' after first attempt.
On Jan 16, 2009, at 3:15 PM, Jonathan Call wrote: http://nagios.sourceforge.net/docs/3_0/configmain.html#passive_host_chec ks_are_soft If they're all assumed to be SOFT, then a host failure would never trigger a notification? My interpretation of the parameter is that passive host results are not automatically HARD but instead follow the same processes as an active check. Upon reaching max_check_attempts, nagios sets the host into a HARD state. That would be most logical given the previous behavior and the other changes to host checking but I don't really care enough about it to go look in the source (I'm not running 3.x yet). ;) -- Marc -- This SF.net email is sponsored by: SourcForge Community SourceForge wants to tell your story. http://p.sf.net/sfu/sf-spreadtheword ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts randomly disappearing
Hello All, I have about 115 hosts in my nagios pool and I recently added a group of 7 UPSs. I made my own check using the check_snmp to check the load percentage. Funny thing is, some times they show up in the host detail and some times they do not. Refreshing a few times will get them to show, then another refresh usually causes them to disappear. Opening one of the hosts usually results in the message: Error: Host Status Information Not Found! a few refreshes gets the host state to become viewable. I find this behavior only with the 7 hosts i just added. The config file is: #cat ups.cfg #UPS1 define host{ host_name ups-r1.hou.example.com alias UPS Rack 1 address 192.168.96.8 check_command check-host-alive max_check_attempts 10 process_perf_data 1 retain_status_information 1 retain_nonstatus_information0 contact_groups web-admins notification_optionsd,r notification_interval 30 notification_period 24x7 parents co1.hou.example.com } define hostextinfo{ host_name ups-r1.hou.example.com icon_image www.jpg statusmap_image www.gd2 } define service{ host_name ups-r1.hou.example.com service_description check_ping check_command check_ping!140,25%!180,50% max_check_attempts 10 normal_check_interval 5 retry_check_interval3 check_period24x7 notification_interval 30 notification_period 24x7 notification_optionsw,c,r contact_groups web-admins } define service{ host_name ups-r1.hou.example.com service_description check_ups_load check_command check_ups_load!84!89 max_check_attempts 10 normal_check_interval 5 retry_check_interval3 check_period24x7 notification_interval 30 notification_period 24x7 notification_optionsw,c,r contact_groups web-admins } #UPS2 define host{ host_name ups-r2.hou.example.com alias UPS Rack 2 address 192.168.96.9 check_command check-host-alive max_check_attempts 10 process_perf_data 1 retain_status_information 1 retain_nonstatus_information0 contact_groups web-admins notification_optionsd,r notification_interval 30 notification_period 24x7 parents co1.hou.example.com } define hostextinfo{ host_name ups-r2.hou.example.com icon_image www.jpg statusmap_image www.gd2 } define service{ host_name ups-r2.hou.example.com service_description check_ping check_command check_ping!140,25%!180,50% max_check_attempts 10 normal_check_interval 5 retry_check_interval3 check_period24x7 notification_interval 30 notification_period 24x7 notification_optionsw,c,r contact_groups web-admins } define service{ host_name ups-r2.hou.example.com service_description check_ups_load check_command check_ups_load!84!89 max_check_attempts 10 normal_check_interval 5 retry_check_interval3 check_period24x7 notification_interval 30 notification_period 24x7 notification_optionsw,c,r contact_groups web-admins } #UPS3 define host{ host_name ups-r3.hou.example.com alias UPS Rack 3 address 192.168.96.10 check_command check-host-alive max_check_attempts 10 process_perf_data 1 retain_status_information 1 retain_nonstatus_information0 contact_groups web-admins
Re: [Nagios-users] Hosts randomly disappearing
On Dec 8, 2008, at 2:39 PM, Thomas Donnelly wrote: Hello All, I have about 115 hosts in my nagios pool and I recently added a group of 7 UPSs. I made my own check using the check_snmp to check the load percentage. Funny thing is, some times they show up in the host detail and some times they do not. Refreshing a few times will get them to show, then another refresh usually causes them to disappear. http://www.nagios.org/faqs/viewfaq.php?faq_id=21 -- Marc -- SF.Net email is Sponsored by MIX09, March 18-20, 2009 in Las Vegas, Nevada. The future of the web can't happen without you. Join us at MIX09 to help pave the way to the Next Web now. Learn more and register at http://ad.doubleclick.net/clk;208669438;13503038;i?http://2009.visitmix.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] hosts
I found this in the documentation: check_interval: NOTE: Do NOT enable regularly scheduled checks of a host unless you absolutely need to! Host checks are already performed on-demand when necessary, so there are few times when regularly scheduled checks would be needed. Regularly scheduled host checks can negatively impact performance - see the performance tuning tips for more information. This directive is used to define the number of time units between regularly scheduled checks of the host. Unless you've changed the interval_length directive from the default value of 60, this number will mean minutes. More information on this value can be found in the check scheduling documentation. If I do NOT have a check_interval, how often are hosts checked? - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts
On Jul 14, 2008, at 9:41 AM, James wrote: If I do NOT have a check_interval, how often are hosts checked? When necessary. The check scheduling documentation isn't complete yet for 3.x but the first section of the following page is still valid -- http://nagios.sourceforge.net/docs/2_0/networkreachability.html -- Marc - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts
On Mon, July 14, 2008 10:58 am, Marc Powell wrote: On Jul 14, 2008, at 9:41 AM, James wrote: If I do NOT have a check_interval, how often are hosts checked? When necessary. The check scheduling documentation isn't complete yet for 3.x but the first section of the following page is still valid -- http://nagios.sourceforge.net/docs/2_0/networkreachability.html What if there is one service (ping). Isn't it the same as having a check_interval on a host with no services? -- Marc - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts
What if there is one service (ping). Isn't it the same as having a check_interval on a host with no services? Depending on which version of Nagios you are running. Nagios 2.x host checks are run in series while Nagios 3.x most host checks are run in Parallel ( much faster as Nagios doesn't have to wait for the slow ping check to continue with everything else). So in Nagios 3.x it is similar to having it only check the host. But the logic of Nagios host and service checks are different. You should have a service check for ping and a host check with ping. And to your earlier question: You should check out Marc Powells post to the list earlier this year for some added insight to host checks keeping in mind that host checks are now performed faster in Nagios 3.x. http://article.gmane.org/gmane.network.nagios.user/52470 If you set the check_interval to be 0 disables regular checks or checks that are performed on a time interval. Nagios will then check the host if a service of the host is in a non-OK state. The documentation for the Nagios 3.x Service Check Scheduling has yet to be completed but you still might get some insight from the 2.x docs if you keep in mind the change in how host checks are executed http://nagios.sourceforge.net/docs/2_0/checkscheduling.html - Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts reboots too fast for check_alivenotification
Hello, You can also check the uptime of the server. Regards, Vincent Alloo TI France Design Systems Operations Manager Europe and Middle East IT Services Texas Instruments France E-Mail: [EMAIL PROTECTED] Phone: +33 4 93 22 26 97 Mobile: +33 6 82 13 00 80 -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Livio Zanol Puppim Sent: Friday, April 25, 2008 4:03 PM Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Hosts reboots too fast for check_alivenotification Or, you could try to use SNMPTRAP on the switches that the servers are, to inform link-down/link-up traps on interfaces and associates that interface to the server... []'s Lívio Zanol Puppim 2008/4/24, Aaron Devey [EMAIL PROTECTED]: You could have the server fire a script during reboots that submits a check result to nagios via NSCA. It might be a little more elaborate than what you were looking for, but it will always catch a reboot even when a host check misses it. -Aaron Rodrick Brown wrote: When one of my hosts reboots I'm never notified about the outage. Currently I'm using a custom script S99bootnotify to alert me when a host comes online, is there any way to shorten the polling for check_alive? I find it strange that a host could reboot and nagios not detect that outage. Thanks. --- Rodrick R. Brown Director, Systems Engineering Ballista Securities, LLC 120 Wall St. Suite 2400 P: 646 307 4709 C: 347 702 0012 F: 646 219-5872 E: rbrown(at)ballistasec.com - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts reboots too fast for check_alive notification
Or, you could try to use SNMPTRAP on the switches that the servers are, to inform link-down/link-up traps on interfaces and associates that interface to the server... []'s Lívio Zanol Puppim 2008/4/24, Aaron Devey [EMAIL PROTECTED]: You could have the server fire a script during reboots that submits a check result to nagios via NSCA. It might be a little more elaborate than what you were looking for, but it will always catch a reboot even when a host check misses it. -Aaron Rodrick Brown wrote: When one of my hosts reboots I'm never notified about the outage. Currently I'm using a custom script S99bootnotify to alert me when a host comes online, is there any way to shorten the polling for check_alive? I find it strange that a host could reboot and nagios not detect that outage. Thanks. --- Rodrick R. Brown Director, Systems Engineering Ballista Securities, LLC 120 Wall St. Suite 2400 P: 646 307 4709 C: 347 702 0012 F: 646 219-5872 E: rbrown(at)ballistasec.com - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts reboots too fast for check_alive notification
When one of my hosts reboots I'm never notified about the outage. Currently I'm using a custom script S99bootnotify to alert me when a host comes online, is there any way to shorten the polling for check_alive? I find it strange that a host could reboot and nagios not detect that outage. Thanks. --- Rodrick R. Brown Director, Systems Engineering Ballista Securities, LLC 120 Wall St. Suite 2400 P: 646 307 4709 C: 347 702 0012 F: 646 219-5872 E: rbrown(at)ballistasec.com - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts reboots too fast for check_alive notification
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Rodrick Brown Sent: Thursday, April 24, 2008 3:45 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Hosts reboots too fast for check_alive notification When one of my hosts reboots I'm never notified about the outage. Currently I'm using a custom script S99bootnotify to alert me when a host comes online, is there any way to shorten the polling for check_alive? I find it strange that a host could reboot and nagios not detect that outage. What version of nagios are you using? With nagios-2.x and prior, hosts are never checked unless a service on the host returns a non-OK result. Hosts are then checked with the host's check_command, up to max_check_attempts, with no pause between attempts. The end of that check process, which should be quick if you've followed the recommendations, will be the status of the host. It appears to me you might need to adjust your service check_interval to be less than the time it takes to reboot _or_ find a better way of knowing when the host reboots (uptime, timestamp on a file, log entries, etc...). -- Marc - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts w/o services
Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks. --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
advice tag='unsolicited' There must be some reason for each server or device to be on the network, it must provide some service, even if it's just a lowly(?) sshd. Do yourself and your organization a favor and monitor that reason! /advice -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: Wednesday, April 16, 2008 1:01 PM To: Nagios Users Mailinglist Subject: [Nagios-users] Hosts w/o services Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks. Wachovia Securities, LLC's outgoing and incoming e-mails are electronically archived and subject to review and/or disclosure to someone other than the recipient. A.G. Edwards, a division of Wachovia Securities, LLC - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
You can use check_ping -- I assume the boxes are pingable, and you want to check they are pingable, even if they do nothing else that can be monitored. If you don't want to check they work via ping, why are you even modelling them? -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: 16 April 2008 19:01 To: Nagios Users Mailinglist Subject: [Nagios-users] Hosts w/o services Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks. --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- -- --- This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java .sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this. - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: Wednesday, April 16, 2008 1:01 PM To: Nagios Users Mailinglist Subject: [Nagios-users] Hosts w/o services Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks. Nagios is a service monitoring application and everything is developed with that in mind. Every host is expected to have at least one service associated with it. Being able to effectively monitor hosts without services is new in Nagios-3 and is considered an 'optional' and not a core feature. The effects you're seeing are the result of that expectation. For example, clicking on the hostname link from hostgroup overview takes you to a list of services on that host. It would be nice if it did something different in your case but it's not there and you should understand that your use case is the exception, not the norm. I have heard nothing about that behavior changing in the future. -- Marc - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 10:21 AM, Paul Weaver wrote: You can use check_ping -- I assume the boxes are pingable, and you want to check they are pingable, even if they do nothing else that can be monitored. If you don't want to check they work via ping, why are you even modelling them? I AM checking them via ping- in the host check. It just doesn't make sense to be pinging both as a service check and as a host check, and since the ping is actually determining the host state, not the state of a service, it makes more sense to put the ping check as the host check, rather than a separate service. --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: 16 April 2008 19:01 To: Nagios Users Mailinglist Subject: [Nagios-users] Hosts w/o services Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks. --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- -- --- This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java .sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this. - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 10:10 AM, Ford, Andy wrote: advice tag='unsolicited' There must be some reason for each server or device to be on the network, it must provide some service, even if it's just a lowly(?) sshd. Do yourself and your organization a favor and monitor that reason! /advice To some extent, yes, but not necessarily a service we can monitor. There are quite a few devices on the network that it is important to us to know should the device fail, but aren't offering any separately monitorable service, per se. For example, printers. We need to know if one dies, so we can fix it before it becomes a problem (not to mention that fixing the boss's printer before he even knows it has a problem makes you look good ;-) ), but other than the simple ping host check, they don't offer any services we can monitor. Even some more essential devices such as switches can fall into this category, as they are just routing traffic, not offering any services. I could, of course, go crazy and use check_snmp or something to monitor each port on said switch as a service, but that is way overkill for our needs- we just need to know that the switch is there and functioning, i.e. host check. Not to mention the rather large category of client machines, which we need to know are running so they can be backed up, and, of course, used for whatever the user needs. Kind of difficult to monitor the ability to run office, or an e-mail client :-). We still want to monitor the host, though, so that hopefully if/when a client machine should die we can fix it before the user (who may well come in before us in the morning) is impacted. We could, of course, monitor something like ssh on those machines, but why? All we (and the user) care about is that the machine is functioning. So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply. --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: Wednesday, April 16, 2008 1:01 PM To: Nagios Users Mailinglist Subject: [Nagios-users] Hosts w/o services Running nagios 3.0, I have set up a number (most) of my hosts without services, since all I am interested in monitoring is the hosts themselves. However, this is causing a number of little annoyances, such as the hosts not showing up when I type their name into the nagios sidebar search, throwing numerous warnings if I do a verify from the command line, giving me a blank page when I click the hostname link from the hostgroup overview page, etc. Is there any way around these issues other than adding a check_dummy service check to each host? Thanks. Wachovia Securities, LLC's outgoing and incoming e-mails are electronically archived and subject to review and/or disclosure to someone other than the recipient. A.G. Edwards, a division of Wachovia Securities, LLC - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
On Wed, Apr 16, 2008 at 11:20:29AM -0800, Israel Brewster wrote: To some extent, yes, but not necessarily a service we can monitor. There are cases that fit this description, but I'm not sure if the examples you provide are. There are quite a few devices on the network that it is important to us to know should the device fail, but aren't offering any separately monitorable service, per se. For example, printers. We need to know if one dies, so we can fix it before it becomes a problem (not to mention that fixing the boss's printer before he even knows it has a problem makes you look good ;-) ), but other than the simple ping host check, they don't offer any services we can monitor. Telnet to 9100 and look for a banner. Or the equivalent for non JetDirect printers.. Even some more essential devices such as switches can fall into this category, as they are just routing traffic, not offering any services. I could, of course, go crazy and use check_snmp or something to monitor each port on said switch as a service, but that is way overkill for our needs- we just need to know that the switch is there and functioning, i.e. host check. And moving traffic; you might find it useful to ping-check other things on that switch. Or at least ssh-check the switch controller itself. Not to mention the rather large category of client machines, which we need to know are running so they can be backed up, and, of course, used for whatever the user needs. Kind of difficult to monitor the ability to run office, or an e-mail client :-). We still want to monitor the host, though, so that hopefully if/when a client machine should die we can fix it before the user (who may well come in before us in the morning) is impacted. We could, of course, monitor something like ssh on those machines, but why? All we (and the user) care about is that the machine is functioning. Sure. But for workstations, you're not monitoring diskspace? Open TCP listens (to watch for trojans) So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply. Well, it's not that it's unusual, I think; I believe the assertion being made is that only doing a ping is not the Best Practice. Cheers, -- jra -- Jay R. Ashworth[EMAIL PROTECTED] Designer+-Internetworking--+-+RFC 2100 Ashworth Associates | Best Practices Wiki | | '87 e24 St Petersburg FL USA+-http://bestpractices.wikia.com-+ +1 727 647 1274 If you can read this... thank a system administrator. Or two. --me - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: Wednesday, April 16, 2008 2:20 PM To: Ford, Andy Cc: Nagios Users Mailinglist Subject: Re: [Nagios-users] Hosts w/o services On Apr 16, 2008, at 10:10 AM, Ford, Andy wrote: So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply. In such cases the typical action is to configure the ping as a service check and no, a single ping or check_dummy, as a host check, Whether you're using parenting or not determines which of those you choose. -- Marc - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 11:25 AM, Jay R. Ashworth wrote: On Wed, Apr 16, 2008 at 11:20:29AM -0800, Israel Brewster wrote: To some extent, yes, but not necessarily a service we can monitor. There are cases that fit this description, but I'm not sure if the examples you provide are. For example, printers. Telnet to 9100 and look for a banner. Or the equivalent for non JetDirect printers.. Ok, that one might be of some use, in the case where the printer is responding to pings (host up) but not accepting connections (service down). Thanks for the suggestion, I'll look into it. Even some more essential devices such as switches... And moving traffic; you might find it useful to ping-check other things on that switch. Or at least ssh-check the switch controller itself. Yep, we are - all the hosts connected to the switch. That's not a service check on the switch, though, that's host checks on devices connected to it. Trying to monitor something like this directly on the switch as a service check would just be redundant. Maybe save a few seconds, but if you see all the hosts on the far side of a switch go down, while the switch is still up, it should be fairly obvious that the switch is having problems :-D Not to mention the rather large category of client machines Sure. But for workstations, you're not monitoring diskspace? Open TCP listens (to watch for trojans) Well, we could, and for some places this sort of through monitoring is overkill. So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply. Well, it's not that it's unusual, I think; I believe the assertion being made is that only doing a ping is not the Best Practice. I guess I see your point, in that there is typically something on any device which could be monitorable as a service. One could also argue that if it could be an issue (such as disk space on a client machine, or Open TCP listens), it should be monitored. These are valid arguments. However, I would argue that cluttering up your nagios display with services that either a) are fringe cases which would be quickly noticed anyway, or b) which you really don't care about, not to mention the additional network, monitoring server, and client load this imposes, is not best practice. Best Practice, in my mind at least, is Keep It Simple. Just monitor what you need to in order to provide the best service to your clients you can. --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- Cheers, -- jra -- Jay R. Ashworth[EMAIL PROTECTED] Designer+-Internetworking--+- +RFC 2100 Ashworth Associates | Best Practices Wiki | | '87 e24 St Petersburg FL USA+-http://bestpractices.wikia.com-+ +1 727 647 1274 If you can read this... thank a system administrator. Or two. --me - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
On Apr 16, 2008, at 11:32 AM, Marc Powell wrote: -Original Message- From: [EMAIL PROTECTED] [mailto:nagios- users- [EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: Wednesday, April 16, 2008 2:20 PM To: Ford, Andy Cc: Nagios Users Mailinglist Subject: Re: [Nagios-users] Hosts w/o services On Apr 16, 2008, at 10:10 AM, Ford, Andy wrote: So yeah, while this may not be the way nagios is designed to work, and may never be (which I can live with if so), I really don't see this as being all that unusual a situation, as some responses seem to imply. In such cases the typical action is to configure the ping as a service check and no, a single ping or check_dummy, as a host check, Whether you're using parenting or not determines which of those you choose. Thanks :-) Yeah, that's how I had it when I was running 2.x. It just seemed more intuitively obvious to go with host checks once 3.0 made that a reasonable option, since it was the host state, not a service state, that we were monitoring. That's ok though. I was just wondering if there was a better way. Apparently not :-) Thanks again everyone! --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- -- Marc - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Israel Brewster Sent: Wednesday, April 16, 2008 2:49 PM To: Jay R. Ashworth Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Hosts w/o services this imposes, is not best practice. Best Practice, in my mind at least, is Keep It Simple. Just monitor what you need to in order to provide the best service to your clients you can. I think that in almost all cases, this is going to be more than a simple ping. Take a simple case of a router. You might think that if it's pingable, it's up and working, if not it's down. However, if you're looking to provide the best service to your clients you're also going to be concerned with CPU utilization on that router, memory use, interface errors/status, etc... -- Marc - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
Well, if you dig, I think you will find that you can monitor much more than you think. In fact, to take one example you mentioned: printers. If I was in charge of monitoring the availability of the boss's printer, I would have Nagios run a test printout every morning that would either return a status or, if it's an old printer that's less intelligent, I'd have a piece of paper in the output tray that I'd pick up on my way to my desk! At that point, you know not only that it's turned on, but that the toner is good, it's not jammed, and a bunch of other stuff. Even in the worst case scenario of a dot matrix printer connected via serial line, you can whip up a plugin that sends a printout via a command line tool with lpr or some other old standby. Samba might come in hand there. This may not be applicable to your printer, but for instance: [EMAIL PROTECTED] ~]$ /usr/lib/nagios/plugins/check_hpjd -h check_hpjd (nagios-plugins 1.4.8) 1.35 Copyright (c) 1999 Ethan Galstad [EMAIL PROTECTED] Copyright (c) 2000-2006 Nagios Plugin Development Team [EMAIL PROTECTED] This plugin tests the STATUS of an HP printer with a JetDirect card. Net-snmp must be installed on the computer running the plugin. -Original Message- From: Israel Brewster [mailto:[EMAIL PROTECTED] Sent: Wednesday, April 16, 2008 2:20 PM To: Ford, Andy Cc: Nagios Users Mailinglist Subject: Re: [Nagios-users] Hosts w/o services On Apr 16, 2008, at 10:10 AM, Ford, Andy wrote: advice tag='unsolicited' There must be some reason for each server or device to be on the network, it must provide some service, even if it's just a lowly(?) sshd. Do yourself and your organization a favor and monitor that reason! /advice To some extent, yes, but not necessarily a service we can monitor. There are quite a few devices on the network that it is important to us to know should the device fail, but aren't offering any separately monitorable service, per se. For example, printers. We need to know if one dies, so we can fix it before it becomes a problem (not to mention that fixing the boss's printer before he even knows it has a problem makes you look good ;-) ), but other than the simple ping host check, they don't offer any services we can monitor. Wachovia Securities, LLC's outgoing and incoming e-mails are electronically archived and subject to review and/or disclosure to someone other than the recipient. A.G. Edwards, a division of Wachovia Securities, LLC - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts w/o services
Hi, After reading this thread I just couldn't passed without commenting it. We also have this problem, we are an NREN and so a kind of ISP. We connect clients through rented lines and we want to monitor the lines we rent. How we can do this is simple: just ping the host in the other end of the line. It happens that these hosts are property of the client and so we don't have access to them or neither we care about the services they could have on them... (also, it was to complex to monitor services in multivendor hosts that we don't even know!) So we configured nagios 3.0.1 as we did since the old netsaint era, just created a service "ping", as been mentioned here before. The only difference is we disabled periodic host checks by setting the host check interval to 0. We also have a complex scenario here since we had setup a custom host_alive script to test hosts more time in after-hours time to don't have many false alarms to our night team. We do this with the new $ISVALIDTIME. (Thank you Nagios 3!! ;)) A final note on our monitoring experience is to only monitor services that are really important to us. I don't want to get my attention away of really hard problems with things that are not important to resolve. And that could be on a faulty state for weeks. Of course it depends on the size of the infrastructure you are monitoring, but I'm talking in hundreds of hosts and many more services. Regards Emanuel Massano Emanuel Massano FCCN Av. do Brasil, n. 101 1700-066 Lisboa Tel: +351 21 844 0100 Fax: +351 21 847 2167 E-Mail: [EMAIL PROTECTED] www.fccn.pt Aviso de Confidencialidade Esta mensagem exclusivamente destinada ao seu destinatrio, podendo conter informao CONFIDENCIAL, cuja divulgao est expressamente vedada nos termos da lei. Caso tenha recepcionado indevidamente esta mensagem, solicitamos-lhe que nos comunique esse mesmo facto por esta via ou para o telefone +351 218440100 devendo apagar o seu contedo de imediato. This message is intended exclusively for its addressee. It may contain CONFIDENTIAL information protected by law. If this message has been received by error, please notify us via e-mail or by telephone +351 218440100 and delete it immediately Israel Brewster wrote: On Apr 16, 2008, at 11:32 AM, David Wallis wrote: I have the same situation with some devices. I just configure a "ping" service for them, and then move on to more interesting problems. When there's such a simple solution, it sometimes doesn't pay to overthink the problem. Yeah, I'm seeing that. Just didn't think it would be such a big issue :-D I was actually expecting either a simple solution, or a simple "it can't be done". Unfortunately, being the stubborn type, I tend to beat these things into the ground :-P :-D --- Israel Brewster Computer Support Technician Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 --- -- David Wallis Information Technology Advanced Photon Source Argonne National Laboratory 630.252.7375 - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts/service Acknowledgement comment staleness?
Hey All, I am using version 2.9 currently and am noticing that when I ack a host/service and leave a comment and unchecked the persistent option that when the service or host changes states the acknowledgment comment is not being removed. I know that before I upgraded (was using 2.5) when I left a comment on acknowledging an alert and unchecked the persistent option the comment would automatically be removed when the host/service changed states. I'm not referring to the sticky acknowledgment which I know keeps the acks there across program restarts, I'm referring to the 3rd box under send notifications for the persistent option. Anyone else experience this behavior ? Donnell Lewis You can judge the character of others by how they treat those they think can do nothing for them...-Malcom Forbes - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2005. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts with one service in error state never get checked - bug?
Hi, I've got a few hosts with just one service each. If the service goes into a non-OK state, then later the host goes down, when it comes back up Nagios never checks the host if the service is still not OK. The host still looks like it's down as far as Nagios is concerned. Is this a bug? Thanks for any advice - Ian -- Ian Chard, Senior Unix and Network Admin | E: [EMAIL PROTECTED] Systems and Electronic Resources Service | T: 80587 / (01865) 280587 Oxford University Library Services| F: (01865) 242287 - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts and services not sending mail
I'm pretty sure I've got everything set up correctly, as yesterday I was getting notifications sent out, and today there are none going out. I've added some services that I knew would go critical, and started watching nagios.log. Here is a snippet from yesterdays log [1179271433] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;devstack01;1179271433 [1179271440] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;devstack02;1179271440 [1179271445] HOST ALERT: devstack01;DOWN;SOFT;1;CRITICAL - Host Unreachable (10.0.0.160) [1179271448] HOST ALERT: devstack01;DOWN;SOFT;2;CRITICAL - Host Unreachable (10.0.0.160) [1179271451] HOST ALERT: devstack01;DOWN;SOFT;3;CRITICAL - Host Unreachable (10.0.0.160) [1179271451] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;ilom-cp1;1179271449 [1179271454] HOST ALERT: devstack01;DOWN;SOFT;4;CRITICAL - Host Unreachable (10.0.0.160) [1179271457] HOST ALERT: devstack01;DOWN;SOFT;5;CRITICAL - Host Unreachable (10.0.0.160) [1179271460] HOST ALERT: devstack01;DOWN;SOFT;6;CRITICAL - Host Unreachable (10.0.0.160) [1179271460] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;ilom-cp2;1179271459 [1179271463] HOST ALERT: devstack01;DOWN;SOFT;7;CRITICAL - Host Unreachable (10.0.0.160) [1179271466] HOST ALERT: devstack01;DOWN;SOFT;8;CRITICAL - Host Unreachable (10.0.0.160) [1179271469] HOST ALERT: devstack01;DOWN;SOFT;9;CRITICAL - Host Unreachable (10.0.0.160) [1179271469] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;ilom-cp3;1179271467 [1179271472] HOST ALERT: devstack01;DOWN;HARD;10;CRITICAL - Host Unreachable (10.0.0.160) [1179271472] HOST NOTIFICATION: lbeavers-pager;devstack01;DOWN;host-notify-by-epager;CRITICAL - Host Unreachable (10.0.0.160) [1179271472] HOST NOTIFICATION: lbeavers;devstack01;DOWN;host-notify-by-email;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271472] HOST NOTIFICATION: gpoly-pager;devstack01;DOWN;host-notify-by-epager;CRITICAL - Host Unreachable (10.0.0.160) [1179271472] HOST NOTIFICATION: gpoly;devstack01;DOWN;host-notify-by-email;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271472] SERVICE ALERT: devstack01;ping;CRITICAL;HARD;1;CRITICAL - Host Unreachable (10.0.0.160) --- As you can see, host notifications are being sent out Today's log: --- [1179337965] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;contactpoint3;var_disk;1179337960 [1179337974] SERVICE ALERT: contactpoint3;var_disk;UNKNOWN;SOFT;1;SNMP problem - No data received from host [1179338034] SERVICE ALERT: contactpoint3;var_disk;UNKNOWN;SOFT;2;SNMP problem - No data received from host [1179338094] SERVICE ALERT: contactpoint3;var_disk;UNKNOWN;HARD;3;SNMP problem - No data received from host [1179338408] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;contactpoint3;sendmail_check;1179338407 [1179338414] SERVICE ALERT: contactpoint3;sendmail_check;CRITICAL;SOFT;1;sendmail Processes CRITICAL - *0* [1179338474] SERVICE ALERT: contactpoint3;sendmail_check;CRITICAL;SOFT;2;sendmail Processes CRITICAL - *0* [1179338484] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;contactpoint3;sendmail_check;1179338481 [1179338494] SERVICE ALERT: contactpoint3;sendmail_check;CRITICAL;HARD;3;sendmail Processes CRITICAL - *0* [1179338604] Warning: The results of service 'ping' on host 'contactpoint4' are stale by 45 seconds (threshold=615 seconds). I'm forcing an immediate check of the service. [1179338604] Warning: The results of service 'sendmail_check' on host 'contactpoint4' are stale by 45 seconds (threshold=615 seconds). I'm forcing an immediate check of the service. [1179338604] Warning: The results of service 'ping' on host 'contactpoint5' are stale by 45 seconds (threshold=61 -- As can be seen, it went thru the three criticals, went to CRIT HARD, but no NOTIFICATIONS were sent, it just continued looking at other services. I've got enable_notifications=1 set in nagios.cfg In services.cfg, I've got: notification_period 24x7 notifications_enabled 1 ; Service notifications are enabled notification_interval 15 ; Default interval - change only if needed in the service config and the web frontend reports ALL notifications enabled. Monitoring Features Flap Detection Notifications Event Handlers Active Checks Passive Checks [image: Flap Detection Enabled]http://nagios.quepasa.com/nagios/cgi-bin/cmd.cgi?cmd_typ=62 All Services Enabled No Services Flapping All Hosts Enabled No Hosts Flapping [image: Notifications Enabled]http://nagios.quepasa.com/nagios/cgi-bin/cmd.cgi?cmd_typ=11 All Services Enabled All Hosts Enabled [image: Event Handlers Enabled] http://nagios.quepasa.com/nagios/cgi-bin/cmd.cgi?cmd_typ=42All Services Enabled All Hosts Enabled [image: Active Checks Enabled]http://nagios.quepasa.com/nagios/cgi-bin/extinfo.cgi?type=0 All Services Enabled All Hosts Enabled [image: Passive Checks Enabled]
Re: [Nagios-users] Hosts and services not sending mail
I added some checks, the check_sendmail being one of them. I've also successfully executed the following from the command line (as nagios user) : /usr/bin/printf %b * Nagios 2.9 *\n\nNotification Type: NOTIFICATIONTYPE\nHost: HOSTNAME\nState: HOSTSTATE\nAddress: HOSTADDRESS\nInfo: HOSTOUTPUT\n\nDate/Time: NOW\n | /bin/mail -s Host DOWN alert for testmachine (address obfuscated) mail is working on the machine. I'm getting absolutely no NOTIFICATION entries in the logs (nagios.log) so the nagios server isn't even trying to send 'em out. That's the part that perplexes me. TIA G.~ On 5/16/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote: Have any configuration changes been made to the system since yesterday that you are aware of? Because on the day it worked your logs show nothing about sendmail, and on the day it stopped working sendmail shows up in the log. That leaves me to believe there is something fishy with the mail piece and not nagios itself. Try emailing yourself thru the terminal with both mail and sendmail and see with works and also in your notify-host-by-email add /usr/sbin/mail and see if works then. Stephen Valdinger MIS Helpdesk Coordinator Dover Chemical Corporation 3676 Davis Rd NW Dover, OH 44622 330-365-3622 [EMAIL PROTECTED] Ever notice how fast Windows runs ? — Neither did I. If at first you don't succeed, work for Microsoft. -- *From:* Gary Every [mailto:[EMAIL PROTECTED] *Sent:* Wednesday, May 16, 2007 2:32 PM *To:* nagios-users@lists.sourceforge.net *Subject:* [Nagios-users] Hosts and services not sending mail I'm pretty sure I've got everything set up correctly, as yesterday I was getting notifications sent out, and today there are none going out. I've added some services that I knew would go critical, and started watching nagios.log. Here is a snippet from yesterdays log [1179271433] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;devstack01;1179271433 [1179271440] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;devstack02;1179271440 [1179271445] HOST ALERT: devstack01;DOWN;SOFT;1;CRITICAL - Host Unreachable (10.0.0.160) [1179271448] HOST ALERT: devstack01;DOWN;SOFT;2;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271451] HOST ALERT: devstack01;DOWN;SOFT;3;CRITICAL - Host Unreachable (10.0.0.160) [1179271451] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;ilom-cp1;1179271449 [1179271454] HOST ALERT: devstack01;DOWN;SOFT;4;CRITICAL - Host Unreachable (10.0.0.160) [1179271457] HOST ALERT: devstack01;DOWN;SOFT;5;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271460] HOST ALERT: devstack01;DOWN;SOFT;6;CRITICAL - Host Unreachable (10.0.0.160) [1179271460] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;ilom-cp2;1179271459 [1179271463] HOST ALERT: devstack01;DOWN;SOFT;7;CRITICAL - Host Unreachable (10.0.0.160) [1179271466] HOST ALERT: devstack01;DOWN;SOFT;8;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271469] HOST ALERT: devstack01;DOWN;SOFT;9;CRITICAL - Host Unreachable (10.0.0.160) [1179271469] EXTERNAL COMMAND: SCHEDULE_FORCED_HOST_SVC_CHECKS;ilom-cp3;1179271467 [1179271472] HOST ALERT: devstack01;DOWN;HARD;10;CRITICAL - Host Unreachable (10.0.0.160) [1179271472] HOST NOTIFICATION: lbeavers-pager;devstack01;DOWN;host-notify-by-epager;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271472] HOST NOTIFICATION: lbeavers;devstack01;DOWN;host-notify-by-email;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271472] HOST NOTIFICATION: gpoly-pager;devstack01;DOWN;host-notify-by-epager;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271472] HOST NOTIFICATION: gpoly;devstack01;DOWN;host-notify-by-email;CRITICAL - Host Unreachable ( 10.0.0.160) [1179271472] SERVICE ALERT: devstack01;ping;CRITICAL;HARD;1;CRITICAL - Host Unreachable ( 10.0.0.160) --- As you can see, host notifications are being sent out Today's log: --- [1179337965] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;contactpoint3;var_disk;1179337960 [1179337974] SERVICE ALERT: contactpoint3;var_disk;UNKNOWN;SOFT;1;SNMP problem - No data received from host [1179338034] SERVICE ALERT: contactpoint3;var_disk;UNKNOWN;SOFT;2;SNMP problem - No data received from host [1179338094] SERVICE ALERT: contactpoint3;var_disk;UNKNOWN;HARD;3;SNMP problem - No data received from host [1179338408] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;contactpoint3;sendmail_check;1179338407 [1179338414] SERVICE ALERT: contactpoint3;sendmail_check;CRITICAL;SOFT;1;sendmail Processes CRITICAL - *0* [1179338474] SERVICE ALERT: contactpoint3;sendmail_check;CRITICAL;SOFT;2;sendmail Processes CRITICAL - *0* [1179338484] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;contactpoint3;sendmail_check;1179338481 [1179338494] SERVICE ALERT: contactpoint3;sendmail_check;CRITICAL;HARD;3;sendmail Processes CRITICAL - *0* [1179338604] Warning: The results of service 'ping' on host 'contactpoint4' are stale
[Nagios-users] Hosts in PENDING State
Hello, I have several hosts configured with active checks enabled and the check_command set to check-host-alive which is defined in the command file. The state on these hosts always stays in pending unless I manually schedule a check. I noticed that in the sample localhost configuration there is a PING service that simply pings the host. Do I need to associate some sort of a service with each host before they will be checked? Is there something that I have missed in the documentation that will cause host checks to be recorded without associating a service with each host? I did see something similar in a GMane search, but the articles did not seem to be available. Regards, Andy Smith - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts in PENDING State
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Andy Smith Sent: Friday, March 30, 2007 6:15 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Hosts in PENDING State Hello, I have several hosts configured with active checks enabled and the check_command set to check-host-alive which is defined in the command file. The state on these hosts always stays in pending unless I manually schedule a check. I noticed that in the sample localhost configuration there is a PING service that simply pings the host. Do I need to associate some sort of a service with each host before they will be checked? Is there something that I have missed in the documentation that This should be a FAQ and actually I believe it's in there. Nagios is first and foremost a service monitoring application. Hosts are never checked unless a service on that host fails* therefore every host must have at least one service defined for it. http://nagios.sourceforge.net/docs/2_0/checkscheduling.html#host_checks I'd be surprised if you didn't find lots of discussion about this by searching for 'pending' in the archives. -- Marc * There are exceptions when walking up a parent tree to determine network outages. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] hosts show up and then disappear..
hosts show up in host detail then when a I refresh or a few moments later they disappear. nagios.log is not showing anything.. anyone ever run across this? No need to miss a message. Get email on-the-go with Yahoo! Mail for Mobile. Get started. http://mobile.yahoo.com/mail - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hosts show up and then disappear..
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of abid dar Sent: Thursday, March 15, 2007 2:09 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] hosts show up and then disappear.. hosts show up in host detail then when a I refresh or a few moments later they disappear. nagios.log is not showing anything.. anyone ever run across this? If you're using Nagios 1.x, it's a FAQ. It shouldn't happen in 2.x. -- Marc - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts information problem when down
Hello, Nagios checks hosts using ping in my configuration. Today, However it didn't take mind and didn't send any alert although server was hang down. Host answer to ping. On the other hand, I already control the services. I want to examine the hosts if right method by ping control. Is there any suggestion to healthy control of the hosts? thanks. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts information problem when down
Hello, Nagios checks hosts using ping. Today, However it didn't take mind and didn't send alert although server was down. There was a problem related disks. On the other hand, I already control the services. I want to examine the hosts that it is right method or not by ping control. I don't think that right method from now. Is there any suggestion to healthy control of the hosts? thanks. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys-and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts status updating very slow
Hi Guys,My nagios server host status is updating very slow my configuration setting are correct and i don't know why my host status updating after 15-20 min.We are testing the servers to check the nagios functionality.I am using nagios 1.3 under debianThanks for your help Blab-away for as little as 1¢/min. Make PC-to-Phone Calls using Yahoo! Messenger with Voice.
[Nagios-users] Hosts file - help
Hello, Can I use an IP address instead of a DNS name for hosts in my /usr/local/nagios/etc/servers/hosts.cfg file? Is this the only place where I can define hosts to be monitored by Nagios? I can ping the host below but cannot monitor it even when I use a fully qualified DNS name for this host. Currently, my host file looks like this: define host { use host-template host_name 192.168.0.2 alias 192.168.0.2 address 192.168.0.2 } define service { use service-http host_name 192.168.0.2 } I have also tried: define host { use host-template host_name dns.fmr.com alias dns.fmr.com address dns.fmr.com } define service { use service-http host_name dns.fmr.com } Do you see any config problems? Thanks, Zaeem. --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnkkid0944bid$1720dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Hosts checks not done when there is no service configured?
Hi there, I'm configuring Nagios and it seems that host-checks (check_command check-host-alive) aren't queued. I need to add a PING service on each host, to keep track of the uptime. Is this normal? I'd like to keep track of ISP routers at customers sites, so we can see if a fallout is caused by our devices or by the ISP. I'm using Nagios 1.3, the latest package from the stable Debian branche. -- Gegroet, Tim --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnkkid0944bid$1720dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
RE: [Nagios-users] Hosts checks not done when there is no service configured?
Yes, that's completely normal. Please read the fine manual at: http://nagios.sourceforge.net/docs/1_0/ Specifically, the Network Reachability portion of the Theory of Operations section: http://nagios.sourceforge.net/docs/1_0/networkreachability.html And the FAQ at: http://www.nagios.org/faqs/viewfaq.php?faq_id=39 Thanks, Garry -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Tim Stoop Sent: Thursday, March 02, 2006 6:15 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Hosts checks not done when there is no service configured? Hi there, I'm configuring Nagios and it seems that host-checks (check_command check-host-alive) aren't queued. I need to add a PING service on each host, to keep track of the uptime. Is this normal? I'd like to keep track of ISP routers at customers sites, so we can see if a fallout is caused by our devices or by the ISP. I'm using Nagios 1.3, the latest package from the stable Debian branche. -- Gegroet, Tim --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=kkid0944bid$1720dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnkkid0944bid$1720dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts PENDING
I can't find the specific doc/FAQ that says so, but I know its been discussed here before: Every host needs a service. A host will not be checked if it doesn't have one. Even tho your host-check is probably a check_ping or check_icmp, add a _service_ to that host that does yet another check_ping or check_icmp. I doubt the extra bandwidth usage will kill your network. ;) I was bitten by this in my first installation too. -f On Thu, 2 Feb 2006, Patrick M. Rutkowski wrote: Date: Thu, 2 Feb 2006 20:12:52 -0500 From: Patrick M. Rutkowski [EMAIL PROTECTED] To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Hosts PENDING I have several hosts with no associated services, and their status bars are stuck on PENDING in the Host Detail cgi. check_command on them is set to check-host-alive, and checks_enabled is set to 1. What am I doing wrong, why aren't the hosts being checked? --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Hosts PENDING
Hi, Actually you have to add the equipment to the service.cfg file. In fact you have to add a service for your equipment, if not, nagios will mark the host with pending status because is waiting for some service to monitor. For example you have a router defined in hosts.cfg: # 'router1' host definition define host{ use generic-host ; Name of host template to use host_name router1 alias Router #1 address 192.168.1.254 check_command check-host-alive max_check_attempts 20 notification_interval 60 notification_period 24x7 notification_options d,u,r } Then you have to add the following into services.cfg to monitor the host: # Service definition define service{ use generic-service ; Name of service template to use host_name router1 service_description PING is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups router-admins notification_interval 240 notification_period 24x7 notification_options c,r check_command check_ping!100.0,20%!500.0,60% } The last configuration is the most simple monitor. Hope it help. Best Regards. On Thu, 2006-02-02 at 20:12 -0500, Patrick M. Rutkowski wrote: I have several hosts with no associated services, and their status bars are stuck on PENDING in the Host Detail cgi. check_command on them is set to check-host-alive, and checks_enabled is set to 1. What am I doing wrong, why aren't the hosts being checked? --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid3432bid#0486dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Diego. signature.asc Description: This is a digitally signed message part
Re: [Nagios-users] Hosts PENDING
On Thu, 2 Feb 2006, frank wrote: I can't find the specific doc/FAQ that says so, but I know its been discussed here before: Every host needs a service. A host will not be checked if it doesn't have one. Even tho your host-check is probably a check_ping or check_icmp, add a _service_ to that host that does yet another check_ping or check_icmp. I doubt the extra bandwidth usage will kill your network. ;) With Nagios v2 this does not seems to be true. Or I must be fortunate to have multiple hosts with only a host check that have an accurate state indicator. And some of the use other checks like check_http or check_smtp Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnkkid=103432bid=230486dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null