[Nagios-users] String errors
OK, so I'm monitoring about 100 websites with string checks via check_http. We are mirroring what our datacenter actually checks, so we have notifications turned off; that way, when a site goes down, we aren't being spammed by both the datacenter and our Nagios installation. The issue is that every once in a while a string changes on a site, so it goes critical in our Nagios. We might not notice it for a day, which messes up our availability reports. Is there a way to retroactively mark the time that it was critical as scheduled downtime?

-
SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Notifications
I have a quick question. I have a host template that disables notifications, and about 53 hosts use it. If notifications are disabled, why does Nagios still require values for notification_interval, notification_period, and notification_options?
[Nagios-users] Notifications
I have a Nagios 2.9 install. I have one host with multiple services being monitored. On the 16th the host didn't respond to a ping (the server rebooted) and recovered within 3 minutes. I received an e-mail for both the failure and the recovery.

I am also monitoring some Windows services on the same box using NSClient++. It shows that on the same day, after the host recovered, the TrendMicro virus process was down for 6 hours. I didn't receive an e-mail during this entire time. The service is set to 5 max check attempts, a normal check interval of 3, and a retry interval of 1, with a notification interval of 15 minutes. It should have notified at least once, but it didn't. Any ideas?
[Nagios-users] CheckNT
I am monitoring the CPU load with NSClient on a host. Here is the service definition:

define service{
        use                     windows-service
        name                    check-cpu
        notification_options    w,u,c,r
        check_command           check_nt!CPULOAD!-l 5, 80, 90
        register                0
        }

Here is the actual call to that service:

define service{
        use                     check-cpu
        service_description     CPU
        contact_groups          mis
        hostgroup_name          windows-clients
        }

Here is what I see in the Nagios interface:

CPU   OK   11-19-2007 14:56:07   122d 2h 7m 2s   1/5   not enough values for -l parameters

Any ideas?
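One possible cause, offered as a guess rather than a confirmed diagnosis: check_nt expects the CPULOAD thresholds as a single comma-separated triplet (interval, warning, critical), and the spaces after the commas may split "5, 80, 90" into separate arguments so that -l only receives part of it. A minimal sketch of the same template with the spaces removed, assuming the check_nt command definition passes $ARG2$ straight through to the plugin:

```
define service{
        use                     windows-service
        name                    check-cpu
        notification_options    w,u,c,r
        # 5-minute average, warn at 80%, critical at 90% (no spaces in the triplet)
        check_command           check_nt!CPULOAD!-l 5,80,90
        register                0
        }
```

If the message persists after this change, running the plugin by hand from the Nagios server with the same arguments should show whether the problem is in the command definition instead.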
Re: [Nagios-users] Notifications
define service{
        name                            generic-service ; Generic service name
        active_checks_enabled           1       ; Active service checks are enabled
        passive_checks_enabled          1       ; Passive service checks are enabled/accepted
        parallelize_check               1       ; Active service checks should be parallelized (Don't disable)
        obsess_over_service             1       ; We should obsess over this service (if necessary)
        check_freshness                 0       ; Default is to NOT check service 'freshness'
        notifications_enabled           1       ; Service notifications are enabled
        event_handler_enabled           1       ; Service event handler is enabled
        flap_detection_enabled          1       ; Flap detection is enabled
        process_perf_data               1       ; Process performance data
        retain_status_information       1       ; Retain status information across program restarts
        retain_nonstatus_information    1       ; Retain non-status information across program restarts
        register                        0       ; DONT REGISTER THIS DEFINITION - NOT A REAL SERVICE, JUST A TEMPLATE!
        }

define service{
        use                     generic-service
        name                    windows-service
        is_volatile             0
        check_period            24x7
        max_check_attempts      5
        normal_check_interval   3
        retry_check_interval    1
        notification_interval   15
        notification_period     24x7
        register                0
        }

define service{
        use                     windows-service
        name                    check-trend
        notification_options    w,u,c,r
        check_command           check_nt!SERVICESTATE!-d SHOWALL -l ofcservice
        register                0
        }

define service{
        use                     check-trend
        service_description     TrendMicro
        contact_groups          mis
#       hostgroup_name          windows-clients
        host_name               lg03
        }

On Nov 19, 2007 3:11 PM, Aaron Devey [EMAIL PROTECTED] wrote:
> What are your notification options set to? In 2.9 the default is none, so if you didn't specify them for that service, it won't alert.
>
> If that's not the answer, perhaps you can paste the definitions for your service, contact, and notification command?
>
> -Aaron
Re: [Nagios-users] Notifications
I found out via Nagios. It said the duration of the outage was 6 hours, and I couldn't match it up with any alerts that were sent to my mailbox. Thanks for the tips.

On Nov 19, 2007 3:48 PM, Aaron Devey [EMAIL PROTECTED] wrote:
> I don't see any obvious problems with your service definitions. Did you find out TrendMicro was down for 6 hours by reviewing the Nagios logs? If so, that means Nagios at least saw that the service had a problem. If you found out it was down by some other means, perhaps you can check the Nagios logs to make sure Nagios saw a critical or warning problem with the service.
>
> Also, if you have log_notifications turned on, try examining the logs for the time period it was down. If you don't see any attempts to send a notification for TrendMicro on lg03, then it's likely a configuration problem somewhere. Finding it is the hard part. :)
>
> The first places I would check are the service_notification_period, service_notification_options, and service_notification_commands for the contacts in the 'mis' group. Follow the service_notification_commands to make sure the command it points to is set up correctly as well. If there are no problems there, I'd make sure there are no service escalations for that service.
>
> If that doesn't help, I have no idea what the problem could be. :)
>
> Good luck,
> -Aaron
Re: [Nagios-users] Host monitoring
I understand this part, but right now, for example, the service description for each host is CheckString. That way I can group all of them into a service group based on that description, so we can pull availability reports for the host and the only service being monitored. If I wanted to do it your way, wouldn't I have to use the service description of each check in the servicegroups config file?

On Nov 12, 2007 5:13 PM, Marc Powell [EMAIL PROTECTED] wrote:
> > -----Original Message-----
> > From: [EMAIL PROTECTED] On Behalf Of Jerad Riggin
> > Sent: Monday, November 12, 2007 4:52 PM
> > To: nagios-users@lists.sourceforge.net
> > Subject: [Nagios-users] Host monitoring
> >
> > So what would be the best way to monitor in this situation? I have Host A, hosting two websites. Currently I have both websites configured as separate hosts, so in Nagios it shows up as ServerA - site.com, and another host ServerA - siteb.com.
> >
> > This doesn't seem efficient, since I am pinging both sites but it's the same host. Ideally I would want to just ping the host, and check strings on each site under that host. Any recommendations?
>
> Unless I'm missing something obvious, this seems pretty straightforward: a host{} definition for the web server machine, and service{} definitions for each vhost using check_http to check them. You can craft a command{} definition to pass the variable parameters like vhost name, URI and search string like so --
>
> define command {
>         command_name    check_http_url
>         command_line    $USER1$/check_http -I $HOSTADDRESS$ -H $ARG1$ -u $ARG2$ -wt 20 -ct 30 -to 35 -s $ARG3$
> }
>
> To check two vhosts --
>
> define service {
>         use                     generic-service
>         host_name               my_webserver
>         service_description     HTTP VHOST site1
>         check_command           check_http_url!www.site1.com!/mypage!/html
> }
>
> define service {
>         use                     generic-service
>         host_name               my_webserver
>         service_description     HTTP VHOST site2
>         check_command           check_http_url!www.site2.com!/someotherpage!/html
> }
>
> --
> Marc
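On the service group question, a minimal sketch of what the group could look like with per-vhost service descriptions. It reuses the host and service names from Marc's example; the group name and alias are made up for illustration. In a servicegroup definition the members directive takes host,service_description pairs, so each distinct description does have to be listed:

```
define servicegroup{
        servicegroup_name       website-string-checks   ; hypothetical group name
        alias                   Website string checks
        # members are listed as host,service_description pairs
        members                 my_webserver,HTTP VHOST site1,my_webserver,HTTP VHOST site2
        }
```

The availability report can then be run against this service group rather than against a shared CheckString description.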
[Nagios-users] Monitoring a process
Would it be possible to monitor for a process name using NSClient++? We need to make sure Outlook.exe is running on the server and, if it isn't, send out notifications.

Thanks,
Jerad
Re: [Nagios-users] Monitoring a process
Thanks for the response. This is what I did just a bit ago. I have it successfully monitoring outlook.exe; however, no e-mails are being sent when it's critical. Here is what I have as the service definition:

define service{
        use                     leadlander
        host_name               leadlandervm
        service_description     Outlook Process
        contact_groups          bo,mis
        notification_options    w,u,c,r
        check_command           check_nt2!PROCSTATE!-d SHOWALL -l outlook.exe
        }

Here is what I have for the leadlander template:

define service{
        use                     generic-service
        name                    leadlander
        is_volatile             0
        check_period            24x7
        max_check_attempts      15
        normal_check_interval   20
        retry_check_interval    20
        notification_interval   20
        notification_period     24x7
        register                0
        }

Does it only notify after the first retry failure, or should it notify as soon as the service is critical? Any ideas?

On Nov 13, 2007 2:39 PM, Rich Sasko [EMAIL PROTECTED] wrote:
> Jerad,
>
> We have NSClient++ running on our Windows servers, and the following service check is an example of how we are monitoring services from the Nagios server:
>
> define service{
>         use                     generic-service
>         host_name               email server
>         service_description     explorer
>         check_command           check_nt!PROCSTATE!-d SHOWALL -l explorer.exe
>         }
>
> It is actually in one of the sample config files; you should just have to tell it what process you want to watch.
Re: [Nagios-users] Monitoring a process
Just to clarify:

        normal_check_interval   60
        retry_check_interval    360
        notification_interval   360

This means that it checks every hour. After it fails the first time, it will wait 6 hours to retry; if it is still not OK after 6 hours, it notifies, and then keeps notifying every 6 hours until it's OK.

On Nov 13, 2007 3:00 PM, Rich Sasko [EMAIL PROTECTED] wrote:
> It only notifies after it enters a hard state, which is usually after the third try.
>
> Richard Sasko
> Niagara Lasalle Corp
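As a rough illustration of how these directives interact, here is the leadlander template from this thread with the timing worked out in comments. The arithmetic ignores scheduling latency and assumes the default interval_length of 60 seconds, so that intervals are in minutes; the number of tries before a hard state is whatever max_check_attempts says, not a fixed three.

```
define service{
        use                     generic-service
        name                    leadlander
        is_volatile             0
        check_period            24x7
        max_check_attempts      15      ; hard state only after 15 consecutive non-OK results
        normal_check_interval   20      ; minutes between checks while OK or in a hard state
        retry_check_interval    20      ; minutes between rechecks while in a soft state
        notification_interval   20      ; minutes between repeat notifications once hard
        notification_period     24x7
        register                0
        }
# The first failure is attempt 1 (soft state).
# Attempts 2 through 15 follow every 20 minutes: 14 x 20 = 280 minutes.
# The first notification is therefore expected about 4 h 40 min after the first failure.
```

By the same arithmetic, the 60/360/360 example behaves as described (one retry after 6 hours, then a notification) only if max_check_attempts is 2.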
[Nagios-users] Checkhttp string
How do I go about checking a specific page on a site? I am able to successfully check a string on the default document for a site, but if I specify, for example, index2.aspx in the host part of check_http, it throws an error.
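check_http takes the page path separately from the host name, through its -u option, with -s carrying the string to look for. A minimal sketch, assuming a hypothetical command name check_http_page, a hypothetical host my_webserver, and Home as an example search string:

```
define command{
        command_name    check_http_page         ; hypothetical command name
        # -H is the site name, -u the path on that site, -s the expected string
        command_line    $USER1$/check_http -H $ARG1$ -u $ARG2$ -s $ARG3$
        }

define service{
        use                     generic-service
        host_name               my_webserver    ; hypothetical host
        service_description     HTTP index2
        check_command           check_http_page!site.com!/index2.aspx!Home
        }
```

The path goes in the second argument with a leading slash; the first argument stays a bare host name.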
[Nagios-users] Nagios time format
Is there a way to change the time format that Nagios uses? CST would make the logs easier to read.
[Nagios-users] Service scheduling
I have a checkstring service that kicks off every 10 minutes. After a failure it will retry every minute. We're trying to track availability on some of our servers, so this is its purpose.

However, I noticed this morning that a host went down: I received a PING 100% packet loss alert and, shortly after, within the same minute, an alert that it couldn't find the HTTP string. By 7:50 PING was OK, but HTTP didn't return OK until 8:00 AM. If it's retrying every minute (or should be), why is there such a large gap?
Re: [Nagios-users] Service scheduling
I currently have the normal check at 10 minutes, max attempts at 1, and retry intervals at 1.

On Nov 9, 2007 9:36 AM, Giles Coochey [EMAIL PROTECTED] wrote:
> Normal behaviour:
>
> Scheduling During Problems
>
> So what happens when there are problems with a service? Well, one of the things that happens is the service check scheduling changes. If you've configured the max_attempts option of the service definition to be something greater than 1, Nagios will recheck the service before deciding that a real problem exists. While the service is being rechecked (up to max_attempts times) it is considered to be in a soft state (as described here) and the service checks are rescheduled at a frequency determined by the retry_interval option.
>
> If Nagios rechecks the service max_attempts times and it is still in a non-OK state, Nagios will put the service into a hard state, send out notifications to contacts (if applicable), and start rescheduling future checks of the service at a frequency determined by the check_interval option.
>
> http://nagios.sourceforge.net/docs/2_0/checkscheduling.html
Re: [Nagios-users] Service scheduling
Thanks for the clarification.

On Nov 9, 2007 9:50 AM, Marc Powell [EMAIL PROTECTED] wrote:
> > -----Original Message-----
> > From: [EMAIL PROTECTED] On Behalf Of Jerad Riggin
> > Sent: Friday, November 09, 2007 9:44 AM
> > To: Giles Coochey
> > Cc: nagios-users@lists.sourceforge.net
> > Subject: Re: [Nagios-users] Service scheduling
> >
> > I currently have the normal check at 10 minutes, max attempts at 1, and retry intervals at 1.
>
> This means that nagios will check the service every 10 minutes until there is a problem, then it will perform 1 check 1 minute later and send a notification if it's still in a non-OK state. All future checks return to the 10 minute interval, even if it's still in a non-OK state. retry_check_interval is only used up to max_check_attempts.
>
> --
> Marc
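To make the scheduling concrete, here is a hedged sketch of settings that would give one-minute rechecks before a notification goes out. It is not taken from the poster's files: the template name is hypothetical, the notification values are only illustrative, and the timings assume the default interval_length of 60 seconds.

```
define service{
        use                     generic-service
        name                    checkstring-template    ; hypothetical template name
        check_period            24x7
        max_check_attempts      3       ; attempts 2 and 3 are rechecks in a soft state
        normal_check_interval   10      ; minutes between checks while OK or in a hard state
        retry_check_interval    1       ; minutes between rechecks while in a soft state
        notification_interval   15
        notification_period     24x7
        register                0
        }
```

With these values, a failure seen at 7:40 would be rechecked at about 7:41 and 7:42, then go hard and notify. After that, checks return to the 10-minute interval, so a recovery is only noticed at the next regular check. If the goal is to measure outage length to the minute, normal_check_interval itself would have to be lowered.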
[Nagios-users] Availability reports
I have two sites being monitored with check_http for a string. When I run an availability report for today, it shows 100% availability; however, if I go further in and click on the host, it shows 95.4% Time OK with 4.5% Time Critical. Why is this downtime not showing in the availability report?
Re: [Nagios-users] Availability reports
I think the issue was that the checkstring is a service, but the host wasn't down (Nagios could still ping it). So it considered the host up but the service down. I created a service group so we could check availability based on the checkstring service.

On Nov 5, 2007 10:56 AM, Sebastian Ries [EMAIL PROTECTED] wrote:
> Hi
>
> Is the availability report based on the service checks? Nagios does not check services as long as it knows the host is down. So it could be that EVERY service check returned OK (100%) because, as the host was down, the service hasn't been checked...
>
> Regards
> Sebastian Ries
Re: [Nagios-users] Service checks
In that case, is there a way to specify more than one possible string, so that, for example, if it can't find Home but it can find Checkout, it considers the host up?

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:
> Yes, you have to use a separate one for each site; how else would you be able to use a separate check? Unless you expect the word Home on each site and that is your string check.
>
> Another option is using macros, but I suspect this may not do what you want...
>
> -h
> Hari Sekhon
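One way to accept either of two strings is check_http's -r option, which searches the page for an extended regular expression instead of a fixed string. A minimal sketch, assuming hypothetical command, host and service names; the single quotes in command_line are there so the shell does not treat the | as a pipe:

```
define command{
        command_name    check_http_regex        ; hypothetical command name
        # -r matches a regular expression against the page content
        command_line    $USER1$/check_http -H $ARG1$ -r '$ARG2$'
        }

define service{
        use                     basic-service
        host_name               site4           ; hypothetical host
        service_description     CheckString
        notification_options    w,u,c,r
        # OK if the page contains either Home or Checkout
        check_command           check_http_regex!site.com!Home|Checkout
        }
```

Note that this changes the state of the service, not of the host; the host state still comes from the host check.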
Re: [Nagios-users] Service checks
I've read the help docs; like I said, I have a working installation checking about 15 servers. I have a PING service that pings a host group. I can't do that with this because I'm checking a different string on each site, so I'm guessing it has to be separated out. Does this make sense?

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

You need to reread the docs, this is the most basic of questions.
http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service
Hint: you are missing a service_description in the last block, for one. You should not have name in that last block either; you need host_name. Read docs pls.

-h
Hari Sekhon
[Nagios-users] Service checks
I have a functioning Nagios setup, but I have a quick question. I am going through and adding website string checks so we can keep track of availability on one of our webservers. So in services.cfg I have:

define service{
        name                            generic-service ; Generic service name
        active_checks_enabled           1       ; Active service checks are enabled
        passive_checks_enabled          1       ; Passive service checks are enabled/accepted
        parallelize_check               1       ; Active service checks should be parallelized (Don't disable)
        obsess_over_service             1       ; We should obsess over this service (if necessary)
        check_freshness                 0       ; Default is to NOT check service 'freshness'
        notifications_enabled           1       ; Service notifications are enabled
        event_handler_enabled           1       ; Service event handler is enabled
        flap_detection_enabled          1       ; Flap detection is enabled
        process_perf_data               1       ; Process performance data
        retain_status_information       1       ; Retain status information across program restarts
        retain_nonstatus_information    1       ; Retain non-status information across program restarts
        register                        0       ; DONT REGISTER THIS DEFINITION - NOT A REAL SERVICE, JUST A TEMPLATE!
        }

define service{
        use                     generic-service
        name                    basic-service
        is_volatile             0
        check_period            24x7
        max_check_attempts      5
        normal_check_interval   3
        retry_check_interval    1
        notification_interval   15
        notification_period     24x7
        register                0
        }

I then have, as just one example:

define service{
        use                     basic-service
        name                    check-site4
        notification_options    w,u,c,r
        check_command           check_http!site.com!20!Home
        register                0
        }

My question is: you'll notice that I have the name as check-site4, and then later on in services.cfg I call up that check-site4. Is this the correct way? Do I need to define a service for each host and then later on call it by name to execute the service check? Is this a bad way of going about it?

Thanks,
Jerad
Re: [Nagios-users] Service checks
I think we're losing something in translation as I try to tell you what I'm doing. Here is what I've been doing:

define service{
        use                     basic-service
        name                    check-site4
        notification_options    w,u,c,r
        check_command           check_http!site.com!20!Home
        register                0
        }

and then later on in the config file:

define service{
        use                     check-site4
        service_description     HTTP
        contact_groups          mis
        host_name               site.com
        }

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

If you had read the docs like I said, I can't see how you could miss the fact that you have a service definition without a host_name or hostgroup_name! Where is this service check going to run if you haven't told it which host you want to test? I even gave you the anchored link to the exact place where it shows you the definitions that are needed for that block...

-h
Hari Sekhon

Jerad Riggin wrote:

Ok, so for example:

define service{
        use                     basic-service
        notification_options    w,u,c,r
        check_command           check_http!site.com!20!Home
        Service description     CheckString
        }

Sorry if this seems like a newbie question. I'm just trying to backtrack and optimize the config before I get too far down the road.

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

Also, you are making it more trouble than it needs to be. Instead of trying to do register 0 and making the last block a template block, just put the host name in there and the service description as I hinted, and that is all you need. You will then have the first 2 blocks plus one for each service on each host with different string checks...

-h
Hari Sekhon
Re: [Nagios-users] Service checks
I wish I could, but I'm not a programmer and have limited experience with Linux. I usually just manage Windows servers. Thanks for your help though. I condensed the string check into one instead of double/triple templating. Thanks.

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

I've just tried this, using the standard check_http plugin you are using. It appears not. As soon as one string is not found it goes critical. You could always write a custom plugin to test your websites or, much more easily, a shell wrapper plugin that calls the first check, calls the second if the first fails, and only goes critical if that fails too... Not perfect though; a better/custom plugin would serve you better.

-h
Hari Sekhon
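
The shell wrapper idea mentioned in this thread could look roughly like the sketch below. It is only an illustration under some assumptions: the name check_http_any is made up, the plugin path is a guess that varies between installs (override it with CHECK_HTTP), and anything other than an OK result for one string simply moves on to the next string.

```sh
#!/bin/sh
# check_http_any: hypothetical wrapper, not part of the standard plugins.
# Reports OK as soon as check_http finds ANY of the given strings.
# Usage: check_http_any <host> <timeout> <string> [<string> ...]

# Assumed plugin location; adjust or override via the CHECK_HTTP variable.
CHECK_HTTP=${CHECK_HTTP:-/usr/lib/nagios/plugins/check_http}

check_http_any() {
    if [ "$#" -lt 3 ]; then
        echo "UNKNOWN - usage: check_http_any <host> <timeout> <string>..."
        return 3
    fi
    host=$1
    timeout=$2
    shift 2
    status=3
    last_output=
    for wanted in "$@"; do
        # -s asks check_http to look for the string in the page content
        if last_output=$("$CHECK_HTTP" -H "$host" -t "$timeout" -s "$wanted"); then
            echo "$last_output"
            return 0
        else
            status=$?
        fi
    done
    # No string produced an OK result: pass on the last output and exit code
    echo "$last_output"
    return "$status"
}

# Act as a plugin only when called with arguments, so the file can also be sourced.
if [ "$#" -gt 0 ]; then
    check_http_any "$@"
    exit $?
fi
```

With a matching command definition (also an assumption, for example a command_line of check_http_any $ARG1$ $ARG2$ $ARG3$ $ARG4$), a service could then pass something like check_http_any!site.com!20!Home!Checkout.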
Re: [Nagios-users] Downtime scheduling
Thanks for the reply. Here is what I did:

define timeperiod{
        timeperiod_name maintenance
        alias           Maintenance period
        sunday          04:00-01:00
        monday          04:00-01:00
        tuesday         04:00-01:00
        wednesday       04:00-01:00
        thursday        04:00-01:00
        friday          04:00-01:00
        saturday        04:00-01:00
        }

And on the host check template I put maintenance in as the notification/check schedule. Will this work?

On 11/2/07, Larry Low [EMAIL PROTECTED] wrote:

Timeperiods is the best idea. You can either use the notification_period or check_period depending on your end needs.

Larry Low
Telesphere Networks, Inc

On Friday, November 02, 2007 11:59 AM, Jerad Riggin wrote:

What is the best way to allow for a 1-4AM outage on a few hosts? A cron job that will schedule it every night, or through time periods?
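
A note on the two options discussed here. As far as I know, Nagios timeperiod ranges are written within a single day, so a range such as 04:00-01:00 may be rejected or may not mean what was intended; excluding 01:00 to 04:00 is usually written as 00:00-01:00,04:00-24:00. For the cron alternative, a rough sketch is below. It assumes the external command file is enabled and lives at the path shown (a common default for source installs, but check command_file in nagios.cfg), and that date +%s is available, as it is on typical Linux systems. The script name and host name in the comments are placeholders.

```sh
#!/bin/sh
# Sketch: schedule three hours of fixed host downtime through the
# Nagios external command file. Paths and names are assumptions.

COMMAND_FILE=${COMMAND_FILE:-/usr/local/nagios/var/rw/nagios.cmd}

# Build one SCHEDULE_HOST_DOWNTIME line.
# Args: <now_epoch> <host> <start_epoch> <end_epoch> <author> <comment>
downtime_command() {
    dt_now=$1
    dt_host=$2
    dt_start=$3
    dt_end=$4
    dt_author=$5
    dt_comment=$6
    dt_duration=$((dt_end - dt_start))
    # fixed=1: downtime runs exactly from start to end; trigger_id=0: not triggered
    printf '[%s] SCHEDULE_HOST_DOWNTIME;%s;%s;%s;1;0;%s;%s;%s\n' \
        "$dt_now" "$dt_host" "$dt_start" "$dt_end" "$dt_duration" "$dt_author" "$dt_comment"
}

# Example crontab entry (hypothetical script path), run at 00:55 every night:
#   55 0 * * * /usr/local/bin/schedule_downtime.sh myhost
if [ "$#" -gt 0 ]; then
    now=$(date +%s)
    start=$((now + 300))        # five minutes from now, i.e. 01:00 if run at 00:55
    end=$((start + 3 * 3600))   # three hours later, i.e. 04:00
    downtime_command "$now" "$1" "$start" "$end" "cron" "Nightly maintenance" >> "$COMMAND_FILE"
fi
```

Scheduled downtime suppresses notifications and is reported separately in the availability reports, while a restricted check_period stops the checks themselves during that window, so which one fits depends on what the reports should show.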
[Nagios-users] Nagios e-mail reports
Last night I got a few warning/critical errors with a few recovery e-mails. However towards this morning I received 2 critical alerts and even though they recovered I didn't receive a recovery e-mail. Any ideas?

Jerad
Re: [Nagios-users] Nagios e-mail reports
Now that I look at it, it looks like the down e-mails that didn't have corresponding recovery e-mails were delayed for some reason (by about 15 minutes). So perhaps I'm just reading this out of order. I have no clue why that would happen.

On 7/24/07, Jeremy C. Reed [EMAIL PROTECTED] wrote:

What do your nagios logs say about it?

Jeremy C. Reed
Re: [Nagios-users] Recurring Downtime
What about this one?
http://www.nagiosexchange.org/Utilities.16.0.html?tx_netnagext_pi1[p_view]=893

On 6/28/07, Moore, Nicholas [EMAIL PROTECTED] wrote:

New Nagios user here (2.9). I have a server that restarts every day. I want to schedule downtime for a server every day at the same time. I know that I can't do it with the basic nagios package. I found an add-on on nagios exchange (http://www.nagiosexchange.org/Downtimes.38.0.html?tx_netnagext_pi1%5Bp_view%5D=363). The new version of the add-on is supposed to work with 2.x. Does anyone here use it and have any suggestions for me? Is there another way to accomplish what I want to do?

Thanks,
Nick
[Nagios-users] Notifications
Does Nagios only send out notifications once it has reached the max retry on services? If so, is the only way to make it automatically notify you as soon as there is a problem to set the max retry to 1?

Thanks
Jerad
[Nagios-users] PNP
I'm trying to get the PNP stuff setup so I can create some graphs for the disk usage of my servers. I have it installed and configured correctly as far as I can tell. However, no performance data is being generated in the perfdata folder. What are some things I can look for?

Thanks
Re: [Nagios-users] Windows monitoring
Sounds good... What is the default command definition for check_nt?

On 6/26/07, Frost, Mark {PBG} [EMAIL PROTECTED] wrote:

That's what I've done. My understanding in Nagios is that a config file is a config file. I've never tried it, but I believe you could stuff your entire config (hosts, services, checkcommands, notification commands) into a single file. You'd be nuts given how incredibly complex Nagios configs can get, but you could. By the same token, you could break them out into as many individual files as you like, as long as your nagios.cfg lists all the files it's supposed to read.

I did as you indicated: I keep my hosts (Windows or otherwise) in my hosts.cfg file. I tend to sort my config files by application rather than OS type, so I put the service entries in files/directories with still other names.

Mark

On Tuesday, June 26, 2007 9:53 AM, Jerad Riggin wrote:

Thanks for the info. So for example, it wants me to define new hosts in the windows.cfg. Is there a point to that? Can I just put service definitions in the windows.cfg and utilize the hosts I already have defined in hosts.cfg?

On 6/26/07, Frost, Mark {PBG} [EMAIL PROTECTED] wrote:

Jerad,

I took this to mean that it's a sort of generic/example config file. You can always break out your configuration into additional files/directories as makes sense for your installation, as long as you add new lines in the nagios.cfg file to tell it to read each new file and/or directory you create.

I found a few things in that 3.0 doc for monitoring with NSClient++ that weren't correct, so you will find yourself making some modifications to get things to work if you've completely copied entries from that doc page.

Mark

On Tuesday, June 26, 2007 9:16 AM, Jerad Riggin wrote:

I'm going through this article:
http://nagios.sourceforge.net/docs/3_0/monitoring-windows.html
It is referring to a windows.cfg within nagios.cfg. I don't see any such commented out line in my nagios.cfg, and I can't even find a windows.cfg at all. Is the difference that I am running 2.9 and this is referring to 3.0?
[Nagios-users] Monitoring Windows Services
I feel like I'm spamming this mailing list. So far I'm monitoring a few processes and services on a Windows 2003 Server. Would there be a way to have Nagios report if any automatic service fails? Or do I need to go through and define each service? Let me know if you need more information.
Re: [Nagios-users] Nagios HTTP monitoring
Any idea why this -a switch isn't working? If I do this in a command prompt it works fine:

        ./check_http -S -H url.com -a username:password

It returns an OK. The same command in Nagios won't work (unauthorized).

On 6/22/07, Jerad Riggin [EMAIL PROTECTED] wrote:

It's still giving me an unauthorized message. I've verified the credentials work for me to get in through a browser. Is there something else I could be missing?

On 6/22/07, GJP [EMAIL PROTECTED] wrote:

Jerad,

Not entirely, because you feed the arguments to the actual command with $ARGn$. So in the service definition:

        check_http!site.com!5!admin:password

And the command_line format in the command definition (probably in /etc/nagios-plugins/config/http.cfg) should be:

        command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -t $ARG2$ -a $ARG3$

Be careful with that -t 5, btw. That is a very low value for a timeout. Hope it helps.

Regards,
Gerard.

On Friday 22 June 2007 19:17:29 Jerad Riggin wrote:

So check_http!-H site.com -t 5 -a admin:password should be check_http!-H site.com!-t 5!-a admin:password?

On 6/22/07, GJP [EMAIL PROTECTED] wrote:

Jerad,

I didn't understand that either. That's why I customized the check_http command definition. That way (in my previous example) you can have your http checks from any Nagios client. Useful, for instance, for off-site checks.

Regarding the -a switch: the -a $ARG1$ (or whatever) should be in your command definition, and your parameters, as you 'feed' them to the command in your service definition, are separated with !.

Regards,
Gerard.

On Friday 22 June 2007 16:59:30 Jerad Riggin wrote:

If I change the hosts.cfg from the IP of the server (10.10.10.7) to the URL I'm specifying in services.cfg, it works fine. The default site for 10.10.10.7 is a CRM site which requires basic auth. So even though I'm specifying a URL in services.cfg, it looks like it's being overridden by the hosts.cfg, so it can't auth. What is the point of the -H switch in services.cfg if it always uses the IP for the host in hosts.cfg? I tried that format (-a admin:password) at the end of the check in services.cfg but it still failed with unauthorized.
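
To make the $ARGn$ point in this thread concrete, here is a small sketch that imitates the substitution. It is not Nagios code, just an illustration of the rule described above: the check_command value is split on "!", the first piece names the command, and the remaining pieces replace $ARG1$, $ARG2$ and so on in the command_line. The function name is made up, only the first occurrence of each macro is replaced, and macros with no matching argument are left untouched here.

```sh
#!/bin/sh
# Illustration only: imitates how a check_command value is split on "!"
# and substituted into the $ARGn$ macros of a command_line.
expand_check_command() {
    check_command=$1
    command_line=$2
    case $check_command in
        *!*) rest=${check_command#*!} ;;   # everything after the command name
        *)   rest= ;;                      # no arguments given
    esac
    n=1
    while [ -n "$rest" ]; do
        arg=${rest%%!*}                    # next argument, up to the following "!"
        macro="\$ARG${n}\$"
        case $command_line in
            *"$macro"*)
                before=${command_line%%"$macro"*}
                after=${command_line#*"$macro"}
                command_line=$before$arg$after
                ;;
        esac
        case $rest in
            *!*) rest=${rest#*!} ;;
            *)   rest= ;;
        esac
        n=$((n + 1))
    done
    printf '%s\n' "$command_line"
}

# Example, using the command_line quoted in this thread:
#   expand_check_command 'check_http!site.com!5!admin:password' \
#       '/usr/lib/nagios/plugins/check_http -H $ARG1$ -t $ARG2$ -a $ARG3$'
```

Passing 'check_http!-H site.com -t 5 -a admin:password' instead puts the whole option string into $ARG1$, which is one way a check that works at the prompt can end up as a different command line inside Nagios.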
Re: [Nagios-users] Nagios HTTP monitoring
Where do I use the --help switch? I've tried in the terminal but I get a command not found error. Thanks for the links btw.

On 6/22/07, GJP [EMAIL PROTECTED] wrote:

Yes, of course there is. This one is for Samba: www.samba.org

And like Patrick said, if you do a check_http -h it tells you what it can do. And if you're on a common Linux distro you can find files with:

        locate [filename]

Then I would also check:
http://nagios.sourceforge.net/docs/2_0/beginners.html
http://nagios.sourceforge.net/docs/3_0/monitoring-publicservices.html

And I would take your time for all this. My Linux experience is over 8 years and it took me three weeks to have a full-blown, secure, SMS-sending Nagios setup.

Good luck!
G

On Thursday 21 June 2007 23:24:43 Jerad Riggin wrote:

Is there any link to information about that? Like I said, this is basically the 2nd day I've used Linux and Nagios, so much of this seems like a different language to me. Thanks for the info.

On 6/21/07, GJP [EMAIL PROTECTED] wrote:

Hi Jerad,

If you mean http auth, this is what I use. The check_command in the service definition:

        check_command   check_http_auth!host.example.com!/path!user:password

And the command definition:

define command{
        command_name    check_http_auth
        command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -u $ARG2$ -a $ARG3$
        }

But if you mean authentication within a Windows AD domain, maybe Samba can help, because then you can make your Linux server a member of the AD domain.

Regards,
Gerard.

On Thursday 21 June 2007 22:48:08 Jerad Riggin wrote:

I'm new to Linux and new to Nagios so this may sound very stupid. Basically I have everything up and running: ping tests work, http tests work, and the smtp test I have set up works. If the host goes down I get a host down alert, and a host up alert once the problem is resolved. Is there a way to have Nagios send me an alert for critical status? I'm pretty sure I have that set up but it isn't working.

After this I went ahead and set up basic HTTP monitoring for all of our servers; however, 403 errors are not sending any notifications to me. Also, I'm trying to monitor the CRM site on our Windows domain; however, since the Linux box is obviously not on the domain, it can't authenticate when hitting the site, so it can't find the string I've specified. Any ideas on how to make Nagios authenticate and then check the string?

Thanks
[Nagios-users] Notifications
Sorry if this question is confusing. I'm monitoring the HTTP service on a server. I have it set to notify on w,u,c,r in services.cfg. I turned off HTTP to trigger a warning state and I successfully received an e-mail. I turned HTTP back on and successfully received a recovery e-mail. I then triggered another warning and never received an e-mail, and a notification wasn't listed in Nagios' web interface. Any ideas?
Re: [Nagios-users] Nagios HTTP monitoring
If I change the hosts.cfg from the IP of the server (10.10.10.7) to the URL I'm specifying in services.cfg, it works fine. The default site for 10.10.10.7 is a CRM site which requires basic auth. So even though I'm specifying a URL in services.cfg, it looks like it's being overridden by the hosts.cfg, so it can't auth. What is the point of the -H switch in services.cfg if it always uses the IP for the host in hosts.cfg? I tried that format (-a admin:password) at the end of the check in services.cfg but it still failed with unauthorized.

On 6/22/07, Marc Powell [EMAIL PROTECTED] wrote:

On Friday, June 22, 2007 8:41 AM, Jerad Riggin wrote:

I have a check_http command that works perfectly at the command prompt:

        ./check_http -H lagarde -t 5 -s LaGarde Intranet site

However in nagios,

        check_http!-H lagarde -t 5 -s LaGarde Intranet site

still returns an unauthorized. So basically why would it work in a command prompt but not in nagios?

Perhaps some difference in environment? Did you test as the nagios user? Is that the _exact_ way it's defined in the config?

And what is the syntax for the -a (basic auth) switch? Thanks again for everyone's help.

-a username:password.

--
Marc
Re: [Nagios-users] Notifications
Contacts.cfg

define contact{
    contact_name                    mis
    alias                           mis
    host_notification_period        24x7
    service_notification_period     24x7
    host_notification_options       d,r,u,f
    service_notification_options    c,r,w,f
    host_notification_commands      host-notify-by-email
    service_notification_commands   notify-by-email
    email                           [EMAIL PROTECTED] (edited)
    }

Services.cfg

# Generic service definition template
define service{
    name                            generic-service ; Generic service name
    active_checks_enabled           1   ; Active service checks are enabled
    passive_checks_enabled          1   ; Passive service checks are enabled/accepted
    parallelize_check               1   ; Active service checks should be parallelized (Don't disable)
    obsess_over_service             1   ; We should obsess over this service (if necessary)
    check_freshness                 0   ; Default is to NOT check service 'freshness'
    notifications_enabled           1   ; Service notifications are enabled
    event_handler_enabled           1   ; Service event handler is enabled
    flap_detection_enabled          1   ; Flap detection is enabled
    process_perf_data               1   ; Process performance data
    retain_status_information       1   ; Retain status information across program restarts
    retain_nonstatus_information    1   ; Retain non-status information across program restarts
    register                        0   ; DONT REGISTER THIS DEFINITION - NOT A REAL SERVICE, JUST A TEMPLATE!
    }

# Generic for all services
define service{
    use                             generic-service
    name                            basic-service
    is_volatile                     0
    check_period                    24x7
    max_check_attempts              5
    normal_check_interval           5
    retry_check_interval            1
    notification_interval           5
    notification_period             24x7
    register                        0
    }

define service{
    use                             basic-service
    name                            ping-service
    notification_options            n
    check_command                   check_ping!1000.0,20%!2000.0,60%
    register                        0
    }

define service{
    use                             basic-service
    name                            check-http
    notification_options            n
    check_command                   check_http
    register                        0
    }

define service{
    use                             basic-service
    name                            check-site
    notification_options            w,u,c,r
    check_command                   check_http!-H lagarde.lagardeinc.com -t 5 -s LaGarde
    register                        0
    }

define service{
    use                             basic-service
    name                            check-smtp
    notification_options            w,u,c,r
    check_command                   check_smtp!-t 5 -e mail.domain.com
    register                        0
    }

define service{
    use                             ping-service
    service_description             PING
    contact_groups                  mishelpdesk
    hostgroup_name                  basic-clients
    # host_name                     one_client
    }

define service{
    use                             check-http
    service_description             HTTP
    contact_groups                  mishelpdesk
    hostgroup_name                  basic-clients
    }

define service{
    use                             check-site
    service_description             Site
    contact_groups                  mishelpdesk
    host_name                       lg02
    }

define service{
    use                             check-smtp
    service_description             SMTP
    contact_groups                  mishelpdesk
    host_name                       lg01
    }

On 6/22/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote:
> Can you post your definition and the template that it uses? It may have something to do with your notification options. Check your mail log and nagios log to see if anything happened behind the scenes, and post any funny or suspicious output. Thanks.
>
> Stephen Valdinger
> MIS Helpdesk Coordinator
> Dover Chemical Corporation
> 3676 Davis Rd NW
> Dover, OH 44622
> 330-365-3622
> [EMAIL PROTECTED]
>
> From: Jerad Riggin [mailto:[EMAIL PROTECTED]
> Sent: Friday, June 22, 2007 10:56 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Notifications
>
> > Sorry if this question is confusing. I'm monitoring the HTTP service
Re: [Nagios-users] Notifications
That's true for check-http, but check-site for one of my servers has w,u,c,r. It worked once, but all alerts since then have gone unnotified.

Also, how do you check a specific URL for http? It looks like nagios just uses the address in hosts.cfg and ignores the -H site.com switch you add in services.cfg.

On 6/22/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote:
> You are supplying a notification option of n inside the service definition for http. Should it not be set to w,c,r etc.? Try doing that and triggering something to happen again.
>
> Stephen Valdinger
>
> From: Jerad Riggin [mailto:[EMAIL PROTECTED]
> Sent: Friday, June 22, 2007 11:54 AM
> To: Valdinger, Stephen (DOV, MSX)
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Notifications
Re: [Nagios-users] Nagios HTTP monitoring {Disarmed} {Fraud?}
commands.cfg

# 'check_http' command definition
define command{
    command_name    check_http
    command_line    $USER1$/check_http -H $HOSTADDRESS$ $ARG1$ -a $ARG2$
    }

Here is an entry in services.cfg:

define service{
    use                     basic-service
    name                    check-site
    notification_options    w,u,c,r
    check_command           check_http!-H site.com -t 5 -a admin:password -s String
    register                0
    }

On 6/22/07, Marc Powell [EMAIL PROTECTED] wrote:
> -----Original Message-----
> From: Jerad Riggin [mailto:[EMAIL PROTECTED]
> Sent: Friday, June 22, 2007 10:00 AM
> To: Marc Powell
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Nagios HTTP monitoring {Disarmed} {Fraud?}
>
> > If I change the hosts.cfg from the IP of the server (10.10.10.7) to the URL I'm specifying in services.cfg, it works fine. The default site for 10.10.10.7 is a CRM site which requires basic auth. So even though I'm specifying a URL in services.cfg, it looks like it's being overridden by the hosts.cfg so it can't auth. What is the point of the -H switch in services.cfg if it always uses the IP for the host in hosts.cfg?
> >
> > I tried that format (-a admin:password) at the end of the check in services.cfg but it still failed with unauthorized.
>
> It's looking more and more clear that your command{} definition doesn't do what you think it does. You don't usually specify switches such as -H and -a as part of the service definition. Perhaps you should post the host, service and relevant command definitions for us to look at. I know for a fact that check_http can and will work the way you want if properly called.
>
> --
> Marc
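With the command definition posted above, everything after the first ! in check_command becomes $ARG1$ and $ARG2$ stays empty, so the line that actually runs should expand to roughly `check_http -H 10.10.10.7 -H site.com -t 5 -a admin:password -s String -a`: two -H options and a trailing -a with no value. How the plugin resolves that may depend on its version. check_http has separate switches for the address to connect to (-I) and the host name sent in the request (-H), so one way to reach a name-based virtual host is sketched below. The command name check_http_vhost, the argument order, and the 10 second timeout are assumptions for illustration, not something posted in the thread.

```
define command{
    command_name    check_http_vhost   ; hypothetical name
    ; connect to the host's address, but ask for the virtual host in $ARG1$
    command_line    $USER1$/check_http -I $HOSTADDRESS$ -H $ARG1$ -u $ARG2$ -t 10 -s "$ARG3$"
    }

define service{
    use                     basic-service
    host_name               lg02
    service_description     Site
    contact_groups          mishelpdesk
    notification_options    w,u,c,r
    ; $ARG1$ = virtual host, $ARG2$ = URL path, $ARG3$ = string to look for
    check_command           check_http_vhost!lagarde.lagardeinc.com!/!LaGarde
    }
```

Keeping the switches in the command definition and passing only values through the ! separators matches the advice given elsewhere in this thread.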
Re: [Nagios-users] Nagios HTTP monitoring
So

  check_http!-H site.com -t 5 -a admin:password

should be

  check_http!-H site.com!-t 5!-a admin:password

?

On 6/22/07, GJP [EMAIL PROTECTED] wrote:
> Jerad,
>
> I didn't understand that either. That's why I customized the check_http command definition. That way (as in my previous example) you can run your http checks from any Nagios client. Useful, for instance, for off-site checks.
>
> Regarding the -a switch: the -a $ARG1$ (or whatever) should be in your command definition, and the parameters you 'feed' to the command go in your service definition, separated with !.
>
> Regards,
> Gerard.
Re: [Nagios-users] Notifications
I'm running 2.9. The problem is that the server I'm monitoring uses host headers, so multiple sites are on 10.10.10.7. The default site for *.7 is a Microsoft CRM site which requires basic authentication. So even though I specify -H lagarde.lagardeinc.com in services.cfg, it seems to just hit the IP listed for the host, which reaches Microsoft CRM, and nagios reports unauthorized.

On 6/22/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote:
> http://nagios.sourceforge.net/docs/3_0/monitoring-publicservices.html
>
> [Snippet]
> A more advanced definition for monitoring the HTTP service is shown below. This service definition will check to see if the /download/index.php URI contains the string latest-version.tar.gz. It will produce an error if the string isn't found, the URI isn't valid, or the web server takes longer than 5 seconds to respond.
>
> define service{
>     use                     generic-service ; Inherit default values from a template
>     host_name               remotehost
>     service_description     Product Download Link
>     check_command           check_http!-u /download/index.php -t 5 -s latest-version.tar.gz
>     }
> [/Snippet]
>
> As you can see, supplying the -u switch will allow you to choose a particular page, and you can then use the -s switch to search for specific text content on that page.
>
> The problem with notifications eludes me, but I know I read about it in the documentation somewhere. Also, what version are you using?
>
> Stephen Valdinger
>
> From: Jerad Riggin [mailto:[EMAIL PROTECTED]
> Sent: Friday, June 22, 2007 12:03 PM
> To: Valdinger, Stephen (DOV, MSX)
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] Notifications
Re: [Nagios-users] Nagios HTTP monitoring
It's still giving me an unauthorized message. I've verified that the credentials work for me through a browser. Is there something else I could be missing?

On 6/22/07, GJP [EMAIL PROTECTED] wrote:
> Jerad,
>
> Not entirely, because you feed the arguments to the actual command with $ARGn$.
>
> So in the service definition:
>
>   check_http!site.com!5!admin:password
>
> And the command_line in the command definition (probably in /etc/nagios-plugins/config/http.cfg) should be:
>
>   command_line /usr/lib/nagios/plugins/check_http -H $ARG1$ -t $ARG2$ -a $ARG3$
>
> Be careful with that -t 5, by the way. That is a very low timeout value.
>
> Hope it helps.
>
> Regards,
> Gerard.
>
> On Friday 22 June 2007 19:17:29 Jerad Riggin wrote:
> > So check_http!-H site.com -t 5 -a admin:password should be check_http!-H site.com!-t 5!-a admin:password?
Re: [Nagios-users] Nagios HTTP monitoring
Is there any link to information about that? Like I said, this is basically the second day I've used Linux and Nagios, so much of this seems like a different language to me. Thanks for the info.

On 6/21/07, GJP [EMAIL PROTECTED] wrote:
> Hi Jerad,
>
> If you mean http auth, this is what I use. The check_command in the service definition:
>
>   check_command check_http_auth!host.example.com!/path!user:password
>
> And the command definition:
>
>   define command{
>       command_name    check_http_auth
>       command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -u $ARG2$ -a $ARG3$
>       }
>
> But if you mean authentication within a Windows AD domain, maybe Samba can help, because then you can make your Linux server a member of the AD domain.
>
> Regards,
> Gerard.
>
> On Thursday 21 June 2007 22:48:08 Jerad Riggin wrote:
> > I'm new to Linux and new to Nagios so this may sound very stupid. Basically I have everything up and running: ping tests work, http tests work, and the smtp test I have set up works. If the host goes down I get a host down alert, and a host up alert once the problem is resolved. Is there a way to have Nagios send me an alert for critical status? I'm pretty sure I have that set up but it isn't working. After this I went ahead and set up basic HTTP monitoring for all of our servers; however, 403 errors are not sending any notifications to me.
> >
> > Also, I'm trying to monitor the CRM site on our Windows domain. However, since the Linux box is obviously not on the domain, it can't authenticate when hitting the site, so it can't find the string I've specified. Any ideas on how to make nagios authenticate and then check the string?
> >
> > Thanks
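The check_http_auth definition quoted above connects to whatever $ARG1$ resolves to and keeps the password in the service definition. A hedged variant is sketched below, assuming the standard $USERn$ macros from the resource file are available. $USER3$ is assumed to be free and to hold the credentials (it is not part of the thread), and the command name check_http_auth_vhost is hypothetical.

```
# resource.cfg (assumption: $USER3$ is not already used for something else)
$USER3$=admin:password

# commands.cfg
define command{
    command_name    check_http_auth_vhost   ; hypothetical name
    ; connect to the host's address, request virtual host $ARG1$ and path $ARG2$
    command_line    $USER1$/check_http -I $HOSTADDRESS$ -H $ARG1$ -u $ARG2$ -a $USER3$
    }

# services.cfg: only the values are passed, separated with !
#   check_command   check_http_auth_vhost!host.example.com!/path
```

The plugin's help describes -a as being for sites with basic authentication, so a site that only accepts another scheme would presumably still answer with unauthorized even when the credentials are correct.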
Re: [Nagios-users] Nagios HTTP monitoring
More importantly, can I just specify a URL to check? The default site on the server in question directs you to CRM, which requires auth. However, on the same server we have an intranet site which doesn't require authorization. How would I go about just checking the intranet URL?

On 6/21/07, Jerad Riggin [EMAIL PROTECTED] wrote:
> Is there any link to information about that? Like I said, this is basically the second day I've used Linux and Nagios, so much of this seems like a different language to me. Thanks for the info.