[Nagios-users] String errors

2007-11-29 Thread Jerad Riggin
Ok, so I'm monitoring about 100 websites with string checks via check_http.
We are mirroring what our datacenter actually checks, so we have
notifications turned off so when a site goes down we aren't being spammed by
the datacenter and our nagios installation.

The issue is that every once in a while a string changes on the site, so it
goes critical in our nagios.  We might not notice it for a day, which
messes up our availability reports.  Is there a way to retroactively mark
the time that it was critical as scheduled downtime?
-
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell.  From the desktop to the data center, Linux is going
mainstream.  Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Notifications

2007-11-19 Thread Jerad Riggin
I have a quick question.  I have a host template that disables
notifications and I have about 53 hosts using it.  If the
notifications are disabled, why then does it still require that I have
values for notification_interval, _period, and _options?



[Nagios-users] Notifications

2007-11-19 Thread Jerad Riggin
I have a nagios 2.9 install.  I have one host with multiple services
being monitored.  On the 16th the host didn't respond to a ping (the
server rebooted), and recovered within 3 minutes.  I received an
e-mail for both the failure and recovery.  I am also monitoring some
windows services on the same box using NsClient++.  It shows on the
same day that after it recovered the TrendMicro virus process was down
for 6 hours.  I didn't receive an e-mail during this entire time.  It
is set at 5 max attempts, 3 normal check, and 1 retry with a
notification interval of 15 minutes.  It should have at least notified
once but it didn't.  Any ideas?



[Nagios-users] CheckNT

2007-11-19 Thread Jerad Riggin
I am monitoring the CPU load with NSClient on a host.  Here is the
service definition

define service{
        use                     windows-service
        name                    check-cpu
        notification_options    w,u,c,r
        check_command           check_nt!CPULOAD!-l 5, 80, 90
        register                0
        }

Here is the actual call to that service

define service{
use check-cpu
service_description CPU
contact_groups  mis
hostgroup_name  windows-clients
}

Here is what I see in the nagios interface

CPU OK 11-19-2007 14:56:07 122d 2h 7m 2s 1/5 not enough values for -l
parameters

Any ideas?
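
One possible cause, offered as a guess rather than a confirmed diagnosis: if the check_nt command definition passes $ARG2$ to the plugin unquoted, the spaces after the commas split "5, 80, 90" into three separate words, and -l only receives "5,". The Python sketch below uses shlex.split to mimic shell word splitting. The command lines, host address and port are placeholders assumed for illustration, not taken from the poster's configuration.

```python
import shlex


def value_seen_by_option(command_line, option="-l"):
    """Return the single word a shell would hand to `option`, or None."""
    words = shlex.split(command_line)
    if option not in words:
        return None
    index = words.index(option)
    # The option's value is simply the next word, if there is one.
    return words[index + 1] if index + 1 < len(words) else None


# Hypothetical expansions of the check command (address and port are placeholders).
with_spaces = "check_nt -H 192.0.2.10 -p 12489 -v CPULOAD -l 5, 80, 90"
without_spaces = "check_nt -H 192.0.2.10 -p 12489 -v CPULOAD -l 5,80,90"

print(value_seen_by_option(with_spaces))
print(value_seen_by_option(without_spaces))
```

If that is what is happening here, writing the argument without spaces (check_nt!CPULOAD!-l 5,80,90) should hand the plugin all three values in one word.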



Re: [Nagios-users] Notifications

2007-11-19 Thread Jerad Riggin
define service{
        name                            generic-service ; Generic service name
        active_checks_enabled           1   ; Active service checks are enabled
        passive_checks_enabled          1   ; Passive service checks are enabled/accepted
        parallelize_check               1   ; Active service checks should be parallelized (Don't disable)
        obsess_over_service             1   ; We should obsess over this service (if necessary)
        check_freshness                 0   ; Default is to NOT check service 'freshness'
        notifications_enabled           1   ; Service notifications are enabled
        event_handler_enabled           1   ; Service event handler is enabled
        flap_detection_enabled          1   ; Flap detection is enabled
        process_perf_data               1   ; Process performance data
        retain_status_information       1   ; Retain status information across program restarts
        retain_nonstatus_information    1   ; Retain non-status information across program restarts
        register                        0   ; DONT REGISTER THIS DEFINITION - NOT A REAL SERVICE, JUST A TEMPLATE!
        }


define service{
        use                     generic-service
        name                    windows-service
        is_volatile             0
        check_period            24x7
        max_check_attempts      5
        normal_check_interval   3
        retry_check_interval    1
        notification_interval   15
        notification_period     24x7
        register                0
        }

define service{
        use                     windows-service
        name                    check-trend
        notification_options    w,u,c,r
        check_command           check_nt!SERVICESTATE!-d SHOWALL -l ofcservice
        register                0
        }

define service{
        use                     check-trend
        service_description     TrendMicro
        contact_groups          mis
#       hostgroup_name          windows-clients
        host_name               lg03
        }



On Nov 19, 2007 3:11 PM, Aaron Devey [EMAIL PROTECTED] wrote:
 What are your notification options set to?  In 2.9 the default is none
 so if you didn't specify them for that service, it won't alert.  If
 that's not the answer, perhaps you can paste the definitions for
 your service, contact, and notification command?

 -Aaron








Re: [Nagios-users] Notifications

2007-11-19 Thread Jerad Riggin
I found out via nagios.  It said the duration was 6 hours for the
outage, and I couldn't match it up with any alerts that were sent to
my mailbox.

Thanks for the tips

On Nov 19, 2007 3:48 PM, Aaron Devey [EMAIL PROTECTED] wrote:
 I don't see any obvious problems with your service definitions.  Did you
 find out TrendMicro was down for 6 hours by reviewing the nagios logs?
 If so, that means nagios at least saw the service had a problem.  If you
 found out it was down by some other means, perhaps you can check the
 nagios logs to make sure nagios saw a critical or warning problem
 with the service.

 Also, if you have log_notifications turned on, try examining the logs of
 the timeperiod it was down.  If you don't see any attempts to send a
 notification for TrendMicro on lg03, then it's likely a configuration
 problem somewhere.

 Finding it is the hard part. :)  The first places I would check are the
 service_notification_period, service_notification_options, and
 service_notification_commands for the contacts in the 'mis' group.
 Follow the service_notification_commands to make sure the command it
 points to is set up correctly as well.  If there are no problems there,
 I'd make sure there are no service escalations for that service.

 If that doesn't help, I have no idea what the problem could be. :)

 Good luck,

 -Aaron
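
The log review suggested above can be sketched in Python. This is only an illustration: it assumes notification entries in nagios.log look like "[epoch] SERVICE NOTIFICATION: contact;host;service;state;command;output", and the sample lines, contact name and command name below are made up.

```python
def notifications_for(log_lines, host, service):
    """Yield SERVICE NOTIFICATION log entries for one host/service pair."""
    marker = "SERVICE NOTIFICATION: "
    for line in log_lines:
        _, found, details = line.partition(marker)
        if not found:
            continue  # not a service notification entry
        # Assumed field order: contact;host;service;state;command;output
        fields = details.split(";")
        if len(fields) >= 3 and fields[1] == host and fields[2] == service:
            yield line.rstrip("\n")


# Made-up sample entries, for illustration only.
sample_log = [
    "[1195250000] SERVICE ALERT: lg03;TrendMicro;CRITICAL;HARD;5;ofcservice: Stopped\n",
    "[1195250001] SERVICE NOTIFICATION: jdoe;lg03;TrendMicro;CRITICAL;notify-by-email;ofcservice: Stopped\n",
    "[1195250002] SERVICE NOTIFICATION: jdoe;lg03;CPU;WARNING;notify-by-email;CPU Load 85%\n",
]

for entry in notifications_for(sample_log, "lg03", "TrendMicro"):
    print(entry)
```

If the outage window contains SERVICE ALERT entries for the service but nothing from this filter, that points at the contact or notification configuration rather than at the check itself.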



 





Re: [Nagios-users] Host monitoring

2007-11-14 Thread Jerad Riggin
I understand this part, but for example right now the service
description for each host is CheckString.  That way I can group all of
them into a service group based on that description so we can pull
availability reports for the host and the only service being
monitored.  If I wanted to do it your way, wouldn't I have to use the
service description of each check in the servicegroups config file?

On Nov 12, 2007 5:13 PM, Marc Powell [EMAIL PROTECTED] wrote:



  -Original Message-
  From: [EMAIL PROTECTED] [mailto:nagios-users-
  [EMAIL PROTECTED] On Behalf Of Jerad Riggin
  Sent: Monday, November 12, 2007 4:52 PM
  To: nagios-users@lists.sourceforge.net
  Subject: [Nagios-users] Host monitoring
 
  So what would be the best way to monitor in this situation?
 
  I have Host A, hosting two websites.
 
  Currently I have both websites configured as separate hosts, so in
  nagios it shows up as ServerA - site.com, and another host ServerA -
  siteb.com
 
  This doesn't seem efficient since I am pinging both sites, but it's
  the same host.  Ideally I would want to just ping the host, and check
  strings on each site under that host.  Any recommendations?

 Unless I'm missing something obvious, this seems pretty straightforward.
 host{} definition for the web server machine, service{} definitions for
 each vhost using check_http to check them. You can craft a command{}
 definition to pass the variable parameters like vhost name, uri and
 search string like so --

 define command {
        command_name    check_http_url
        command_line    $USER1$/check_http -I $HOSTADDRESS$ -H $ARG1$ -u $ARG2$ -w 20 -c 30 -t 35 -s $ARG3$
        }

 To check two vhosts --

 define service {
        use                     generic-service
        host_name               my_webserver
        service_description     HTTP VHOST site1
        check_command           check_http_url!www.site1.com!/mypage!/html
        }

 define service {
        use                     generic-service
        host_name               my_webserver
        service_description     HTTP VHOST site2
        check_command           check_http_url!www.site2.com!/someotherpage!/html
        }

 --
 Marc



[Nagios-users] Monitoring a process

2007-11-13 Thread Jerad Riggin
Would it be possible using Nsclient++ to monitor for a process name?
We need to make sure Outlook.exe is running on the server and if it
isn't, send out notifications.

Thanks,

Jerad



Re: [Nagios-users] Monitoring a process

2007-11-13 Thread Jerad Riggin
Thanks for the response.  This is what I did just a bit ago.  I have
it successfully monitoring outlook.exe, however no e-mails are being
sent when it's critical.

Here is what I have as the service definition:

define service{
   use                     leadlander
   host_name               leadlandervm
   service_description     Outlook Process
   contact_groups          bo,mis
   notification_options    w,u,c,r
   check_command           check_nt2!PROCSTATE!-d SHOWALL -l outlook.exe
   }

Here is what I have for the leadlander template


define service{
   use                     generic-service
   name                    leadlander
   is_volatile             0
   check_period            24x7
   max_check_attempts      15
   normal_check_interval   20
   retry_check_interval    20
   notification_interval   20
   notification_period     24x7
   register                0
   }

Does it only notify after the first retry failure, or should it notify
as soon as the service is critical?  Any ideas?
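
A rough way to estimate the delay before the first notification, assuming Nagios only notifies once the service reaches a hard state after max_check_attempts consecutive failed checks, and that every recheck fails. The function name is hypothetical, and interval_length is assumed to be the default of 60 seconds.

```python
def minutes_until_first_notification(max_check_attempts, retry_check_interval,
                                     interval_length=60):
    """Minutes between the first failed check and the hard state.

    Assumes every recheck also fails; interval_length is in seconds
    (60 is assumed to be the configured default).
    """
    if max_check_attempts < 1:
        raise ValueError("max_check_attempts must be at least 1")
    # The first failure counts as attempt 1, so only the remaining
    # attempts are spaced retry_check_interval apart.
    retries = max_check_attempts - 1
    return retries * retry_check_interval * interval_length / 60


# Values from the leadlander template above.
print(minutes_until_first_notification(15, 20))
```

With 15 attempts and 20-minute retries this comes to 280 minutes, so under these assumptions no e-mail would be expected until the process has been down for well over four hours.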


On Nov 13, 2007 2:39 PM, Rich Sasko [EMAIL PROTECTED] wrote:



 Jerad,

 We have the NSClient++ running on our Windows servers and the following
 service check is an example of how we are monitoring services from the nagios
 server:

 define service{
 use   generic-service
 host_name email server
 service_description   explorer
 check_command check_nt!PROCSTATE!-d SHOWALL -l explorer.exe
 }

 It is actually in one of the sample config files, you should just have to tell
 it what process you want to watch.




Re: [Nagios-users] Monitoring a process

2007-11-13 Thread Jerad Riggin
Just to clarify

normal_check_interval   60
retry_check_interval    360
notification_interval   360

This means that it checks every hour.  After it fails the first time,
it will wait 6 hours to retry, if it is still not OK after 6 hours, it
notifies and begins to notify every 6 hours until it's ok.

On Nov 13, 2007 3:00 PM, Rich Sasko [EMAIL PROTECTED] wrote:
 It only notifies after it enters a hard state which is usually after the
 third try.

 Richard Sasko
 Niagara Lasalle Corp
 Phone: (219) 853-6272
 Mobile: (219) 484-5617
 E-mail: [EMAIL PROTECTED]





[Nagios-users] Checkhttp string

2007-11-12 Thread Jerad Riggin
How do I go about checking a specific page in a site?  I am able to
successfully check a string on the default document for a site but if
I specify for example index2.aspx in the host part of checkhttp, it
throws an error.
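
A small sketch of how a page URL could be split between check_http's host and URI options. It assumes the plugin's -H (host name), -u (URI path) and -s (expected string) options; the URL, the search string and the helper name are made up for illustration.

```python
from urllib.parse import urlsplit


def check_http_args(url, expected_string):
    """Split a page URL into the pieces check_http takes separately."""
    parts = urlsplit(url)
    # The host name goes to -H and the page path to -u; putting the page
    # name into the host part is what this sketch is meant to avoid.
    path = parts.path or "/"
    return ["check_http", "-H", parts.hostname, "-u", path, "-s", expected_string]


print(check_http_args("http://www.example.com/index2.aspx", "Welcome"))
```

In a command definition the same split would mean passing the page (for example /index2.aspx) as its own argument to -u rather than appending it to the host.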



[Nagios-users] Nagios time format

2007-11-12 Thread Jerad Riggin
Is there a way to change the time format that nagios uses?  CST would
make the logs easier to read.
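
Entries in nagios.log start with a Unix epoch timestamp in square brackets. Rather than changing how Nagios writes the file, one option is to convert the stamps when reading it. The Python sketch below is one way to do that, under the assumption that a fixed UTC-6 offset is acceptable for CST (it ignores daylight saving time); the sample log line is made up.

```python
import re
from datetime import datetime, timedelta, timezone

# Fixed UTC-6 offset for Central Standard Time; it does not follow
# daylight saving changes.
CST = timezone(timedelta(hours=-6), "CST")
TIMESTAMP = re.compile(r"^\[(\d+)\]")


def readable(line, tz=CST):
    """Replace a leading [epoch] stamp with a human-readable time."""
    match = TIMESTAMP.match(line)
    if match is None:
        return line  # leave lines without a timestamp untouched
    stamp = datetime.fromtimestamp(int(match.group(1)), tz)
    return stamp.strftime("[%Y-%m-%d %H:%M:%S %Z]") + line[match.end():]


# Made-up log line; the host and service names are placeholders.
print(readable("[1194883200] SERVICE ALERT: web01;HTTP;CRITICAL;HARD;1;string not found"))
```

The same function can be applied line by line to a copy of the log file, leaving the original untouched for Nagios and its CGIs.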



[Nagios-users] Service scheduling

2007-11-09 Thread Jerad Riggin
I have a checkstring service that kicks off every 10 minutes.  After
failure it will retry every minute.  We're trying to track
availability on some of our servers so this is its purpose.  However I
noticed this morning that a host went down, I received a PING 100%
packet loss alert, and shortly after within the same minute an alert
that it couldn't find the HTTP string.  However, by 7:50 PING was ok,
and HTTP didn't return ok until 8:00 AM.  If it's retrying every
minute (or should be), why is there such a large gap?



Re: [Nagios-users] Service scheduling

2007-11-09 Thread Jerad Riggin
I currently have the normal check at 10 minutes, max attempts at 1,
and retry intervals at 1.

On Nov 9, 2007 9:36 AM, Giles Coochey [EMAIL PROTECTED] wrote:

 
 

 Normal Behaviour:

 Scheduling During Problems

 So what happens when there are problems with a service? Well, one of the
 things that happens is the service check scheduling changes. If you've
 configured the max_attempts option of the service definition to be
 something greater than 1, Nagios will recheck the service before
 deciding that a real problem exists. While the service is being
 rechecked (up to max_attempts times) it is considered to be in a soft
 state (as described here) and the service checks are rescheduled at a
 frequency determined by the retry_interval option.

 If Nagios rechecks the service max_attempts times and it is still in a
 non-OK state, Nagios will put the service into a hard state, send out
 notifications to contacts (if applicable), and start rescheduling future
 checks of the service at a frequency determined by the check_interval
 option.

 http://nagios.sourceforge.net/docs/2_0/checkscheduling.html





Re: [Nagios-users] Service scheduling

2007-11-09 Thread Jerad Riggin
Thanks for the clarification.

On Nov 9, 2007 9:50 AM, Marc Powell [EMAIL PROTECTED] wrote:


  -Original Message-
  From: [EMAIL PROTECTED] [mailto:nagios-users-
  [EMAIL PROTECTED] On Behalf Of Jerad Riggin
  Sent: Friday, November 09, 2007 9:44 AM
  To: Giles Coochey
  Cc: nagios-users@lists.sourceforge.net
  Subject: Re: [Nagios-users] Service scheduling
 
  I currently have the normal check at 10 minutes, max attempts at 1,
  and retry intervals at 1.

 This means that nagios will check the service every 10 minutes until
 there is a problem, then it will perform 1 check 1 minute later and send
 a notification if it's still in a non-OK state. All future checks return
 to the 10 minute interval, even if it's still in a non-OK state.
 retry_check_interval is only used up to max_check_attempts.

 --
 Marc
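
 A minimal model of the schedule described in the documentation quoted
 earlier in this thread, assuming the service stays non-OK and nothing else
 (such as the host going down) interferes. The function name and the
 SOFT/HARD labels are illustrative only. Note that in this model, with
 max_check_attempts set to 1, the first failure already counts as the final
 attempt, so there is no one-minute retry at all; whether a given Nagios
 version behaves exactly this way is not confirmed here.

```python
def check_times(failure_at, max_check_attempts, normal_interval,
                retry_interval, count):
    """Return (minute, state_type) for `count` checks from the first failure.

    Assumes the service never recovers during the simulated period.
    """
    schedule = []
    minute = failure_at
    attempt = 1
    for _ in range(count):
        hard = attempt >= max_check_attempts
        schedule.append((minute, "HARD" if hard else "SOFT"))
        if hard:
            # Once hard, checks fall back to the normal interval.
            minute += normal_interval
        else:
            # Still soft: recheck at the retry interval.
            minute += retry_interval
            attempt += 1
    return schedule


# Settings from this thread: 10-minute checks, 1-minute retries, 1 attempt.
print(check_times(0, 1, 10, 1, 3))
# The same intervals with 3 attempts, for comparison.
print(check_times(0, 3, 10, 1, 4))
```

 Either way, once the hard state is reached the next look at the service is
 a full normal interval away, which is consistent with the 7:50 to 8:00 gap
 in the original question.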



[Nagios-users] Availability reports

2007-11-05 Thread Jerad Riggin
I have two sites being monitored with checkhttp and a string.  When I do
an availability report for today it shows as 100% availability,
however if I go further in and click on the host, it shows 95.4% Time
OK with 4.5% Time Critical.  Why is this downtime not showing in the
availability report?



Re: [Nagios-users] Availability reports

2007-11-05 Thread Jerad Riggin
I think the issue was that the checkstring is a service, but the host
wasn't down (it could still ping it).  So it considered the host up
but the service down.  I created a service group so we could check
availability based on the checkstring service.

On Nov 5, 2007 10:56 AM, Sebastian Ries [EMAIL PROTECTED] wrote:
 Hi


  I have two sites being monitored with checkhttp  a string.  When I do
  an availability report for today it shows as 100% availability,
  however if I go further in and click on the host, it shows 95.4% Time
  OK with 4.5% Time Critical.  Why is this downtime not showing in the
  availability report?

 Is the availability report based on the service checks?
 Nagios does not check services as long as it knows the host is down.

 So it could be that EVERY service check returned OK (100%) because as
 the host was down the server hasn't been checked...

 Regards
 Sebastian Ries

 --
 
 DT Netsolution GmbH -  Talaeckerstr. 30 -  D-70437 Stuttgart
 Tel: +49-711-849910-36   Fax: +49-711-849910-936
 WEB: http://www.dtnet.de/ email: [EMAIL PROTECTED]



Re: [Nagios-users] Service checks

2007-11-02 Thread Jerad Riggin
In that case, is there a way to specify more than one possible string, so
that, for example, if it can't find "Home" but can find "Checkout", it
considers the host up?

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

 Yes, you have to use a separate one for each site; how else would you be
 able to use a separate check? Unless you expect the word Home on each
 site and that is your string check. Another option is using macros, but I
 suspect this may not do what you want...

 -h

 Hari Sekhon




Re: [Nagios-users] Service checks

2007-11-02 Thread Jerad Riggin
I've read the help docs; like I said, I have a working installation checking
about 15 servers.  I have a PING service that pings a host group.  I can't
do that with this because I'm checking a different string on each site, so
I'm guessing it has to be separated out.  Does this make sense?

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

 You need to reread the docs, this is the most basic of questions.

 http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#service

 Hint: You are missing a service_description in the last block for one.
 You should not have name in that last block either, you need host_name.

 Read docs pls.

 -h

 Hari Sekhon




[Nagios-users] Service checks

2007-11-02 Thread Jerad Riggin
I have a functioning nagios setup but I have a quick question.  I am going
through and adding website string checks so we can keep track of
availability on one of our webservers.

So in services.cfg I have

define service{
        name                            generic-service ; Generic service name
        active_checks_enabled           1       ; Active service checks are enabled
        passive_checks_enabled          1       ; Passive service checks are enabled/accepted
        parallelize_check               1       ; Active service checks should be parallelized (Don't disable)
        obsess_over_service             1       ; We should obsess over this service (if necessary)
        check_freshness                 0       ; Default is to NOT check service 'freshness'
        notifications_enabled           1       ; Service notifications are enabled
        event_handler_enabled           1       ; Service event handler is enabled
        flap_detection_enabled          1       ; Flap detection is enabled
        process_perf_data               1       ; Process performance data
        retain_status_information       1       ; Retain status information across program restarts
        retain_nonstatus_information    1       ; Retain non-status information across program restarts
        register                        0       ; DONT REGISTER THIS DEFINITION - NOT A REAL SERVICE, JUST A TEMPLATE!
        }

define service{
        use                             generic-service
        name                            basic-service
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              5
        normal_check_interval           3
        retry_check_interval            1
        notification_interval           15
        notification_period             24x7
        register                        0
        }

I then have as just one example:

define service{
        use                             basic-service
        name                            check-site4
        notification_options            w,u,c,r
        check_command                   check_http!site.com!20!Home
        register                        0
        }


My question is this: you'll notice that I have the name as check-site4, and
then later on in services.cfg I call up that check-site4.  Is this the
correct way?  Do I need to define a service for each host and then later on
call it by name to execute the service check?  Is this a bad way of going
about it?

Thanks,

Jerad

Re: [Nagios-users] Service checks

2007-11-02 Thread Jerad Riggin
I think we're losing something in translation as I try to explain what
I'm doing.  Here is what I've been doing.

define service{
        use                             basic-service
        name                            check-site4
        notification_options            w,u,c,r
        check_command                   check_http!site.com!20!Home
        register                        0
        }

and then later on in the config file


define service{
        use                             check-site4
        service_description             HTTP
        contact_groups                  mis
        host_name                       site.com
        }
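
For readers following the thread, here is a rough Python sketch of how the "use" chain in these definitions gets flattened into one effective service. It is a simplified model, not Nagios's own code: the resolve helper and the dictionary layout are made up for illustration, the generic-service entry is abridged, and it assumes (as I understand the template documentation) that local directives override inherited ones and that name and register are not handed down to the object using a template.

```python
# Simplified, hypothetical model of Nagios template inheritance.
TEMPLATES = {
    "generic-service": {"notifications_enabled": "1", "register": "0"},  # abridged
    "basic-service": {
        "use": "generic-service",
        "max_check_attempts": "5",
        "normal_check_interval": "3",
        "retry_check_interval": "1",
        "register": "0",
    },
    "check-site4": {
        "use": "basic-service",
        "notification_options": "w,u,c,r",
        "check_command": "check_http!site.com!20!Home",
        "register": "0",
    },
}

# Directives that describe the template itself and are not handed down
# (an assumption of this sketch).
NOT_INHERITED = {"name", "use", "register"}


def resolve(definition, templates):
    """Return the effective directives for one object definition."""
    parent_name = definition.get("use")
    inherited = {}
    if parent_name is not None:
        parent = resolve(templates[parent_name], templates)
        inherited = {k: v for k, v in parent.items() if k not in NOT_INHERITED}
    effective = dict(inherited)
    # Local directives win over anything inherited.
    effective.update({k: v for k, v in definition.items() if k != "use"})
    return effective


service = {
    "use": "check-site4",
    "service_description": "HTTP",
    "contact_groups": "mis",
    "host_name": "site.com",
}

effective = resolve(service, TEMPLATES)
for key in sorted(effective):
    print(f"{key:24} {effective[key]}")
```

In this model the check-site4 template contributes only notification_options and check_command, which is why the quoted advice to put those two lines straight into the final block (with "use basic-service") gives the same result with one block fewer per site.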


On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

 If you had read the docs like I said, I can't see how you could miss the
 fact that you have a service definition without a host_name or
 hostgroup_name!

 Where is this service check going to run against if you haven't told it
 which host you want to test?

 I even gave you the anchored link to the exact place where it shows you
 the definitions that are needed for that block...

 -h

 Hari Sekhon



 Jerad Riggin wrote:
  Ok, so for example
 
  define service{
  use basic-service
  notification_options   w,u,c,r
  check_command check_http!site.com!20!Home
  Service description  CheckString
  }
 
  Sorry if this seems like a newbie question.  I'm just trying to
  backtrack and optimize the config before I get too far down the road.
 
 
  On 11/2/07, *Hari Sekhon* [EMAIL PROTECTED]
  mailto:[EMAIL PROTECTED] wrote:
 
  Also, you are making it more trouble than it needs to be. Instead of
  trying to do register 0 and making the last block a template block, just
  put the host name in there and the service description as I hinted, and
  that is all you need. You will then have the first 2 blocks plus one for
  each service on each host with different string checks...
 
  -h
 
  Hari Sekhon
 
 
 

Re: [Nagios-users] Service checks

2007-11-02 Thread Jerad Riggin
I wish I could, but I'm not a programmer and have limited experience with
Linux.  I usually just manage Windows servers.

Thanks for your help though.  I condensed the string check into one instead
of double/triple templating.  Thanks.

On 11/2/07, Hari Sekhon [EMAIL PROTECTED] wrote:

 I've just tried this, using the standard check_http plugin you are
 using. It appears not.

 As soon as one string is not found it goes critical.

 You could always write a custom plugin to test your websites, or much
 more easily a shell wrapper plugin to call the first, if it fails, call
 the second, and only if that fails go critical... not perfect though, a
 better/custom plugin would serve you better.
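
A custom plugin along the lines suggested here might look like the Python sketch below. It is only an illustration under stated assumptions: the script name check_http_any.py, the function names, and the 20-second timeout are hypothetical, and it fetches the page itself rather than wrapping check_http. The only Nagios-specific facts it relies on are the standard plugin exit codes (0 OK, 2 CRITICAL, 3 UNKNOWN).

```python
import urllib.request

# Standard Nagios plugin exit codes.
OK, CRITICAL, UNKNOWN = 0, 2, 3


def check_any_string(body, candidates):
    """Return (exit_code, message): OK if any candidate string is in body."""
    for text in candidates:
        if text in body:
            return OK, f"HTTP OK - found '{text}'"
    wanted = ", ".join(candidates)
    return CRITICAL, f"HTTP CRITICAL - none of [{wanted}] found"


def main(argv):
    """Usage: check_http_any.py URL STRING [STRING ...] (hypothetical name)."""
    if len(argv) < 3:
        print("UNKNOWN - usage: check_http_any.py URL STRING [STRING ...]")
        return UNKNOWN
    url, candidates = argv[1], argv[2:]
    try:
        with urllib.request.urlopen(url, timeout=20) as response:
            body = response.read().decode("utf-8", errors="replace")
    except OSError as exc:  # covers URLError, HTTPError and socket timeouts
        print(f"HTTP CRITICAL - {exc}")
        return CRITICAL
    code, message = check_any_string(body, candidates)
    print(message)
    return code


# Offline demonstration of the matching logic on a sample page body.
print(check_any_string("<title>Checkout</title>", ["Home", "Checkout"]))
print(check_any_string("Error 500", ["Home", "Checkout"]))
```

To use something like this as a real plugin, one would add "import sys" and end the file with sys.exit(main(sys.argv)), then point a command definition at the script.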

 -h

 Hari Sekhon




Re: [Nagios-users] Downtime scheduling

2007-11-02 Thread Jerad Riggin
Thanks for the reply.  Here is what I did..


define timeperiod{
        timeperiod_name maintenance
        alias           Maintenance period
        sunday          04:00-01:00
        monday          04:00-01:00
        tuesday         04:00-01:00
        wednesday       04:00-01:00
        thursday        04:00-01:00
        friday          04:00-01:00
        saturday        04:00-01:00
        }

And on the host check template I put maintenance in as the
notification/check schedule.  Will this work?
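
One thing worth double-checking in the definition above: as far as I know, a Nagios time range is expected to run forward within a single day, so 04:00-01:00 may be rejected or may not behave as intended. The sample configs express "everything except a window" as two comma-separated ranges (for example 00:00-09:00,17:00-24:00). The Python sketch below only does that arithmetic for a 01:00-04:00 outage; the function name is made up for illustration.

```python
def outside_window(start, end):
    """Return Nagios-style ranges covering the day except start-end.

    start and end are zero-padded "HH:MM" strings with start before end.
    """
    # Zero-padded times compare correctly as plain strings.
    if not "00:00" <= start < end <= "24:00":
        raise ValueError("expected 00:00 <= start < end <= 24:00")
    ranges = []
    if start != "00:00":
        ranges.append(f"00:00-{start}")
    if end != "24:00":
        ranges.append(f"{end}-24:00")
    return ",".join(ranges)


# Print one timeperiod line per weekday for a 01:00-04:00 outage.
for day in ("sunday", "monday", "tuesday", "wednesday",
            "thursday", "friday", "saturday"):
    print(f"        {day:<16}{outside_window('01:00', '04:00')}")
```

Each printed line has the form "sunday 00:00-01:00,04:00-24:00" and could replace the corresponding line in the timeperiod, assuming the goal is to skip checks or notifications only between 1 and 4 AM.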


On 11/2/07, Larry Low [EMAIL PROTECTED] wrote:

  Timeperiods is the best idea.  You can either use the notification_period
 or check_period depending on your end needs.



 

 Larry Low

 4150 N Drinkwater Blvd., 5th Floor

 Scottsdale, AZ  85251

 Office: 480.385.7045

 E-mail: [EMAIL PROTECTED]'



 Telesphere Networks, Inc



 *From:* [EMAIL PROTECTED] [mailto:
 [EMAIL PROTECTED] *On Behalf Of *Jerad Riggin
 *Sent:* Friday, November 02, 2007 11:59 AM
 *To:* nagios-users@lists.sourceforge.net
 *Subject:* [Nagios-users] Downtime scheduling



 What is the best way to allow for a 1-4AM outage on a few hosts?  A cron
 job that will schedule it every night, or through time periods?


[Nagios-users] Nagios e-mail reports

2007-07-24 Thread Jerad Riggin

Last night I got a few warning/critical errors with a few recovery e-mails.
However towards this morning I received 2 critical alerts and even though
they recovered I didn't receive a recovery e-mail.  Any ideas?

Jerad

Re: [Nagios-users] Nagios e-mail reports

2007-07-24 Thread Jerad Riggin

Now that I look at it, it looks like the down e-mails that didn't have
corresponding recovery e-mails were delayed for some reason (by about 15
minutes).  So perhaps I'm just reading this out of order.  I have no clue
why that would happen.

On 7/24/07, Jeremy C. Reed [EMAIL PROTECTED] wrote:


On Tue, 24 Jul 2007, Jerad Riggin wrote:

 Last night I got a few warning/critical errors with a few recovery e-mails.
 However towards this morning I received 2 critical alerts and even though
 they recovered I didn't receive a recovery e-mail.  Any ideas?

What do your nagios logs say about it?

Jeremy C. Reed


Re: [Nagios-users] Recurring Downtime

2007-06-28 Thread Jerad Riggin

What about this one?

http://www.nagiosexchange.org/Utilities.16.0.html?tx_netnagext_pi1[p_view]=893


On 6/28/07, Moore, Nicholas [EMAIL PROTECTED] wrote:


 New Nagios user here (2.9).  I have a server that restarts every day.  I
want to schedule downtime for a server every day at the same time.  I know
that I can't do it with the basic nagios package.  I found an add-on on
nagios exchange. (
http://www.nagiosexchange.org/Downtimes.38.0.html?tx_netnagext_pi1%5Bp_view%5D=363)
The new version of the add-on is supposed to work with 2.x.  Does anyone
here use it and have any suggestions for me?  Is there another way to
accomplish what I want to do?



Thanks,

Nick


[Nagios-users] Notifications

2007-06-27 Thread Jerad Riggin

Does Nagios only send out notifications once it has reached the max retry on
services?  If so, is the only way to make it automatically notify you as
soon as there is a problem to set the max retry to 1?

Thanks

Jerad

[Nagios-users] PNP

2007-06-27 Thread Jerad Riggin

I'm trying to get the PNP stuff set up so I can create some graphs for the
disk usage of my servers.  I have it installed and configured correctly as
far as I can tell.  However, no performance data is being generated in the
perfdata folder.  What are some things I can look for?

Thanks

Re: [Nagios-users] Windows monitoring

2007-06-26 Thread Jerad Riggin

Sounds good...  What is the default command definition for check_nt?

On 6/26/07, Frost, Mark {PBG} [EMAIL PROTECTED] wrote:


 That's what I've done.  My understanding in Nagios is that a config file
is a config file.  I've never tried it, but I believe you could stuff your
entire config (hosts, services, checkcommands, notification commands) into a
single file.  You'd be nuts given how incredibly complex Nagios configs can
get, but you could.  By the same token, you could break them out into as
many individual files as you like as well as long as your nagios.cfg lists
all the files it's supposed to read.

I did as you indicated -- keep my hosts (windows or otherwise) in my
hosts.cfg file.  I tend to sort my config files by application rather than
OS type so I put the service entries in files/directories with still other
names.

Mark

 --
*From:* Jerad Riggin [mailto:[EMAIL PROTECTED]
*Sent:* Tuesday, June 26, 2007 9:53 AM
*To:* Frost, Mark {PBG}
*Cc:* nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] Windows monitoring


 Thanks for the info.  So for example, it wants me to define new hosts in
the windows.cfg.  Is there a point to that?  Can I just put service
definitions in the windows.cfg and utilize the hosts I already have
defined in hosts.cfg ?

On 6/26/07, Frost, Mark {PBG} [EMAIL PROTECTED] wrote:

  Jerad,

 I took this to mean that it's a sort of generic/example config file.
 You can always break out your configuration into additional
 files/directories as makes sense for your installation as long as you add
 new lines in the nagios.cfg file to tell it to read each new file and/or
 directory you create.

 I found a few things in that 3.0 doc for monitoring with NSClient++
 that weren't correct, so you will find yourself making some modifications to
 get things to work if you've completely copied entries from that doc page.

 Mark

  --
 *From:* [EMAIL PROTECTED] [mailto:
 [EMAIL PROTECTED] *On Behalf Of *Jerad Riggin
 *Sent:* Tuesday, June 26, 2007 9:16 AM
 *To:* nagios-users@lists.sourceforge.net
 *Subject:* [Nagios-users] Windows monitoring

  I'm going through this article: 
http://nagios.sourceforge.net/docs/3_0/monitoring-windows.html


 It is referring to a windows.cfg within nagios.cfg.  I don't see any
 such commented out line in my nagios.cfg, and I can't even find a
 windows.cfg at all.  Is the difference that I am running 2.9 and this is
 referring to 3.0?




[Nagios-users] Monitoring Windows Services

2007-06-26 Thread Jerad Riggin

I feel like I'm spamming this mailing list.  So far I'm monitoring a few
processes and services on a Windows 2003 Server.  Would there be a way to have
Nagios report if any automatic service fails?  Or do I need to go through
and define each service?  Let me know if you need more information.

Re: [Nagios-users] Nagios HTTP monitoring

2007-06-25 Thread Jerad Riggin

Any idea why this -a switch isn't working?

If I do this in a command prompt it works fine and returns an OK:

./check_http -S -H url.com -a username:password

The same command in nagios won't work (unauthorized).


On 6/22/07, Jerad Riggin [EMAIL PROTECTED] wrote:


It's still giving me an unauthorized message.  I've verified the
credentials work for me to get in through a browser.  Is there something
else I could be missing?

On 6/22/07, GJP [EMAIL PROTECTED] wrote:

 Jerad,

 Not entirely, because you feed the arguments to the actual command with
 $ARGn$.

 So in the service definition:

 check_http!site.com!5!admin:password

 And the command_line format in the command definition (probably
 in /etc/nagios-plugins/config/http.cfg) should be:

 command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -t $ARG2$ -a $ARG3$

 Be careful with that -t 5, btw. That is a very low timeout value.
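
To make the "!" splitting concrete, here is a small Python model of the substitution described above. It is a sketch, not Nagios's macro code: the expand helper is invented for this note, and it assumes that an argument which was not supplied expands to an empty string.

```python
import re


def expand(command_line, check_command):
    """Substitute $ARG1$, $ARG2$, ... from a service's check_command."""
    # Everything before the first "!" is the command name; the rest are args.
    _name, *args = check_command.split("!")

    def lookup(match):
        index = int(match.group(1)) - 1
        # Assumption: an argument that was not supplied expands to nothing.
        return args[index] if 0 <= index < len(args) else ""

    return re.sub(r"\$ARG(\d+)\$", lookup, command_line)


TEMPLATE = "check_http -H $ARG1$ -t $ARG2$ -a $ARG3$"

# Arguments separated with "!": each one lands in its own $ARGn$.
print(expand(TEMPLATE, "check_http!site.com!5!admin:password"))

# Switches crammed into one argument: everything lands in $ARG1$.
print(expand(TEMPLATE, "check_http!-H site.com -t 5 -a admin:password"))
```

The second call shows the failure mode from earlier in the thread: the whole string becomes $ARG1$, so the resulting command line starts with "-H -H site.com" and the -t and -a switches from the command definition end up with nothing after them.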

 Hope it helps.

 Regards,

 Gerard.

 On Friday 22 June 2007 19:17:29 Jerad Riggin wrote:
  So check_http!-H site.com -t 5 -a admin:password
 
  should be
 
  check_http!-H site.com!-t 5!-a admin:password?
 
  On 6/22/07, GJP [EMAIL PROTECTED]  wrote:
   Jerad,
  
   I didn't understand that either. That's why I customized the check_http
   command definition. That way (in my previous example) you can have your
   http checks from any Nagios client. Useful, for instance, for off-site
   checks.

   Regarding the -a switch: the -a $ARG1$ (or whatever) should be in your
   command definition, and your parameters, as you 'feed' them to the command
   in your service definition, are separated with !.
  
   Regards,
  
   Gerard.
  
   On Friday 22 June 2007 16:59:30 Jerad Riggin wrote:
    If I change the hosts.cfg from the IP of the server (10.10.10.7) to the
    URL I'm specifying in services.cfg, it works fine.  The default site for
    10.10.10.7 is a CRM site which requires basic auth.  So even though I'm
    specifying a URL in services.cfg, it looks like it's being overridden by
    the hosts.cfg so it can't auth.  What is the point of the -H switch in
    services.cfg if it always uses the IP for the host in hosts.cfg?  I tried
    that format (-a admin:password) at the end of the check in services.cfg
    but it still failed with unauthorized.
   
On 6/22/07, Marc Powell [EMAIL PROTECTED] wrote:
      -Original Message-
      From: [EMAIL PROTECTED] [mailto:nagios-users-[EMAIL PROTECTED] On Behalf Of Jerad Riggin
      Sent: Friday, June 22, 2007 8:41 AM
      To: GJP
      Cc: nagios-users@lists.sourceforge.net
      Subject: Re: [Nagios-users] Nagios HTTP monitoring

      I have a check_http command that works perfectly at the command prompt:

      ./check_http -H lagarde -t 5 -s LaGarde Intranet site

      However in nagios, check_http!-H lagarde -t 5 -s LaGarde Intranet site
      still returns an unauthorized.  So basically why would it work in a
      command prompt but not in nagios?

     Perhaps some difference in environment? Did you test as the nagios user?
     Is that the _exact_ way it's defined in the config?

      And what is the syntax for the -a
      (basic auth) switch?  Thanks again for everyone's help.

     -a username:password.

 --
 Marc
  
  

Re: [Nagios-users] Nagios HTTP monitoring

2007-06-22 Thread Jerad Riggin

Where do I use the --help switch?  I've tried in the terminal but I get a
command not found error.  Thanks for the links btw.

On 6/22/07, GJP [EMAIL PROTECTED] wrote:


Yes, of course there is. This one is for Samba: www.samba.org

And like Patrick said, if you run check_http -h it tells you what it can
do. And if you're on a common Linux distro, you can find files with:

locate [filename]

Then I would also check:

http://nagios.sourceforge.net/docs/2_0/beginners.html
http://nagios.sourceforge.net/docs/3_0/monitoring-publicservices.html

And I would take your time with all this. I have over 8 years of Linux
experience, and it still took me three weeks to get a full-blown, secure,
SMS-sending Nagios setup.

Good luck!

G

On Thursday 21 June 2007 23:24:43 Jerad Riggin wrote:
 Is there any link to information about that?  Like I said this is
 basically the 2nd day I've used Linux and Nagios so much of this seems
 like a different language to me.  Thanks for the info.





[Nagios-users] Notifications

2007-06-22 Thread Jerad Riggin

Sorry if this question is confusing.

I'm monitoring the HTTP service on a server.  I have it set to notify on
w,u,c,r in services.cfg.  I turned off HTTP to trigger a warning state and I
successfully received an e-mail.  I turned HTTP back on and successfully
received a recovery notification.  I then triggered another warning and never
received an e-mail, and no notification was listed in nagios' web
interface.  Any ideas?

Re: [Nagios-users] Nagios HTTP monitoring

2007-06-22 Thread Jerad Riggin

If I change the hosts.cfg from the IP of the server (10.10.10.7) to the URL
I'm specifying in services.cfg, it works fine.  The default site for
10.10.10.7 is a CRM site which requires basic auth.  So even though I'm
specifying a URL in services.cfg, it looks like it's being overridden by the
hosts.cfg so it can't auth.  What is the point of the -H switch in
services.cfg if it always uses the IP for the host in hosts.cfg?  I tried
that format (-a admin:password) at the end of the check in services.cfg but
it still failed with unauthorized.

On 6/22/07, Marc Powell [EMAIL PROTECTED] wrote:




 -Original Message-
 From: [EMAIL PROTECTED] [mailto:nagios-users-
 [EMAIL PROTECTED] On Behalf Of Jerad Riggin
 Sent: Friday, June 22, 2007 8:41 AM
 To: GJP
 Cc: nagios-users@lists.sourceforge.net
 Subject: Re: [Nagios-users] Nagios HTTP monitoring

 I have a check_http command that works perfectly at the command prompt

 ./check_http -H lagarde -t 5 -s LaGarde Intranet site

 However in nagios, check_http!-H lagarde -t 5 -s LaGarde Intranet site
 still returns an unauthorized.  So basically why would it work in a
 command prompt but not in nagios?

Perhaps some difference in environment? Did you test as the nagios user?
Is that the _exact_ way it's defined in the config?

 And what is the syntax for the -a
 (basic auth) switch?  Thanks again for everyone's help.

-a username:password.

--
Marc


Re: [Nagios-users] Notifications

2007-06-22 Thread Jerad Riggin

Contacts.cfg

define contact{
   contact_name                    mis
   alias                           mis
   host_notification_period        24x7
   service_notification_period     24x7
   host_notification_options       d,r,u,f
   service_notification_options    c,r,w,f
   host_notification_commands      host-notify-by-email
   service_notification_commands   notify-by-email
   email                           [EMAIL PROTECTED] (edited)
   }


Services.cfg

# Generic service definition template
define service{
   name                            generic-service ; Generic service name
   active_checks_enabled           1   ; Active service checks are enabled
   passive_checks_enabled          1   ; Passive service checks are enabled/accepted
   parallelize_check               1   ; Active service checks should be parallelized (Don't disable)
   obsess_over_service             1   ; We should obsess over this service (if necessary)
   check_freshness                 0   ; Default is to NOT check service 'freshness'
   notifications_enabled           1   ; Service notifications are enabled
   event_handler_enabled           1   ; Service event handler is enabled
   flap_detection_enabled          1   ; Flap detection is enabled
   process_perf_data               1   ; Process performance data
   retain_status_information       1   ; Retain status information across program restarts
   retain_nonstatus_information    1   ; Retain non-status information across program restarts
   register                        0   ; DONT REGISTER THIS DEFINITION - NOT A REAL SERVICE, JUST A TEMPLATE!
   }

# Generic for all services
define service{
   use                     generic-service
   name                    basic-service
   is_volatile             0
   check_period            24x7
   max_check_attempts      5
   normal_check_interval   5
   retry_check_interval    1
   notification_interval   5
   notification_period     24x7
   register                0
   }

define service{
   use                     basic-service
   name                    ping-service
   notification_options    n
   check_command           check_ping!1000.0,20%!2000.0,60%
   register                0
   }

define service{
   use                     basic-service
   name                    check-http
   notification_options    n
   check_command           check_http
   register                0
   }

define service{
   use                     basic-service
   name                    check-site
   notification_options    w,u,c,r
   check_command           check_http!-H lagarde.lagardeinc.com -t 5 -s LaGarde
   register                0
   }

define service{
   use                     basic-service
   name                    check-smtp
   notification_options    w,u,c,r
   check_command           check_smtp!-t 5 -e mail.domain.com
   register                0
   }


define service{
   use                     ping-service
   service_description     PING
   contact_groups          mishelpdesk
   hostgroup_name          basic-clients
#   host_name              one_client
   }

define service{
   use                     check-http
   service_description     HTTP
   contact_groups          mishelpdesk
   hostgroup_name          basic-clients
   }

define service{
   use                     check-site
   service_description     Site
   contact_groups          mishelpdesk
   host_name               lg02
   }

define service{
   use                     check-smtp
   service_description     SMTP
   contact_groups          mishelpdesk
   host_name               lg01
   }
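
One detail worth checking in the definitions above, offered as an observation rather than a diagnosis: Nagios filters notifications at both the service (notification_options) and the contact (service_notification_options), so a state has to be allowed by both before mail goes out. The check-site and check-smtp templates list w,u,c,r, but the mis contact lists c,r,w,f, so UNKNOWN states would be dropped at the contact. A minimal sketch of a contact whose filter also includes u, with every other value copied from the contact above:

```
define contact{
   contact_name                    mis
   alias                           mis
   host_notification_period        24x7
   service_notification_period     24x7
   host_notification_options       d,r,u,f
   service_notification_options    w,u,c,r,f   ; u added so UNKNOWN passes the contact filter too
   host_notification_commands      host-notify-by-email
   service_notification_commands   notify-by-email
   email                           [EMAIL PROTECTED] (edited)
   }
```

This would not by itself explain a WARNING or CRITICAL notification that never arrives. It also assumes the mishelpdesk contact group, whose definition is not shown here, actually contains the mis contact.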


On 6/22/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote:


 Can you post your definition and your template that it uses? It may have
something to do with your notification options.  Check your mail log and
nagios log to see if anything happened behind the scenes and post any funny
or suspicious output. Thanks.





Stephen Valdinger

MIS Helpdesk Coordinator

Dover Chemical Corporation

3676 Davis Rd NW

Dover, OH 44622

330-365-3622

[EMAIL PROTECTED]




  --

*From:* Jerad Riggin [mailto:[EMAIL PROTECTED]
*Sent:* Friday, June 22, 2007 10:56 AM
*To:* nagios-users@lists.sourceforge.net
*Subject:* [Nagios-users] Notifications



Sorry if this question is confusing.



I'm monitoring the HTTP service

Re: [Nagios-users] Notifications

2007-06-22 Thread Jerad Riggin

That's true on check-http, but on check-site for one of my servers it has
w,u,c,r and it worked once, but all alerts since then go unnotified.

Also, how do you check a specific URL for http?  It looks like nagios just
uses the address in hosts.cfg and ignores the -H site.com switch you add in
services.cfg.


On 6/22/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote:


 You are supplying a notification option of n inside the service
definition for http. Should it not be set to w,c,r, etc.? Try doing that and
triggering something to happen again.





Stephen Valdinger

MIS Helpdesk Coordinator

Dover Chemical Corporation

3676 Davis Rd NW

Dover, OH 44622

330-365-3622

[EMAIL PROTECTED]




 --

*From:* Jerad Riggin [mailto:[EMAIL PROTECTED]
*Sent:* Friday, June 22, 2007 11:54 AM
*To:* Valdinger, Stephen (DOV, MSX)
*Cc:* nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] Notifications




Re: [Nagios-users] Nagios HTTP monitoring {Disarmed} {Fraud?}

2007-06-22 Thread Jerad Riggin

commands.cfg

# 'check_http' command definition
define command{
   command_name    check_http
   command_line    $USER1$/check_http -H $HOSTADDRESS$ $ARG1$ -a $ARG2$
   }

Here is an entry in services.cfg

define service{
   use                     basic-service
   name                    check-site
   notification_options    w,u,c,r
   check_command           check_http!-H site.com -t 5 -a admin:password -s String
   register                0
   }
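
A sketch of how these two definitions combine, assuming standard Nagios macro handling: everything after the first ! becomes $ARG1$, and $ARG2$ is empty because there is no second !. The expanded line in the comments is worked out by hand from the definitions above, not taken from a log, and check_http_site is a hypothetical command name.

```
# check_command   check_http!-H site.com -t 5 -a admin:password -s String
#   $ARG1$ = -H site.com -t 5 -a admin:password -s String
#   $ARG2$ = (empty, there is no second "!")
#
# command_line    $USER1$/check_http -H $HOSTADDRESS$ $ARG1$ -a $ARG2$
# expands to roughly:
#   check_http -H 10.10.10.7 -H site.com -t 5 -a admin:password -s String -a
#
# That is two -H options plus a trailing -a with no value.

# Variant that passes one plain value per argument slot instead:
define command{
   command_name    check_http_site      ; hypothetical name
   command_line    $USER1$/check_http -H $ARG1$ -t $ARG2$ -a $ARG3$ -s "$ARG4$"
   }

define service{
   use                     basic-service
   name                    check-site
   notification_options    w,u,c,r
   check_command           check_http_site!site.com!5!admin:password!String
   register                0
   }
```

Keeping the switches in the command definition and only the values in the service definition makes it easier to see what the plugin is actually being asked to do.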


On 6/22/07, Marc Powell [EMAIL PROTECTED] wrote:




 -Original Message-
 From: Jerad Riggin [mailto:[EMAIL PROTECTED]
 Sent: Friday, June 22, 2007 10:00 AM
 To: Marc Powell
 Cc: nagios-users@lists.sourceforge.net
 Subject: Re: [Nagios-users] Nagios HTTP monitoring {Disarmed} {Fraud?}

 If I change the hosts.cfg from the IP of the server (10.10.10.7) to the
 URL I'm specifying in services.cfg, it works fine.  The default site for
 10.10.10.7 is a CRM site which requires basic auth.  So even though I'm
 specifying a URL in services.cfg, it looks like it's being overridden by
 the hosts.cfg so it can't auth.  What is the point of the -H switch in
 services.cfg if it always uses the IP for the host in hosts.cfg?  I
 tried that format (-a admin:password) at the end of the check in
 services.cfg but it still failed with unauthorized.

It's looking more and more clear that your command{} definition doesn't
do what you think it does. You don't usually specify switches such as -H
and -a as part of the service definition. Perhaps you should post the
host, service and relevant command definitions for us to look at. I know
for a fact that check_http can and will work the way you want if
properly called.

--
Marc


Re: [Nagios-users] Nagios HTTP monitoring

2007-06-22 Thread Jerad Riggin

So check_http!-H site.com -t 5 -a admin:password

should be

check_http!-H site.com!-t 5!-a admin:password?


On 6/22/07, GJP [EMAIL PROTECTED] wrote:


Jerad,

I didn't understand that either. That's why I customized the check_http
command definition. That way (as in my previous example) you can run your
http checks from any Nagios client. Useful, for instance, for off-site
checks.

Regarding the -a switch: the -a $ARG1$ (or whatever) should be in your
command definition, and the parameters you 'feed' to the command go in your
service definition, separated with !.

Regards,

Gerard.

On Friday 22 June 2007 16:59:30 Jerad Riggin wrote:
 If I change the hosts.cfg from the IP of the server (10.10.10.7) to the
 URL I'm specifying in services.cfg, it works fine.  The default site for
 10.10.10.7 is a CRM site which requires basic auth.  So even though I'm
 specifying a URL in services.cfg, it looks like it's being overridden by
 the hosts.cfg so it can't auth.  What is the point of the -H switch in
 services.cfg if it always uses the IP for the host in hosts.cfg?  I
 tried that format (-a admin:password) at the end of the check in
 services.cfg but it still failed with unauthorized.


Re: [Nagios-users] Notifications

2007-06-22 Thread Jerad Riggin

I'm running 2.9.

The problem is that the server I'm monitoring uses host headers, so multiple
sites are on 10.10.10.7.  The default site for *.7 is a Microsoft CRM site
which requires basic authentication.  So even though I specify
-H lagarde.lagardeinc.com in services.cfg, it seems to just hit the IP listed
for the host, which hits Microsoft CRM, and nagios reports unauthorized.
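
For a server that uses host headers, check_http has separate options for where to connect (-I, the IP address) and which site to ask for (-H, the host name sent in the request). A minimal sketch of a command that uses both, assuming lg02 is the host whose address in hosts.cfg is 10.10.10.7; check_http_vhost is a hypothetical name, and the site name and search string are taken from the check-site template posted earlier in this thread:

```
define command{
   command_name    check_http_vhost     ; hypothetical name
   # -I: address to connect to (from hosts.cfg)
   # -H: virtual host name to request on that address
   command_line    $USER1$/check_http -I $HOSTADDRESS$ -H $ARG1$ -t $ARG2$ -s "$ARG3$"
   }

define service{
   use                     basic-service
   host_name               lg02
   service_description     Site
   contact_groups          mishelpdesk
   notification_options    w,u,c,r
   check_command           check_http_vhost!lagarde.lagardeinc.com!5!LaGarde
   }
```

With this shape the address in hosts.cfg can stay an IP, and each site on the server gets its own service that differs only in the arguments after the !.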


On 6/22/07, Valdinger, Stephen (DOV, MSX) [EMAIL PROTECTED] wrote:


 http://nagios.sourceforge.net/docs/3_0/monitoring-publicservices.html

[Snippet]

A more advanced definition for monitoring the HTTP service is shown below.
This service definition will check to see if the /download/index.php URI
contains the string latest-version.tar.gz. It will produce an error if
the string isn't found, the URI isn't valid, or the web server takes longer
than 5 seconds to respond.

define service{
        use                     generic-service         ; Inherit default values from a template
        host_name               remotehost
        service_description     Product Download Link
        check_command           check_http!-u /download/index.php -t 5 -s latest-version.tar.gz
        }



[/Snippet]



As you can see, supplying the -u switch will allow you to choose a
particular page, and you can then use the -s switch to search for specific
text content on that page.



The problem with notifications eludes me, but I know I read about it in
the documentation somewhere. Also, what version are you using?



Stephen Valdinger

MIS Helpdesk Coordinator

Dover Chemical Corporation

3676 Davis Rd NW

Dover, OH 44622

330-365-3622

[EMAIL PROTECTED]




 --

*From:* Jerad Riggin [mailto:[EMAIL PROTECTED]
*Sent:* Friday, June 22, 2007 12:03 PM
*To:* Valdinger, Stephen (DOV, MSX)
*Cc:* nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] Notifications



That's true on check-http, but on check-site for one of my servers it has
w,u,c,r and it worked once, but all alerts since then go unnotified.



Also, how do you check a specific URL for http?  It looks like nagios just
uses the address in hosts.cfg and ignores the -H site.com switch you add
in services.cfg.



On 6/22/07, *Valdinger, Stephen (DOV, MSX)* [EMAIL PROTECTED]
wrote:

You are supplying a notification option of n inside the service
definition for http. Should it not be set for w,c,r etc? try doing that and
triggering something to happen again.





Stephen Valdinger

MIS Helpdesk Coordinator

Dover Chemical Corporation

3676 Davis Rd NW

Dover, OH 44622

330-365-3622

[EMAIL PROTECTED]




 --

*From:* Jerad Riggin [mailto: [EMAIL PROTECTED]
*Sent:* Friday, June 22, 2007 11:54 AM
*To:* Valdinger, Stephen (DOV, MSX)
*Cc:* nagios-users@lists.sourceforge.net
*Subject:* Re: [Nagios-users] Notifications




Re: [Nagios-users] Nagios HTTP monitoring

2007-06-22 Thread Jerad Riggin

It's still giving me an unauthorized message.  I've verified the credentials
work for me to get in through a browser.  Is there something else I could be
missing?

On 6/22/07, GJP [EMAIL PROTECTED] wrote:


Jerad,

Not entirely, because you feed the arguments to the actual command with
$ARGn$.

So in the service definition:

check_http!site.com!5!admin:password

And the command_line format in the command definition (probably
in /etc/nagios-plugins/config/http.cfg) should be:

command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -t $ARG2$ -a $ARG3$

Be careful with that -t 5, by the way. That is a very low timeout value.

Hope it helps.

Regards,

Gerard.

On Friday 22 June 2007 19:17:29 Jerad Riggin wrote:
 So check_http!-H site.com -t 5 -a admin:password

 should be

 check_http!-H site.com!-t 5!-a admin:password?


Re: [Nagios-users] Nagios HTTP monitoring

2007-06-21 Thread Jerad Riggin

Is there any link to information about that?  Like I said this is basically
the 2nd day I've used Linux and Nagios so much of this seems like a
different language to me.  Thanks for the info.

On 6/21/07, GJP [EMAIL PROTECTED] wrote:


Hi Jerad,

If you mean http auth, this is what I use.

The check_command in the service definition:

check_command    check_http_auth!host.example.com!/path!user:password

And the command definition.

define command{
   command_name    check_http_auth
   command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -u $ARG2$ -a $ARG3$
   }

But if you mean authentication within a Windows AD domain, maybe Samba can
help, because then you can make your Linux server a member of the AD
domain.

Regards,

Gerard.

On Thursday 21 June 2007 22:48:08 Jerad Riggin wrote:
 I'm new to Linux and new to Nagios so this may sound very stupid.
 Basically I have everything up and running, ping tests work, http tests
 work, and the smtp test I have setup works.  If the host goes down I get
 a host down alert and a host up alert once the problem is resolved.  Is
 there a way to have Nagios send me an alert for critical status?  I'm
 pretty sure I have that setup but it isn't working.  After this I went
 ahead and set up basic HTTP monitoring for all of our servers, however,
 403 errors are not sending any notifications to me.  Also, I'm trying to
 monitor the CRM site on our windows domain however since the Linux box is
 obviously not on the domain, it can't authenticate when hitting the site
 so it can't find the string i've specified.  Any ideas on how to make
 nagios authenticate and then check the string?

 Thanks




Re: [Nagios-users] Nagios HTTP monitoring

2007-06-21 Thread Jerad Riggin

More importantly, can I just specify a URL to check?  The default site on
the server in question directs you to CRM which requires auth.  However on
the same server we have an intranet site which doesn't require
authorization.  How would I go about just checking the intranet URL?

On 6/21/07, Jerad Riggin [EMAIL PROTECTED] wrote:


Is there any link to information about that?  Like I said this is
basically the 2nd day I've used Linux and Nagios so much of this seems like
a different language to me.  Thanks for the info.

On 6/21/07, GJP [EMAIL PROTECTED] wrote:

 Hi Jerad,

 If you mean http auth, this is what I use.

 The check_command in the service definition:

 check_command    check_http_auth!host.example.com!/path!user:password

 And the command definition.

 define command{
    command_name    check_http_auth
    command_line    /usr/lib/nagios/plugins/check_http -H $ARG1$ -u $ARG2$ -a $ARG3$
    }

 But if you mean authentication within a windows AD domain. Maybe Samba
 can help. Because then you can make your linux server a member of the AD
 domain.

 Regards,

 Gerard.




