Re: [Nagios-users] Return Code 127

2006-12-19 Thread Andreas Ericsson
Kaplan, Andrew H. wrote:
 I checked the commands.cfg file and modified the command syntax of check_nrpe
 from /usr/lib/nagios/plugins/check_nrpe to $USER1$/check_nrpe.
 
 The result was I am no longer getting the 127 error, but I am now faced
 with another issue. 
 
 The new error that I am seeing is:
 
 CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for
 error messages.
 
 I checked the HP-UX 11.11 box and the syslog.log file had the following 
 entry:
 
 Host 0.0.0.0 is not allowed to talk to us!
 

It seems there's a bug in HP-UX or you have some extremely odd firewall 
installed. This is supposed to print the ip-address of the connecting host.

 I checked the nrpe.cfg file and under SERVER ADDRESS, I have nrpe bound to
 client's ip address, and under ALLOWED HOSTS, I have several addresses,
 including the Nagios 2.6 server, that are permitted to talk to NRPE on the
 client. 
 

Quick and dirty hack: Add 0.0.0.0 to the list of allowed_hosts. This 
will make it work but will most likely also make it possible to query 
the host from any other server on the network.

-- 
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Schedule Downtime w/o authentication requirement

2006-12-19 Thread Andreas Ericsson
Subin Shakya wrote:
  I want to configure nagios in such a way that it does not require any
 kind of authorization/authentication including 'guest'.  How can I
 do
 that?
 I do not know where to go in Nagios config to make this happen.
 TIA, Symuser
 =0D
 Right now when I select a Host to schedule a downtime I get:
 =0D
 Sorry Dave, I can't let you do that...
 It seems that you have chosen to not use the authentication
 functionality of the CGIs.
 I don't want to be personally responsible for what may happen as a
 result of allowing unauthorized users to issue commands to Nagios,so
 you'll have to disable this safeguard if you are really stubborn and
 want to invite trouble.
 Read the section on CGI authentication in the HTML documentation to
 learn how you can enable authentication and why you should want to.
 

Nagios is more polite than I am. RTFM and do some thinking and you'll 
find the answer.

On a sidenote; I know the answer but I'm not interested in giving it to 
you when you haven't even bothered trying to find it for yourself.

-- 
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios monitoring bacula - service dependencies

2006-12-19 Thread Jeronimo Zucco
NOBODY KNOWS ???

Jeronimo Zucco escreveu:
 Hi, all.

 I have problem using service dependencies. I have a backup server 
 (bacula), and I check status of backup through a mysql query ( 
 http://darcs.complete.org/debian/bacula.upstream/examples/nagios.txt ) 
 on server bacula using nrpe in server bacula.

 I have many others servers with I do backups in bacula (like server1 
 below), then I check status of backup of this server, with this definition:

 define service{
 use generic-service ; Name 
 of service template to use
 nameTEMPLATE-REMOTE-BACULA-BACKUP
 service_description Backup
 is_volatile 0
 check_period24x7
 max_check_attempts  4
 normal_check_interval   5
 retry_check_interval1
 notification_optionsw,u,c,r
 notification_interval   0
 notification_period 24x7
 register0
 }

 define service{
 use TEMPLATE-REMOTE-BACULA-BACKUP
 host_nameserver1
 contact_groupsadmins
 register1
 check_command   check_remote_bacula_server1Job
 }

 define servicedependency{
 host_name   server1
 service_description Backup
 dependent_host_name bacula
 dependent_service_description   PING
 execution_failure_criteria  u,c
 notification_failure_criteria   w,u,c
 }

 I configure service dependency because when I shutdown bacula/mysql 
 server, I got many notifications about backups. What is wrong in my 
 definition of service dependency ? I want no notifications when bacula 
 server is down.

 Thank you.

   


-- 
Jeronimo Zucco
LPIC-1 Linux Professional Institute Certified
Núcleo de Processamento de Dados
Universidade de Caxias do Sul

http://jczucco.blogspot.com


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! 

Recently I have run into the very same performance issues 
as Daniel Meyer (or so it seems). However, I'm not quite sure
about it. Here's the gist of it.

Currently, service check latency slowly creeps up. As it is now,
it starts out at a little over 1s and after about 12 hours it's
in the area of about 90s. It keeps climbing after that. 

Here's the output of nagios -s:
Nagios 2.6
Copyright (c) 1999-2006 Ethan Galstad (http://www.nagios.org)
Last Modified: 11-27-2006
License: GPL

Warning: Contact group 'Singles-Truppe' is not used in any
host/service definitions or host/service escalations!
Projected scheduling information for host and service
checks is listed below.  This information assumes that
you are going to start running Nagios with your current
config files.

HOST SCHEDULING INFORMATION
---
Total hosts: 330
Total scheduled hosts:   0
Host inter-check delay method:   SMART
Average host check interval: 0.00 sec
Host inter-check delay:  0.00 sec
Max host check spread:   10 min
First scheduled check:   N/A
Last scheduled check:N/A


SERVICE SCHEDULING INFORMATION
---
Total services: 2836
Total scheduled services:   2836
Service inter-check delay method:   SMART
Average service check interval: 2225.56 sec
Inter-check delay:  0.21 sec
Interleave factor method:   SMART
Average services per host:  8.59
Service interleave factor:  9
Max service check spread:   10 min
First scheduled check:  Tue Dec 19 11:21:45 2006
Last scheduled check:   Tue Dec 19 11:31:47 2006


CHECK PROCESSING INFORMATION

Service check reaper interval:  5 sec
Max concurrent service checks:  Unlimited


PERFORMANCE SUGGESTIONS
---
I have no suggestions - things look okay.

This all looks peachy - I think. What I don't get is this line:

Average service check interval: 2225.56 sec

It seems to me that this is either a skewed value, stemming from
my history of looong latencies (at one point we were beyonf
9000 seconds). *Or* it is indicative of a misconfiguration on my
part. If the latter is the case, I'd be eager, nay ecstatic to
hear what I did wrong. Here are a few of the config vars that
might influence this:

sleep_time=0.25
service_reaper_frequency=5
max_concurrent_checks=0
max_host_check_spread=10
host_inter_check_delay_method=s
service_interleave_factor=s
command_check_interval=1
obsess_over_services=0
aggregate_status_updates=1
status_update_interval=20

Also, here's the output from nagiostats:
Nagios Stats 2.6
Copyright (c) 2003-2005 Ethan Galstad (www.nagios.org)
Last Modified: 11-27-2006
License: GPL

CURRENT STATUS DATA

Status File:  /var/nagios/status.dat
Status File Age:  0d 0h 0m 3s
Status File Version:  2.6

Program Running Time: 0d 1h 59m 5s

Total Services:   2836
Services Checked: 2836
Services Scheduled:   2758
Active Service Checks:2836
Passive Service Checks:   0
Total Service State Change:   0.000 / 12.370 / 0.007 %
Active Service Latency:   0.006 / 10.237 / 0.906 sec
Active Service Execution Time:0.047 / 10.159 / 0.180 sec
Active Service State Change:  0.000 / 12.370 / 0.007 %
Active Services Last 1/5/15/60 min:   477 / 2678 / 2745 / 2754
Passive Service State Change: 0.000 / 0.000 / 0.000 %
Passive Services Last 1/5/15/60 min:  0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit:2814 / 6 / 0 / 16
Services Flapping:0
Services In Downtime: 0

Total Hosts:  330
Hosts Checked:330
Hosts Scheduled:  0
Active Host Checks:   330
Passive Host Checks:  0
Total Host State Change:  0.000 / 0.000 / 0.000 %
Active Host Latency:  0.000 / 1.000 / 0.888 sec
Active Host Execution Time:   0.030 / 4.059 / 0.112 sec
Active Host State Change: 0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min:  0 / 12 / 12 / 12
Passive Host State Change:0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0
Hosts Up/Down/Unreach:329 / 1 / 0
Hosts Flapping:   0
Hosts In Downtime:0

Hardware is a dual-2.8GHz Xeon, 2G RAM and a 100 FDX interface.
LoadAvg is around 1.6, sometimes gets to 1.9. CPUs are both
around 40% idle most of the time. I see about 300 context
switches and 500 interrupts per second. The network load is
neglible, ditto the packet rate.

The way these figures look I don't see a performance problem per
se, but maybe I 

[Nagios-users] Questions about scheduling

2006-12-19 Thread Tobias Klausmann
Hi! 

I have a few questions about scheduling in Nagios.

- How does the scheduling queue work? From the docs it seems the
  whole queue is held up as soon as a host check is necessary. 
  As far as I know, Nagios parallelizes checks, so my question
  is if the current checking thread is held up only or if all of
  the checks are stopped immediately?

- If the whole set of workers is stopped, this would mean that a
  failing check would result in an immeidate host check which in
  turn holds up all the queues until it is complete. Does it
  really work this way?

- How much performance overhead do service dependencies generate?
  I have quite a few NRPE checks and all of them depend on the
  NRPE dummy check I always define. Does this stall checking by
  any considerable amount?

Thansk for any help,
Tobias


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios monitoring bacula - service dependencies

2006-12-19 Thread Andreas Ericsson
Jeronimo Zucco wrote:
 NOBODY KNOWS ???
 

Lots of people know. The answer is ofcourse that you should read and 
understand the manual before asking questions here. The Nagios config 
documentation really is quite extensive.

Perhaps you missed this particular snippet when configuring?

---%---%---%---
One important thing to note is that by default, Nagios will use the most 
current hard state of the service(s) that is/are being depended upon 
when it does the dependeny checks. If you want Nagios to use the most 
current state of the services (regardless of whether its a soft or hard 
state), enable the soft_service_dependencies option.
---%---%---%---

There are lots of other goodies in the docs.


 Jeronimo Zucco escreveu:
 Hi, all.

 I have problem using service dependencies. I have a backup server 
 (bacula), and I check status of backup through a mysql query ( 
 http://darcs.complete.org/debian/bacula.upstream/examples/nagios.txt ) 
 on server bacula using nrpe in server bacula.

 I have many others servers with I do backups in bacula (like server1 
 below), then I check status of backup of this server, with this definition:

 define service{
 use generic-service ; Name 
 of service template to use
 nameTEMPLATE-REMOTE-BACULA-BACKUP
 service_description Backup
 is_volatile 0
 check_period24x7
 max_check_attempts  4
 normal_check_interval   5
 retry_check_interval1
 notification_optionsw,u,c,r
 notification_interval   0
 notification_period 24x7
 register0
 }

 define service{
 use TEMPLATE-REMOTE-BACULA-BACKUP
 host_nameserver1
 contact_groupsadmins
 register1
 check_command   check_remote_bacula_server1Job
 }

 define servicedependency{
 host_name   server1
 service_description Backup
 dependent_host_name bacula
 dependent_service_description   PING
 execution_failure_criteria  u,c
 notification_failure_criteria   w,u,c
 }

 I configure service dependency because when I shutdown bacula/mysql 
 server, I got many notifications about backups. What is wrong in my 
 definition of service dependency ? I want no notifications when bacula 
 server is down.

 Thank you.

   
 
 

-- 
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Hugo van der Kooij
On Tue, 19 Dec 2006, Tobias Klausmann wrote:

 - How does the scheduling queue work? From the docs it seems the
  whole queue is held up as soon as a host check is necessary.
  As far as I know, Nagios parallelizes checks, so my question
  is if the current checking thread is held up only or if all of
  the checks are stopped immediately?

 - If the whole set of workers is stopped, this would mean that a
  failing check would result in an immeidate host check which in
  turn holds up all the queues until it is complete. Does it
  really work this way?

Judging from the past months traffic on this mailinglist I am inclined to 
say: yes. I think you will find it interresting to delve a bit into the 
archives of this mailinglist.

Hugo.

-- 
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
This message is using 100% recycled electrons.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Andreas Ericsson
Thanks for an excellently detailed problem report, missing only the 
Nagios version and system type/version info. I've got some comments and 
followup questions. See below.

Tobias Klausmann wrote:
 Hi! 
 
 Recently I have run into the very same performance issues 
 as Daniel Meyer (or so it seems). However, I'm not quite sure
 about it. Here's the gist of it.
 
 Currently, service check latency slowly creeps up. As it is now,
 it starts out at a little over 1s and after about 12 hours it's
 in the area of about 90s. It keeps climbing after that. 
 
 Here's the output of nagios -s:
 
 HOST SCHEDULING INFORMATION
 ---
 Total hosts: 330
 Total scheduled hosts:   0

No scheduled host-checks. That's good, cause they interfere with normal 
operations in Nagios.

 Host inter-check delay method:   SMART
 Average host check interval: 0.00 sec
 Host inter-check delay:  0.00 sec
 Max host check spread:   10 min
 First scheduled check:   N/A
 Last scheduled check:N/A
 
 
 SERVICE SCHEDULING INFORMATION
 ---
 Total services: 2836
 Total scheduled services:   2836
 Service inter-check delay method:   SMART
 Average service check interval: 2225.56 sec


This is, as you point out below, quite odd. What's your _longest_ 
normal_check_interval for services?


 Inter-check delay:  0.21 sec
 Interleave factor method:   SMART
 Average services per host:  8.59
 Service interleave factor:  9
 Max service check spread:   10 min
 First scheduled check:  Tue Dec 19 11:21:45 2006
 Last scheduled check:   Tue Dec 19 11:31:47 2006
 
 
 CHECK PROCESSING INFORMATION
 
 Service check reaper interval:  5 sec

You could lower this to 2 seconds. I've done so on any number of 
installations and it has no negative impact what so ever, but seems to 
make Nagios a bit more responsive.

 Max concurrent service checks:  Unlimited
 

I assume you aren't running in to hardware limits on this machine. 
What's the normal load when you're running nagios? If it's  NUM_CPUS 
then you most likely don't have beefy enough hardware. That's hardly 
ever the case though, so don't bother looking into it unless all else fails.

Nvm, question answered below. Hardware resources should be no problem 
what so ever.

 
 This all looks peachy - I think. What I don't get is this line:
 
 Average service check interval: 2225.56 sec
 
 It seems to me that this is either a skewed value, stemming from
 my history of looong latencies (at one point we were beyonf
 9000 seconds).

Nopes. Nagios doesn't bother reading logfiles when it calculates the 
scheduling numbers.

 *Or* it is indicative of a misconfiguration on my
 part. If the latter is the case, I'd be eager, nay ecstatic to
 hear what I did wrong. Here are a few of the config vars that
 might influence this:
 

There has been a slight thinko in Nagios. I don't know if it's still 
there in recent CVS versions. The thinko is that it (used to?) calculate 
average service check interval by adding up all normal_check_interval 
values and dividing it by the number of services configured (or 
something along those lines), which leads to long latencies. This 
normally didn't make those latencies increase though. Humm...


 sleep_time=0.25
 service_reaper_frequency=5
 max_concurrent_checks=0
 max_host_check_spread=10
 host_inter_check_delay_method=s
 service_interleave_factor=s
 command_check_interval=1
 obsess_over_services=0
 aggregate_status_updates=1
 status_update_interval=20
 
 Also, here's the output from nagiostats:
 Nagios Stats 2.6
 Copyright (c) 2003-2005 Ethan Galstad (www.nagios.org)
 Last Modified: 11-27-2006
 License: GPL
 
 CURRENT STATUS DATA
 
 Status File:  /var/nagios/status.dat
 Status File Age:  0d 0h 0m 3s
 Status File Version:  2.6
 
 Program Running Time: 0d 1h 59m 5s
 
 Total Services:   2836
 Services Checked: 2836
 Services Scheduled:   2758
 Active Service Checks:2836
 Passive Service Checks:   0


All services aren't being scheduled, but you have no passive service 
checks. Have you disabled checks of 78 services?


 Total Service State Change:   0.000 / 12.370 / 0.007 %
 Active Service Latency:   0.006 / 10.237 / 0.906 sec
 Active Service Execution Time:0.047 / 10.159 / 0.180 sec
 Active Service State Change:  0.000 / 12.370 / 0.007 %
 Active Services Last 1/5/15/60 min:   477 / 2678 / 2745 / 2754
 Passive Service State Change: 0.000 / 0.000 / 0.000 %
 Passive Services Last 1/5/15/60 min:  0 / 0 / 0 / 0
 Services Ok/Warn/Unk/Crit:2814 / 6 / 0 / 16
 Services Flapping: 

Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Andreas Ericsson
Tobias Klausmann wrote:
 Hi! 
 
 I have a few questions about scheduling in Nagios.
 
 - How does the scheduling queue work? From the docs it seems the
   whole queue is held up as soon as a host check is necessary. 
   As far as I know, Nagios parallelizes checks, so my question
   is if the current checking thread is held up only or if all of
   the checks are stopped immediately?
 

All the checks are stopped immediately. This is to prevent sending 
service notifications when hosts go down, so it's sort of inevitable. 
Nagios 3.0 will support asynchronous host checks, but we're not there yet.

 - If the whole set of workers is stopped, this would mean that a
   failing check would result in an immeidate host check which in
   turn holds up all the queues until it is complete. Does it
   really work this way?
 

Yes, for reasons stated above. It gets slightly worse if you have a 
largely linear network (many hosts only have one child), since it also 
has to check parent hosts until it finds the closest possible up to 
determine where a possible network outage is happening.

 - How much performance overhead do service dependencies generate?
   I have quite a few NRPE checks and all of them depend on the
   NRPE dummy check I always define. Does this stall checking by
   any considerable amount?
 

Not much, and not really, respectively :). You might want to enable 
soft_service_dependencies though. The dependencies only add a couple 
of internal checks along the lines of

if (service-state  dependent_state-notification_failure_criteria)
/* don't send a notification */
;

-- 
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! 

On Tue, 19 Dec 2006, Andreas Ericsson wrote:
 Thanks for an excellently detailed problem report, missing only the 
 Nagios version and system type/version info. I've got some comments and 
 followup questions. See below.

I'm running 2.6 now but I had the troubles with 2.5 initially.
OS is a Gentoo Linux, Kernel 2.6.15.5 initially, upgrade to
2.6.19 today.

  ---
  Total hosts: 330
  Total scheduled hosts:   0
 
 No scheduled host-checks. That's good, cause they interfere with normal 
 operations in Nagios.

I've read as much. In my seperate mail I had a few questions
about it, let's keep them (and the answers there ;)

  Host inter-check delay method:   SMART
  Average host check interval: 0.00 sec
  Host inter-check delay:  0.00 sec
  Max host check spread:   10 min
  First scheduled check:   N/A
  Last scheduled check:N/A
  
  
  SERVICE SCHEDULING INFORMATION
  ---
  Total services: 2836
  Total scheduled services:   2836
  Service inter-check delay method:   SMART
  Average service check interval: 2225.56 sec
 
 This is, as you point out below, quite odd. What's your _longest_ 
 normal_check_interval for services?

The longest check_interval is 86400 seconds. It's a SSL cert
freshness check. I figured it wasn't necesseary to check that
more often than once a day. I also have check_intervals of 3, 5,
15, 20, 30 and 1440 seconds. The latter is also a cert freshness
check which is lower because the customer wanted it to be that
short.

  CHECK PROCESSING INFORMATION
  
  Service check reaper interval:  5 sec
 
 You could lower this to 2 seconds. I've done so on any number of 
 installations and it has no negative impact what so ever, but seems to 
 make Nagios a bit more responsive.

I'll give that a try.

  Max concurrent service checks:  Unlimited
 
 I assume you aren't running in to hardware limits on this machine. 
 What's the normal load when you're running nagios? If it's  NUM_CPUS 
 then you most likely don't have beefy enough hardware. That's hardly 
 ever the case though, so don't bother looking into it unless all else fails.
 
 Nvm, question answered below. Hardware resources should be no problem 
 what so ever.

I also noticed that HT was disabled on the machine. I've changed
that (and added support for it to the kernel) when I did the
kernel upgrade today. I'll keep an eye on check latency.

  *Or* it is indicative of a misconfiguration on my
  part. If the latter is the case, I'd be eager, nay ecstatic to
  hear what I did wrong. Here are a few of the config vars that
  might influence this:
 
 There has been a slight thinko in Nagios. I don't know if it's still 
 there in recent CVS versions. The thinko is that it (used to?) calculate 
 average service check interval by adding up all normal_check_interval 
 values and dividing it by the number of services configured (or 
 something along those lines), which leads to long latencies. This 
 normally didn't make those latencies increase though. Humm...

Well, the numbers sure do get whacky after a restart: first it
skyrockets for about five minutes, then plummets to 1s. From
there it works its way up the way I described.

  Total Services:   2836
  Services Checked: 2836
  Services Scheduled:   2758
  Active Service Checks:2836
  Passive Service Checks:   0
 
 All services aren't being scheduled, but you have no passive service 
 checks. Have you disabled checks of 78 services?

Oops, forgot to mention that. Yes, a server farm is being rebuilt
currently. As I didn't want all the host check timeouts to make
matters much, much, worse, I disabled them entirely.

  Hardware is a dual-2.8GHz Xeon, 2G RAM and a 100 FDX interface.
  LoadAvg is around 1.6, sometimes gets to 1.9. CPUs are both
  around 40% idle most of the time. I see about 300 context
  switches and 500 interrupts per second. The network load is
  neglible, ditto the packet rate.
  
  The way these figures look I don't see a performance problem per
  se, but maybe I have overlooked a metric that descirbes the
  usual bottleneck of installations.
  
 
 Are the CPU's 64 bit ones running in 32-bit emulation mode? For intel 
 cpu's, that causes up to 60% performance loss (yes, it really is that bad).

Sheesh. Yes, it is a 32-bit installation. I only ever bothered
with 64-bit installs on Opteron hardware. I might look into
migrating to 64 bits, then.

 I'm puzzled. Please let me know if you find the answer to this problem. 
 I'll help you debug it as best I can, but please continue posting 
 on-list. Thanks.

Sure. I'll first check if the processor upgrade and kernel
update helped anything, then try lowering the reaper interval to
2. I'll post the results as soon as I have them.

Regards  Thanks,
Tobias
-- 
Never 

Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Tobias Klausmann
Hi! 

On Tue, 19 Dec 2006, Andreas Ericsson wrote:
  - How does the scheduling queue work? From the docs it seems the
whole queue is held up as soon as a host check is necessary. 
As far as I know, Nagios parallelizes checks, so my question
is if the current checking thread is held up only or if all of
the checks are stopped immediately?
 
 All the checks are stopped immediately. This is to prevent sending 
 service notifications when hosts go down, so it's sort of inevitable. 
 Nagios 3.0 will support asynchronous host checks, but we're not there yet.

Good, good, at least there's a solution on the horizon.

  - If the whole set of workers is stopped, this would mean that a
failing check would result in an immeidate host check which in
turn holds up all the queues until it is complete. Does it
really work this way?
 
 Yes, for reasons stated above. It gets slightly worse if you have a 
 largely linear network (many hosts only have one child), since it also 
 has to check parent hosts until it finds the closest possible up to 
 determine where a possible network outage is happening.

I have nearly no parent/child relationships in my setup, as I
don't monitor network equipment just hosts (and their services)
As such, I only have service deps (more deps than services,
actually).

  - How much performance overhead do service dependencies generate?
I have quite a few NRPE checks and all of them depend on the
NRPE dummy check I always define. Does this stall checking by
any considerable amount?
 
 Not much, and not really, respectively :). You might want to enable 
 soft_service_dependencies though. The dependencies only add a couple 
 of internal checks along the lines of
 
 if (service-state  dependent_state-notification_failure_criteria)
   /* don't send a notification */
   ;

Ok, I'll refrain from dropping them, then :)

Thanks,
Tobias

PS: Sorry for hijacking the thread, I missed deleting the
in-reply-to-line. Thinking how easy that mistake was, I'll
probably be a little less annoyed next time I see someone do it
:)

-- 
Never touch a burning system.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Hugo van der Kooij
On Tue, 19 Dec 2006, Andreas Ericsson wrote:

 Yes, for reasons stated above. It gets slightly worse if you have a
 largely linear network (many hosts only have one child), since it also
 has to check parent hosts until it finds the closest possible up to
 determine where a possible network outage is happening.

Just curious. How will this work if you have something like 5 hosts in 
line in a parent-child relation?

The fastest way would be starting from nagios and work your way to the 
downed host as the average latency on a check on a live host is much 
faster then the timeout you get on downed hosts.

Considere the map as shown on 
http://hvdkooij.xs4all.nl/statusmap-20061219.png

If nagios detects the ipv6 router in the lab to be down and it has to work 
it's way up it has to deal with the timeouts on nlams04 and nlams05.

If it starts polling the other way around it only has to deal with the 
host check latency of the switch and the timeout of nlams05.

Hugo.

-- 
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
This message is using 100% recycled electrons.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Andreas Ericsson
Tobias Klausmann wrote:
 Hi! 
 
 On Tue, 19 Dec 2006, Andreas Ericsson wrote:
 Thanks for an excellently detailed problem report, missing only the 
 Nagios version and system type/version info. I've got some comments and 
 followup questions. See below.
 
 I'm running 2.6 now but I had the troubles with 2.5 initially.
 OS is a Gentoo Linux, Kernel 2.6.15.5 initially, upgrade to
 2.6.19 today.
 
 ---
 Total hosts: 330
 Total scheduled hosts:   0
 No scheduled host-checks. That's good, cause they interfere with normal 
 operations in Nagios.
 
 I've read as much. In my seperate mail I had a few questions
 about it, let's keep them (and the answers there ;)
 
 Host inter-check delay method:   SMART
 Average host check interval: 0.00 sec
 Host inter-check delay:  0.00 sec
 Max host check spread:   10 min
 First scheduled check:   N/A
 Last scheduled check:N/A


 SERVICE SCHEDULING INFORMATION
 ---
 Total services: 2836
 Total scheduled services:   2836
 Service inter-check delay method:   SMART
 Average service check interval: 2225.56 sec
 This is, as you point out below, quite odd. What's your _longest_ 
 normal_check_interval for services?
 
 The longest check_interval is 86400 seconds. It's a SSL cert
 freshness check. I figured it wasn't necesseary to check that
 more often than once a day. I also have check_intervals of 3, 5,
 15, 20, 30 and 1440 seconds. The latter is also a cert freshness
 check which is lower because the customer wanted it to be that
 short.
 

Try changing the really long intervals to something shorter or 
commenting them out completely and see what happens. Checking a 
certificate is not a particularly heavy operation so it doesn't matter 
much if you run it ever 5 minutes. On the server side it just gets 
handed out from cache, so it's not heave there either.

If you have the various normal_check_interval's specified in templates, 
try setting them all to 5 minutes and let Nagios run over-night. If this 
interferes with some fragile services on the network (webservers whose 
sessions don't expire, fe), disable active checks for those services 
during the testing period.

(yes, this might seem braindead, but I really need to know if this bug 
is still in Nagios).

 
 *Or* it is indicative of a misconfiguration on my
 part. If the latter is the case, I'd be eager, nay ecstatic to
 hear what I did wrong. Here are a few of the config vars that
 might influence this:
 There has been a slight thinko in Nagios. I don't know if it's still 
 there in recent CVS versions. The thinko is that it (used to?) calculate 
 average service check interval by adding up all normal_check_interval 
 values and dividing it by the number of services configured (or 
 something along those lines), which leads to long latencies. This 
 normally didn't make those latencies increase though. Humm...
 
 Well, the numbers sure do get whacky after a restart: first it
 skyrockets for about five minutes, then plummets to 1s. From
 there it works its way up the way I described.
 

Are the first checks of things being scheduled with unreasonably long 
delays? Fe, a check with 3 minute normal_check_interval being scheduled 
an hour or so into the future.


 Total Services:   2836
 Services Checked: 2836
 Services Scheduled:   2758
 Active Service Checks:2836
 Passive Service Checks:   0
 All services aren't being scheduled, but you have no passive service 
 checks. Have you disabled checks of 78 services?
 
 Oops, forgot to mention that. Yes, a server farm is being rebuilt
 currently. As I didn't want all the host check timeouts to make
 matters much, much, worse, I disabled them entirely.
 

Ah, that explains it then. It shouldn't matter, but unless the 
experiment I suggested above turns up anything useful, would you mind 
commenting them out and testing that?

 Hardware is a dual-2.8GHz Xeon, 2G RAM and a 100 FDX interface.
 LoadAvg is around 1.6, sometimes gets to 1.9. CPUs are both
 around 40% idle most of the time. I see about 300 context
 switches and 500 interrupts per second. The network load is
 neglible, ditto the packet rate.

 The way these figures look I don't see a performance problem per
 se, but maybe I have overlooked a metric that descirbes the
 usual bottleneck of installations.

 Are the CPU's 64 bit ones running in 32-bit emulation mode? For intel 
 cpu's, that causes up to 60% performance loss (yes, it really is that bad).
 
 Sheesh. Yes, it is a 32-bit installation. I only ever bothered
 with 64-bit installs on Opteron hardware. I might look into
 migrating to 64 bits, then.
 

So the CPU's are 64-bits? Humm... 64-bit mode would boost available 
resources quite a bit, but as you just enabled HT you should now have 3 
extra CPU's (Xeon's are dualcore AFAIR) which 

Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Daniel Meyer
On Tue, 19 Dec 2006, Tobias Klausmann wrote:

 I'm running 2.6 now but I had the troubles with 2.5 initially.
 OS is a Gentoo Linux, Kernel 2.6.15.5 initially, upgrade to
 2.6.19 today.

Same here. Latency-Problems with both 2.5 and 2.6, but on CentOS 4.4 (good 
that you use gentoo, saves me the time to try it on a heavy optimized 
gentoo box :)

 You could lower this to 2 seconds. I've done so on any number of
 installations and it has no negative impact what so ever, but seems to
 make Nagios a bit more responsive.

 I'll give that a try.

I've tried that but had some failing checks when i did that. Very 
strange...

 I also noticed that HT was disabled on the machine. I've changed
 that (and added support for it to the kernel) when I did the
 kernel upgrade today. I'll keep an eye on check latency.

I have HT enabled, no effect on the nagios latency problems.

Danny
-- 
Q: Gentoo is too hard to install  =http://www.cyberdelia.de
and I feel like whining.   = [EMAIL PROTECTED]
A: Please see /dev/null.  =
   (from the gentoo installer FAQ) = \o/

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Andreas Ericsson
Hugo van der Kooij wrote:
 On Tue, 19 Dec 2006, Andreas Ericsson wrote:
 
 Yes, for reasons stated above. It gets slightly worse if you have a
 largely linear network (many hosts only have one child), since it also
 has to check parent hosts until it finds the closest possible up to
 determine where a possible network outage is happening.
 
 Just curious. How will this work if you have something like 5 hosts in 
 line in a parent-child relation?
 
 The fastest way would be starting from nagios and work your way to the 
 downed host as the average latency on a check on a live host is much 
 faster then the timeout you get on downed hosts.
 
 Considere the map as shown on 
 http://hvdkooij.xs4all.nl/statusmap-20061219.png
 
 If nagios detects the ipv6 router in the lab to be down and it has to work 
 it's way up it has to deal with the timeouts on nlams04 and nlams05.
 
 If it starts polling the other way around it only has to deal with the 
 host check latency of the switch and the timeout of nlams05.
 

In the case you posted on your map, it would indeed be faster to start 
walking in - out. However, if the closest parent had been up it would 
have been the other way around. Anyways, I *think* nagios checks 
in-out. Either way, it's important for a host check to return OK 
*immediately* when it finds that the host it's checking actually *is* 
ok, which is why I wrote check_icmp and let it have a check_host mode 
which does just that. The original default hostcheck (I think it's still 
the default, btw) would wait a minimum of 5 seconds no matter if the 
first ping came back ok after 5ms. Since all other checks are stopped, 
this causes quite a bit of slowdown.

When I think about it, it would indeed (especially with check_host) be 
faster to start unreachability checks with root-hosts and then following 
children down to the targeted host since we would that way encounter a 
minimum of host check timeouts. It's programmatically slightly trickier 
though, as you'd have to walk it backwards from the problem host, push 
each parent host to a stack and then pop them from that stack when you 
do the actual checking. I'll look in this and see if a patch is 
necessary and, if so, if I can come up with one.

-- 
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Daniel Meyer
On Tue, 19 Dec 2006, Hugo van der Kooij wrote:

 Just curious. How will this work if you have something like 5 hosts in
 line in a parent-child relation?

 The fastest way would be starting from nagios and work your way to the
 downed host as the average latency on a check on a live host is much
 faster then the timeout you get on downed hosts.

i think it works its way from the failing host up to the nagios server, 
which would be logical from the point of view that nagios knows the 
parents of the failed server. Dont think that nagios uses that information 
to rebuild it into a child list...

That might be the solution to why did nagios send me so many 
notifications today:

consider those host relations:

nagios - router - router - switch - server1 to server n
(up) (up) (down)   (dummy)  (up...)

I had a failed leased line today. The switch in that scenario was up due 
to a dummy check which always replies ok, the router was down. I got a 
host down notification for each of the servers Must be because nagios 
started to check from the servers to the nagios host, and found the switch 
to be up and runnnig.

Danny
-- 
Q: Gentoo is too hard to install  =http://www.cyberdelia.de
and I feel like whining.   = [EMAIL PROTECTED]
A: Please see /dev/null.  =
   (from the gentoo installer FAQ) = \o/

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] start nagios

2006-12-19 Thread Alicia Calvo
hello,
  I configured about manual but at start time Nagios send the following msgs: 
Permiso denegado. 
  I try start at /etc/rc.d/init.d/nagios start.
  Thank you for any idea.
 AC

 __
Correo Yahoo!
Espacio para todos tus mensajes, antivirus y antispam ¡gratis! 
¡Abrí tu cuenta ya! - http://correo.yahoo.com.ar-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Performance issues, too

2006-12-19 Thread Tobias Klausmann
Hi! 

On Tue, 19 Dec 2006, Daniel Meyer wrote:
  You could lower this to 2 seconds. I've done so on any number of
  installations and it has no negative impact what so ever, but seems to
  make Nagios a bit more responsive.
 
  I'll give that a try.
 
 I've tried that but had some failing checks when i did that. Very 
 strange...

I'm still waiting how the kernel change will work out.

  I also noticed that HT was disabled on the machine. I've changed
  that (and added support for it to the kernel) when I did the
  kernel upgrade today. I'll keep an eye on check latency.
 
 I have HT enabled, no effect on the nagios latency problems.

I've now setup a little script that puts host and service check
latency in an RRD file every five minutes. So far, the curve
looks very inconclusive.

Regards,
Tobias
-- 
Never touch a burning system.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] start nagios

2006-12-19 Thread Hugo van der Kooij
On Tue, 19 Dec 2006, Alicia Calvo wrote:

  I configured about manual but at start time Nagios send the following msgs: 
 Permiso denegado.
  I try start at /etc/rc.d/init.d/nagios start.

Try looking for permission denied and nagios with your favorite search 
engine.

You did not include any relevant details like:
  - Nagios version
  - Install method
  - Started as user .
  

Hugo.

-- 
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
This message is using 100% recycled electrons.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage

2006-12-19 Thread Kaplan, Andrew H.
Message to all --

I am sorry about the repeated posts. I was simply desperate for an answer, hence
my 'panic'. I did not intend to irritate anybody on this matter, and if I have
please accept my apologies and assurances that it was completely accidental. 

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Hugo van der
Kooij
Sent: Monday, December 18, 2006 7:11 PM
To: Nagios Users mailinglist
Subject: Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes
fromdaemon ErrorMessage

On Mon, 18 Dec 2006, Marc Powell wrote:

 -Original Message-
 From: [EMAIL PROTECTED] [mailto:nagios-users-
 [EMAIL PROTECTED] On Behalf Of Kaplan, Andrew H.
 Sent: Monday, December 18, 2006 3:18 PM
 To: nagios-users@lists.sourceforge.net
 Subject: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes
 fromdaemon ErrorMessage

 I don't have the answer but frequent and repeated postings generally
 just irritate people enough to not want to help. It's not like we could
 have missed it the first, second or third times today to necessitate
 this fourth. Your urgent problem is not our urgent problem
 unfortunately. You generally want to allow at least 24 hours before
 reposting to give everyone a chance to see it.

I would suggest more like 72 hours at least. And even then I think one 
should question if a verbatim repost is usefull. If no one reacted the 
first time it well be because a posting was unclear in it's intend.

If one is in dire need one needs to pick up the tab and hire some nagios 
consultant. Open Source by no means you get days of support for free. Most 
of the people around here have mortgages or rent to pay and propably the 
need for groceries and so on.

Hugo.

-- 
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
This message is using 100% recycled electrons.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any
issue. 
::: Messages without supporting info will risk being sent to /dev/null

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Server/Router with more than 1 interface

2006-12-19 Thread Herman (ISTD)
Dear all,

I have some servers and routers that have more than 1 interface, and
each interface has IP Address. I would like define a host is alive if
all of its interface ip address are available via check_ping.

So I define as following, but if I shut down the interface with ip
10.254.8.2. Nagios still think the host as alive, since Nagios just
detect the IP 10.16.20.120 as the indicator of the host. Any workaround
to make Nagios aware of both IP Addresses/IP Address ? Any suggestion
will be highly appreciated.

Thank you.

define host{
use generic-host; Name of host
template to use

host_name   NAS-TMMIN-HO
alias   FTP Server
address 10.16.20.120,10.254.8.2
parents HO_Core_SW
max_check_attempts  2
check_command   check-host-alive
contact_groups  admins
notification_interval   120
notification_optionsd,u,r
notification_period 24x7
flap_detection_enabled  1
}

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Server/Router with more than 1 interface

2006-12-19 Thread Hugo van der Kooij
On Tue, 19 Dec 2006, Herman (ISTD) wrote:

 I have some servers and routers that have more than 1 interface, and
 each interface has IP Address. I would like define a host is alive if
 all of its interface ip address are available via check_ping.

 So I define as following, but if I shut down the interface with ip
 10.254.8.2. Nagios still think the host as alive, since Nagios just
 detect the IP 10.16.20.120 as the indicator of the host. Any workaround
 to make Nagios aware of both IP Addresses/IP Address ? Any suggestion
 will be highly appreciated.

This is one of those rare cases where host dependencies may come into 
play.

But I would add the interfaces as PING service to the host or use SNMP to 
check the real interface status. Something like:

# 'check_snmp_int' command definition define command{
 command_namecheck_snmp_int
 command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $ARG1$ 
-n $ARG2$
 }

Getting the proper interface names can be a daunting task with some 
routers.

Hugo.

-- 
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
This message is using 100% recycled electrons.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Use statement

2006-12-19 Thread Saulo Silva

Hi all ,

Just to confirm . Can I use two use statement when I am defining a service .

Like  a example  :

service http{
host_name  some name
use service definition 1
use service definition 2



Best Regards ,


Saulo Augusto Silva
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] notifications.cgi

2006-12-19 Thread Andreas Koch
Hello Group.

i want see all notification for the special User (work with
contact=User) and the Last Week (7 Days).

Can notifications.cgi this or have anybody script who make this?

Regards,
Andreas.


smime.p7s
Description: S/MIME cryptographic signature
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage

2006-12-19 Thread Kaplan, Andrew H.
The NRPE issue that has been very problematic has generated another error. I was
briefly getting output back from the remote client, but now

I am getting the error message: 

 

NRPE: Unable to read output. 

 

Here is a brief listing of the configuration:

 

Nagios server: Nagios 2.6

NRPE client: 2.5.2 running on an HP-UX 11.11 system

Allowed hosts: 0.0.0.0

 

The remote client uses inetd (without tcp wrappers) and here is the
configuration in the inetd.conf file:

 

nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe -c
/usr/local/nagios/nrpe.cfg -inetd

 

The remote client also has a startup script in the /sbin/init.d directory called
nrpe. Here is the text of the script:

 

/usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/nrpe.cfg -d

 

I checked the log file on the remote client, and there was no indication of any
connection problem. The messages and nagios.log files

on the server simply reported the following:

 

UNKNOWN;notify-by-email;NRPE: Unable to read ouput. 

 

I have confirmed there is no firewall activated on the system, and that nrpe
communication is allowed on the client. 

 



From: Steve Shipway [mailto:[EMAIL PROTECTED] 
Sent: Monday, December 18, 2006 8:24 PM
To: Kaplan, Andrew H.; nagios-users@lists.sourceforge.net
Subject: RE: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes
fromdaemon ErrorMessage

 

Sounds like the daemon is comparing the client's IP against the list of
permitted connections, and not getting a match.  For some reason the query of
the source IP on the connection is returning 0.0.0.0 instead of the source IP -
maybe you have some special wrapper or intervening agent.  I'd suggest running
under xinetd or something similar which can do the filtering instead.

 

Steve

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Use statement

2006-12-19 Thread Hugo van der Kooij
On Tue, 19 Dec 2006, Saulo Silva wrote:

 Just to confirm . Can I use two use statement when I am defining a service .

Why do you not use nagios -v to confirm it? It is there for you to verify 
your config. (http://nagios.sourceforge.net/docs/2_0/verifyconfig.html)

Hugo.

-- 
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
This message is using 100% recycled electrons.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytesfromdaemon ErrorMessage

2006-12-19 Thread Phil Costelloe
 The remote client uses inetd (without tcp wrappers) and here is the
configuration in the inetd.conf file:

nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe -c
/usr/local/nagios/nrpe.cfg -inetd

Solaris requires the name of the server program and then the arguments,
which include argument 0 (the name of the program). So that would be :

nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe
/usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg --inetd

Not sure if HP-UX inetd.conf works the same way.

 The remote client also has a startup script in the /sbin/init.d
directory called nrpe. Here is the text of the script:

/usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/nrpe.cfg
-d

I'd say that you either want to run as a daemon or run under inetd, not
both.

Phil
 


Phil Costelloe
Technical Consultant

   
[EMAIL PROTECTED]
http://www.foundation-it.com
Foundation IT
Foundation Court
Old Street
Oare
Hermitage Berkshire RG18 9SE
Switch: +44 (0) 1635 203700
Helpdesk: +44 (0) 800 0121099
DDI: +44 (0) 1635 203719
Fax: +44 (0) 8700 543537
Mobile: +44 (0) 7884 236299

 


This message contains confidential information and is intended only for [EMAIL 
PROTECTED] If you are not nagios-users@lists.sourceforge.net you should not 
disseminate, distribute or copy this e-mail. Please notify Foundation IT 
immediately by e-mail if you have received this e-mail by mistake and delete 
this e-mail from your system. E-mail transmission cannot be guaranteed to be 
secure or error-free as information could be intercepted, corrupted, lost, 
destroyed, arrive late or incomplete, or contain viruses. Foundation IT 
therefore does not accept liability for any errors or omissions in the contents 
of this message, which arise as a result of e-mail transmission. If 
verification is required please request a hard-copy version.


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0bytesfromdaemon ErrorMessage

2006-12-19 Thread Kaplan, Andrew H.
Hi there --

I checked the inetd.conf file for HP-UX and it differs from Solaris in that 
it does not require the name of the program after the user field. I did remove
nrpe from the file to see if that made a difference, and unfortunately it did
not. 

-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Phil Costelloe
Sent: Tuesday, December 19, 2006 10:21 AM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0bytesfromdaemon
ErrorMessage

 The remote client uses inetd (without tcp wrappers) and here is the
configuration in the inetd.conf file:

nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe -c
/usr/local/nagios/nrpe.cfg -inetd

Solaris requires the name of the server program and then the arguments,
which include argument 0 (the name of the program). So that would be :

nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe
/usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg --inetd

Not sure if HP-UX inetd.conf works the same way.

 The remote client also has a startup script in the /sbin/init.d
directory called nrpe. Here is the text of the script:

/usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/nrpe.cfg
-d

I'd say that you either want to run as a daemon or run under inetd, not
both.

Phil
 


Phil Costelloe
Technical Consultant

   
[EMAIL PROTECTED]
http://www.foundation-it.com
Foundation IT
Foundation Court
Old Street
Oare
Hermitage Berkshire RG18 9SE
Switch: +44 (0) 1635 203700
Helpdesk: +44 (0) 800 0121099
DDI: +44 (0) 1635 203719
Fax: +44 (0) 8700 543537
Mobile: +44 (0) 7884 236299

 


This message contains confidential information and is intended only for
[EMAIL PROTECTED] If you are not
nagios-users@lists.sourceforge.net you should not disseminate, distribute or
copy this e-mail. Please notify Foundation IT immediately by e-mail if you have
received this e-mail by mistake and delete this e-mail from your system. E-mail
transmission cannot be guaranteed to be secure or error-free as information
could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or
contain viruses. Foundation IT therefore does not accept liability for any
errors or omissions in the contents of this message, which arise as a result of
e-mail transmission. If verification is required please request a hard-copy
version.


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any
issue. 
::: Messages without supporting info will risk being sent to /dev/null

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Questions about scheduling

2006-12-19 Thread Marc Powell


 -Original Message-
 From: [EMAIL PROTECTED] [mailto:nagios-users-
 [EMAIL PROTECTED] On Behalf Of Daniel Meyer
 Sent: Tuesday, December 19, 2006 6:19 AM
 To: Nagios Users mailinglist
 Subject: Re: [Nagios-users] Questions about scheduling
 
 On Tue, 19 Dec 2006, Hugo van der Kooij wrote:
 
  Just curious. How will this work if you have something like 5 hosts
in
  line in a parent-child relation?
 
  The fastest way would be starting from nagios and work your way to
the
  downed host as the average latency on a check on a live host is much
  faster then the timeout you get on downed hosts.
 
 i think it works its way from the failing host up to the nagios
server,
 which would be logical from the point of view that nagios knows the
 parents of the failed server. Dont think that nagios uses that
information
 to rebuild it into a child list...

This presentation, starting on page 19, details current and future host
check logic --

http://www.netways.de/de/nagios_konferenz/archiv_2006/programm/nagios_30
_and_beyond/

--
Marc



-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] MRTG link to a specific host.

2006-12-19 Thread Antonio Pereira
Hello,

 

I have Nagios 2.6 installed and I have mrtg 2.13.2 installed and
working. 

 

I would like to create a icon or link that within Nagiso that when you
click on the link that it will go to the mrtg page that I created.

 

Thanks

 

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] MRTG link to a specific host.

2006-12-19 Thread Morris, Patrick
 I have Nagios 2.6 installed and I have mrtg 2.13.2 installed 
 and working. 
 
 I would like to create a icon or link that within Nagiso that 
 when you click on the link that it will go to the mrtg page 
 that I created.

http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#hostextinfo

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Schedule Downtime w/o authentication requirement

2006-12-19 Thread Subin Shakya

Andreas, Thanks for your advice.  But, FYI, I have done what I could
before I painfully signed up and advertised  my issue on this list. 

By the way, I am novice on Nagios (heck also with Linux) and took me a
while to build the Nagios system which I am extremely proud of.

What do you think? Can you pass your knowledge to me now?

- Subin

-Original Message-
From: Andreas Ericsson [mailto:[EMAIL PROTECTED]
Sent: Tuesday, December 19, 2006 1:27 AM
To: Subin Shakya
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Schedule Downtime w/o authentication
requirement

Subin Shakya wrote:
  I want to configure nagios in such a way that it does not require any
 kind of authorization/authentication including 'guest'.  How can I
 do
 that?
 I do not know where to go in Nagios config to make this happen.
 TIA, Symuser
 =0D
 Right now when I select a Host to schedule a downtime I get:
 =0D
 Sorry Dave, I can't let you do that...
 It seems that you have chosen to not use the authentication
 functionality of the CGIs.
 I don't want to be personally responsible for what may happen as a
 result of allowing unauthorized users to issue commands to
Nagios,so
 you'll have to disable this safeguard if you are really stubborn
and
 want to invite trouble.
 Read the section on CGI authentication in the HTML documentation
to
 learn how you can enable authentication and why you should want
to.


Nagios is more polite than I am. RTFM and do some thinking and you'll
find the answer.

On a sidenote; I know the answer but I'm not interested in giving it to
you when you haven't even bothered trying to find it for yourself.

--
Andreas Ericsson   [EMAIL PROTECTED]
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

===
Notice: This e-mail message, together with any attachments, contains
information of Symyx Technologies, Inc. or any of its affiliates or
subsidiaries that may be confidential, proprietary, copyrighted,
privileged and/or protected work product, and is meant solely for
the intended recipient. If you are not the intended recipient, and
have received this message in error, please contact the sender
immediately, permanently delete the original and any copies of this
email and any attachments thereto.

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] SSL handshake error

2006-12-19 Thread Ludwig Pummer
So, the daemon end of your NRPE check is a Solaris 10 server. What OS is
on the client end? What if you try to connect to localhost on the
Solaris 10 machine with check_nrpe ?
 

--
Ludwig Pummer
System Administrator, Copart Auto Auctions
707-639-5067 [EMAIL PROTECTED]



 





From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Extra
Special Bitter
Sent: Tuesday, December 19, 2006 6:45 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] SSL handshake error


Last week we installed the NRPE daemon on a Solaris 10 server
but have had trouble configuring it to monitor anything more
sophisticated than a basic Host Alive check.  Remote checks show up in
the console with a status of unknown with the error message: 


CHECK_NRPE: Received 0 bytes from daemon. Check the remote
server logs for error messages.  


Checking the error log on the remote server reveals the
following: 


[EMAIL PROTECTED] dmesg | grep nrpe | tail -1 
Dec 19 08:45:40 txopprd1 nrpe[27334]: [ID 813741 daemon.error]
Error: Could not complete SSL handshake. 1 


Any clues?  Thanks in advance... 

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Schedule Downtime w/o authentication requirement

2006-12-19 Thread Mike Holloway

Subin, have a look through cgi.cfg

-mike


On Dec 19, 2006, at 12:30 PM, Subin Shakya wrote:


 Andreas, Thanks for your advice.  But, FYI, I have done what I could
 before I painfully signed up and advertised  my issue on this list.

 By the way, I am novice on Nagios (heck also with Linux) and took me a
 while to build the Nagios system which I am extremely proud of.

 What do you think? Can you pass your knowledge to me now?

 - Subin

 -Original Message-
 From: Andreas Ericsson [mailto:[EMAIL PROTECTED]
 Sent: Tuesday, December 19, 2006 1:27 AM
 To: Subin Shakya
 Cc: nagios-users@lists.sourceforge.net
 Subject: Re: [Nagios-users] Schedule Downtime w/o authentication
 requirement

 Subin Shakya wrote:
  I want to configure nagios in such a way that it does not require  
 any
 kind of authorization/authentication including 'guest'.  How  
 can I
 do
 that?
 I do not know where to go in Nagios config to make this happen.
 TIA, Symuser
 =0D
 Right now when I select a Host to schedule a downtime I get:
 =0D
 Sorry Dave, I can't let you do that...
 It seems that you have chosen to not use the authentication
 functionality of the CGIs.
 I don't want to be personally responsible for what may happen  
 as a
 result of allowing unauthorized users to issue commands to
 Nagios,so
 you'll have to disable this safeguard if you are really stubborn
 and
 want to invite trouble.
 Read the section on CGI authentication in the HTML documentation
 to
 learn how you can enable authentication and why you should want
 to.


 Nagios is more polite than I am. RTFM and do some thinking and you'll
 find the answer.

 On a sidenote; I know the answer but I'm not interested in giving  
 it to
 you when you haven't even bothered trying to find it for yourself.

 --
 Andreas Ericsson   [EMAIL PROTECTED]
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 ===
 Notice: This e-mail message, together with any attachments, contains
 information of Symyx Technologies, Inc. or any of its affiliates or
 subsidiaries that may be confidential, proprietary, copyrighted,
 privileged and/or protected work product, and is meant solely for
 the intended recipient. If you are not the intended recipient, and
 have received this message in error, please contact the sender
 immediately, permanently delete the original and any copies of this
 email and any attachments thereto.

 -- 
 ---
 Take Surveys. Earn Cash. Influence the Future of IT
 Join SourceForge.net's Techsay panel and you'll get the chance to  
 share your
 opinions on IT  business topics through brief surveys - and earn cash
 http://www.techsay.com/default.php? 
 page=join.phpp=sourceforgeCID=DEVDEV
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when  
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Schedule Downtime w/o authentication requirement

2006-12-19 Thread frank
No, you didn't.

The last link under Configuring Nagios in the documentation table of 
contents clearly says Configuring authorization for the CGIs. Read it!

http://nagios.sourceforge.net/docs/2_0/cgiauth.html

Authentication On Secured Web Servers

If your web server is located in a secure domain (i.e., behind a firewall) 
or if you are using SSL, you can define a default username that can be 
used to access the CGIs. This is done by defining the default_user_name 
option in the CGI configuration file. By defining a default username that 
can access the CGIs, you can allow users to access the CGIs without 
necessarily having to authenticate to the web server.. You may want to use 
this to avoid having to use basic web authentication, as basic 
authentication transmits passwords in clear text over the Internet.

Important: Do not define a default username unless you are running a 
secure web server and are sure that everyone who has access to the CGIs 
has been authenticated in some manner! If you define this variable, anyone 
who has not authenticated to the web server will inherit all rights you 
assign to this user!


-f

On Tue, 19 Dec 2006, Subin Shakya wrote:

 Date: Tue, 19 Dec 2006 10:30:06 -0800
 From: Subin Shakya [EMAIL PROTECTED]
 To: Andreas Ericsson [EMAIL PROTECTED]
 Cc: nagios-users@lists.sourceforge.net
 Subject: Re: [Nagios-users] Schedule Downtime w/o authentication requirement
 

 Andreas, Thanks for your advice.  But, FYI, I have done what I could
 before I painfully signed up and advertised  my issue on this list.

 By the way, I am novice on Nagios (heck also with Linux) and took me a
 while to build the Nagios system which I am extremely proud of.

 What do you think? Can you pass your knowledge to me now?

 - Subin

 -Original Message-
 From: Andreas Ericsson [mailto:[EMAIL PROTECTED]
 Sent: Tuesday, December 19, 2006 1:27 AM
 To: Subin Shakya
 Cc: nagios-users@lists.sourceforge.net
 Subject: Re: [Nagios-users] Schedule Downtime w/o authentication
 requirement

 Subin Shakya wrote:
  I want to configure nagios in such a way that it does not require any
 kind of authorization/authentication including 'guest'.  How can I
 do
 that?
 I do not know where to go in Nagios config to make this happen.
 TIA, Symuser
 =0D
 Right now when I select a Host to schedule a downtime I get:
 =0D
 Sorry Dave, I can't let you do that...
 It seems that you have chosen to not use the authentication
 functionality of the CGIs.
 I don't want to be personally responsible for what may happen as a
 result of allowing unauthorized users to issue commands to
 Nagios,so
 you'll have to disable this safeguard if you are really stubborn
 and
 want to invite trouble.
 Read the section on CGI authentication in the HTML documentation
 to
 learn how you can enable authentication and why you should want
 to.


 Nagios is more polite than I am. RTFM and do some thinking and you'll
 find the answer.

 On a sidenote; I know the answer but I'm not interested in giving it to
 you when you haven't even bothered trying to find it for yourself.

 --
 Andreas Ericsson   [EMAIL PROTECTED]
 OP5 AB www.op5.se
 Tel: +46 8-230225  Fax: +46 8-230231

 ===
 Notice: This e-mail message, together with any attachments, contains
 information of Symyx Technologies, Inc. or any of its affiliates or
 subsidiaries that may be confidential, proprietary, copyrighted,
 privileged and/or protected work product, and is meant solely for
 the intended recipient. If you are not the intended recipient, and
 have received this message in error, please contact the sender
 immediately, permanently delete the original and any copies of this
 email and any attachments thereto.

 -
 Take Surveys. Earn Cash. Influence the Future of IT
 Join SourceForge.net's Techsay panel and you'll get the chance to share your
 opinions on IT  business topics through brief surveys - and earn cash
 http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when reporting 
 any issue.
 ::: Messages without supporting info will risk being sent to /dev/null


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net

Re: [Nagios-users] SSL handshake error

2006-12-19 Thread Extra Special Bitter

Ludwig,

Thanks for the reply.

As it turns out, the error went away when I changed my check command from
check_unix_disk_no-ssl to check_unix_disk.  Who knew?

Paul

On 12/19/06, Ludwig Pummer [EMAIL PROTECTED] wrote:


 So, the daemon end of your NRPE check is a Solaris 10 server. What OS is
on the client end? What if you try to connect to localhost on the Solaris 10
machine with check_nrpe ?


--
Ludwig Pummer
System Administrator, Copart Auto Auctions
707-639-5067 [EMAIL PROTECTED]



 --
*From:* [EMAIL PROTECTED] [mailto:
[EMAIL PROTECTED] *On Behalf Of *Extra Special
Bitter
*Sent:* Tuesday, December 19, 2006 6:45 AM
*To:* nagios-users@lists.sourceforge.net
*Subject:* [Nagios-users] SSL handshake error

Last week we installed the NRPE daemon on a Solaris 10 server but have had
trouble configuring it to monitor anything more sophisticated than a basic
Host Alive check.  Remote checks show up in the console with a status of
unknown with the error message:

CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs
for error messages. 

Checking the error log on the remote server reveals the following:

[EMAIL PROTECTED] dmesg | grep nrpe | tail -1
Dec 19 08:45:40 txopprd1 nrpe[27334]: [ID 813741 daemon.error] Error:
Could not complete SSL handshake. 1
Any clues?  Thanks in advance...


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Getting pie charts in host's history

2006-12-19 Thread Craig Van Tassle
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

My boss wants to be able to look at pie charts when ever we are looking at
history for various services. So far I have only been able to get a bar graph
that is showing the uptime and down time, but not I have not been able to get a
pie chart to display.

I am using ubuntu 6.10, with nagios 2.4

Any help or though would be appreciated.

Craig
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFiFotAOTIJ89W4sIRAgbGAJ4geQOzBNgIurspehrapp3THi8xbgCfUGR9
UaNr+RnEzArd6nsNTasN7fM=
=S0Pg
-END PGP SIGNATURE-


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Getting pie charts in host's history

2006-12-19 Thread Az
Craig Van Tassle wrote:
 My boss wants to be able to look at pie charts when ever we are looking at
 history for various services. So far I have only been able to get a bar graph
 that is showing the uptime and down time, but not I have not been able to get 
 a
 pie chart to display.

 I am using ubuntu 6.10, with nagios 2.4

 Any help or though would be appreciated.
By bar graph I assume you mean the one on 'View Trends For This Service' 
(which doesnt look very bar-ish when its all one colour ;) ). There is 
no pie chart in the version I am running (v2.1) nor in the doco 
(http://nagios.sourceforge.net/docs/2_0/cgis.html).

You can either export the data in CSV format from the Availability 
Report and process it yourself, or write your own CGI.

Or do as I do... tell your boss how to get the raw data and they can 
spend all day making purty charts while you can spend your day doing 
more productive things like keeping things running. ;)


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage

2006-12-19 Thread Steve Shipway
I would say that you need to use inetd, OR an init.d script, but not
both.  If you try to then you'll get errors when they try and start up.
 
Your 'unable to read output' error seems to indicate that you have a
timeout.  I don't know what plugin you are attempting to run remotely,
but check to make sure that the timeout on check_nrpe (A), on the
nrpe.cfg file (B) and for the remotely run plugin (C) are set as CBA.
Your error message seems to be timeout A.
 
Another possibility is that your check_nrpe is running in non-ssl mode,
but your nrpe daemon is running in SSL mode...  I notice that you have
given the -n option (non-SSL) in your init.d definition, but not in your
inetd.conf definition!  Are you using -n in your check_nrpe?  Pick one
and use it throughout...
 
Steve
--
Steve Shipway
ITSS, University of Auckland
(09) 3737 599 x 86487
[EMAIL PROTECTED]


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] monitoring streaming

2006-12-19 Thread Rob Brown
You can use the check_real plugin to test basic connectivity to a WMS server:
./check_real -H hostname -u /industrial.wmv

On 12/14/06, Marcel Mitsuto Fucatu Sugano [EMAIL PROTECTED] wrote:
 Does anyone knows of a plugin to monitor windows media services
 streamings, by emulating a user who watches a sample movie? I mean, by
 connecting and evaluating if the server is responding correctly?

 TIA,
 Marcel


 AVISO: A informação contida neste e-mail, bem como em qualquer de seus 
 anexos, é CONFIDENCIAL e destinada ao uso exclusivo do(s) destinatário(s) 
 acima referido(s), podendo conter informações sigilosas e/ou legalmente 
 protegidas. Caso você não seja o destinatário desta mensagem, informamos que 
 qualquer divulgação, distribuição ou cópia deste e-mail e/ou de qualquer de 
 seus anexos é absolutamente proibida. Solicitamos que o remetente seja 
 comunicado imediatamente, respondendo esta mensagem, e que o original desta 
 mensagem e de seus anexos, bem como toda e qualquer cópia e/ou impressão 
 realizada a partir destes, sejam permanentemente apagados e/ou destruídos. 
 Informações adicionais sobre nossa empresa podem ser obtidas no site 
 http://sobre.uol.com.br/.

 NOTICE: The information contained in this e-mail and any attachments thereto 
 is CONFIDENTIAL and is intended only for use by the recipient named herein 
 and may contain legally privileged and/or secret information. If you are not 
 the e-mail´s intended recipient, you are hereby notified that any 
 dissemination, distribution or copy of this e-mail, and/or any attachments 
 thereto, is strictly prohibited. Please immediately notify the sender 
 replying to the above mentioned e-mail address, and permanently delete and/or 
 destroy the original and any copy of this e-mail and/or its attachments, as 
 well as any printout thereof. Additional information about our company may be 
 obtained through the site http://www.uol.com.br/ir/.

 -
 Take Surveys. Earn Cash. Influence the Future of IT
 Join SourceForge.net's Techsay panel and you'll get the chance to share your
 opinions on IT  business topics through brief surveys - and earn cash
 http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when reporting 
 any issue.
 ::: Messages without supporting info will risk being sent to /dev/null


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Using $ in password

2006-12-19 Thread Rob Brown
I ran into a similar issue when using the check_snmp plugin and my
ultimate solution was to write a small wrapper shell script with the
password hard coded in it and call that as your check command.
Example one line script called check_httpp
check_http -p $1 -I $2 -t 120 -a username:passsword

then define your check command:
define command{
 command_namecheck_httpp
 command_line$USER1$/check_httpp $ARG1$ $HOSTADDRESS$
}

then define your service:
define service{
 use generic-service
host_name   
service_description YYY
contact_groups  ZZZ
check_commandcheck_httpp!61002
}



On 12/13/06, Kareem Mahgoub [EMAIL PROTECTED] wrote:
 Thanks again for every one replied to me.
 I tired all what was suggested, yet nothing is working.
 I want to add something, when I add quotes on the password and run the
 command and it worked but when nagios parse the config file it didn't work,
 showing on the web interface HTTP WARNING: HTTP/1.0 401 Unauthorized
 Changing the password will be my last option cause there are hundreds of
 devices with password having the $ character
 Thanks all for your help
 - Original Message -
 From: Mike Holloway [EMAIL PROTECTED]
 To: nagios-users@lists.sourceforge.net
 Sent: Wednesday, December 13, 2006 12:07 AM
 Subject: Re: [Nagios-users] Using $ in password



 In checkcommands.cfg, try wrapping ARG3 in single quotes on the
 command_line.  From experience, you are better off changing the
 password to not use a $ character.


 # 'check_httpp' command definition
 define command{
  command_namecheck_httpp
  command_line$USER1$/check_http -p $ARG1$ -I $ARG2$ -t
 120 -a '$ARG3$'


 -mike






 On Dec 12, 2006, at 1:49 PM, Kareem Mahgoub wrote:

  Thanks for all who replied to me, but unfortunately the three
  solutions
  didn't work.
  Kindly if anyone has other ideas, post it.
  Regards
  - Original Message -
  From: [EMAIL PROTECTED]
  To: nagios-users@lists.sourceforge.net
  Sent: Tuesday, December 12, 2006 9:28 PM
  Subject: Re: [Nagios-users] Using $ in password
 
 
  Normally, doubling the $ sign and eclosing the statement in single
  quotes
  works fine.
 
  check_command check_httpp!61002!192.168.1.4!:'33$$77'
 
 
  __
  __
  ___
 
  Jason Shein
  Network Administrator – Linux Systems
  Iovate Health Sciences Inc.
  5100 Spectrum Way
  Mississauga, ON L4W 5S2
  ( 905 ) - 678 - 3119   x 3136
  1 - 888 - 334 - 4448,x 3136 (toll-free)
  [EMAIL PROTECTED]
 
  Customer Service. Collaboration. Innovation. Efficiency.
  Iovate's Information Technology Team
 
 
  __
  __
  ___
 
  CONFIDENTIALITY NOTICE:
  THIS ELECTRONIC MAIL TRANSMISSION IS PRIVILEGED AND CONFIDENTIAL
  AND IS
  INTENDED ONLY FOR THE REVIEW OF THE PARTY TO WHOM IT IS ADDRESSED.
  THE INFORMATION CONTAINED IN THIS E-MAIL IS CONFIDENTIAL AND IS
  DISCLOSED
  TO YOU UNDER THE EXPRESS UNDERSTANDING THAT YOU WILL NOT DISCLOSE IT
  OR ITS CONTENTS TO ANY THIRD PARTY WITHOUT THE EXPRESS WRITTEN
  CONSENT
  OF AN AUTHORIZED OFFICER OF IOVATE HEALTH SCIENCES SERVICES INC.
  IF YOU
  HAVE
  RECEIVED THIS TRANSMISSION IN ERROR, PLEASE IMMEDIATELY RETURN IT
  TO THE SENDER.
 
  __
  __
  ___
 
 
 
  Kareem Mahgoub [EMAIL PROTECTED]
  Sent by: [EMAIL PROTECTED]
  12/12/2006 02:10 PM
 
  To
  nagios-users@lists.sourceforge.net
  cc
 
  Subject
  [Nagios-users] Using $ in password
 
 
 
 
 
 
  Dear All,
 
  I have a simple question. What should I do to include the
  character $ as
  one
  of the password characters in the check command?
 
  Here are some required information:
 
  1- OS FC 5
 
  2- kernel 2.6.18-1
 
  3- Nagios version 2.5
 
  4- The command definition
 
  # 'check_httpp' command definition
  define command{
  command_namecheck_httpp
  command_line$USER1$/check_http -p $ARG1$ -I $ARG2$ -t
  120 -a
  $ARG3$
  }
  5- The service configuration
 
  define service{
  use generic-service
  host_name   
  service_description YYY
  contact_groups  ZZZ
  check_command check_httpp!61002!192.168.1.4!:33$77
  }
  When I execute the command manually it works.
 
  Any help will be much appreciated
 
  Regards
 
 
  -
  
  Take Surveys. Earn Cash. Influence the Future of IT
  Join SourceForge.net's Techsay panel and you'll get the chance to
  share
  your
  opinions on IT  business topics through brief surveys - and earn
  cash
  http://www.techsay.com/default.php?
  

Re: [Nagios-users] Server/Router with more than 1 interface

2006-12-19 Thread Herman (ISTD)
Hi Hugo, thank you very much for your suggestion.

Currently, I do the same thing with :

define command{
command_namecheck_interface
command_line$USER1$/check_snmp_cisco_ifstatus -H
$HOSTADDRESS$ -C $ARG1$ -i $ARG2$
}

However, is it possible to change HOST status in Status Map to DOWN if
one of the ip address could not be contacted ?

Best Regards,

Herman


 -Original Message-
 From: [EMAIL PROTECTED] [mailto:nagios-users-
 [EMAIL PROTECTED] On Behalf Of Hugo van der Kooij
 Sent: Tuesday, December 19, 2006 9:01 PM
 To: Nagios Users mailinglist
 Subject: Re: [Nagios-users] Server/Router with more than 1 interface
 
 On Tue, 19 Dec 2006, Herman (ISTD) wrote:
 
  I have some servers and routers that have more than 1 interface, and
  each interface has IP Address. I would like define a host is alive
if
  all of its interface ip address are available via check_ping.
 
  So I define as following, but if I shut down the interface with ip
  10.254.8.2. Nagios still think the host as alive, since Nagios just
  detect the IP 10.16.20.120 as the indicator of the host. Any
workaround
  to make Nagios aware of both IP Addresses/IP Address ? Any
suggestion
  will be highly appreciated.
 
 This is one of those rare cases where host dependencies may come into
 play.
 
 But I would add the interfaces as PING service to the host or use SNMP
to
 check the real interface status. Something like:
 
 # 'check_snmp_int' command definition define command{
  command_namecheck_snmp_int
  command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C
 $ARG1$ -n $ARG2$
  }
 
 Getting the proper interface names can be a daunting task with some
 routers.
 
 Hugo.
 
 --
   [EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
   This message is using 100% recycled electrons.
 


-
 Take Surveys. Earn Cash. Influence the Future of IT
 Join SourceForge.net's Techsay panel and you'll get the chance to
share
 your
 opinions on IT  business topics through brief surveys - and earn cash

http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDE
V
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when
 reporting any issue.
 ::: Messages without supporting info will risk being sent to /dev/null

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] New condiguration not loading

2006-12-19 Thread John Longland
Hi all !!!

Just saw something very weird for the first time since I have installed my
distributed
Nagios system ( 2.4 ). ( Running on CentOS4.0 )

I needed to take out 2 hosts. But after taking them out of my minimal.cfg
and
restarting nagios, I still see that these hosts are been scanned !
Do I need to remove another file ??

Thanks !
John
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT  business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null