Re: [Nagios-users] Return Code 127
Kaplan, Andrew H. wrote: I checked the commands.cfg file and modified the command syntax of check_nrpe from /usr/lib/nagios/plugins/check_nrpe to $USER1$/check_nrpe. The result was I am no longer getting the 127 error, but I am now faced with another issue. The new error that I am seeing is: CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. I checked the HP-UX 11.11 box and the syslog.log file had the following entry: Host 0.0.0.0 is not allowed to talk to us! It seems there's a bug in HP-UX or you have some extremely odd firewall installed. This is supposed to print the ip-address of the connecting host. I checked the nrpe.cfg file and under SERVER ADDRESS, I have nrpe bound to client's ip address, and under ALLOWED HOSTS, I have several addresses, including the Nagios 2.6 server, that are permitted to talk to NRPE on the client. Quick and dirty hack: Add 0.0.0.0 to the list of allowed_hosts. This will make it work but will most likely also make it possible to query the host from any other server on the network. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Schedule Downtime w/o authentication requirement
Subin Shakya wrote: I want to configure nagios in such a way that it does not require any kind of authorization/authentication including 'guest'. How can I do that? I do not know where to go in Nagios config to make this happen. TIA, Symuser =0D Right now when I select a Host to schedule a downtime I get: =0D Sorry Dave, I can't let you do that... It seems that you have chosen to not use the authentication functionality of the CGIs. I don't want to be personally responsible for what may happen as a result of allowing unauthorized users to issue commands to Nagios,so you'll have to disable this safeguard if you are really stubborn and want to invite trouble. Read the section on CGI authentication in the HTML documentation to learn how you can enable authentication and why you should want to. Nagios is more polite than I am. RTFM and do some thinking and you'll find the answer. On a sidenote; I know the answer but I'm not interested in giving it to you when you haven't even bothered trying to find it for yourself. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios monitoring bacula - service dependencies
NOBODY KNOWS ??? Jeronimo Zucco escreveu: Hi, all. I have problem using service dependencies. I have a backup server (bacula), and I check status of backup through a mysql query ( http://darcs.complete.org/debian/bacula.upstream/examples/nagios.txt ) on server bacula using nrpe in server bacula. I have many others servers with I do backups in bacula (like server1 below), then I check status of backup of this server, with this definition: define service{ use generic-service ; Name of service template to use nameTEMPLATE-REMOTE-BACULA-BACKUP service_description Backup is_volatile 0 check_period24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval1 notification_optionsw,u,c,r notification_interval 0 notification_period 24x7 register0 } define service{ use TEMPLATE-REMOTE-BACULA-BACKUP host_nameserver1 contact_groupsadmins register1 check_command check_remote_bacula_server1Job } define servicedependency{ host_name server1 service_description Backup dependent_host_name bacula dependent_service_description PING execution_failure_criteria u,c notification_failure_criteria w,u,c } I configure service dependency because when I shutdown bacula/mysql server, I got many notifications about backups. What is wrong in my definition of service dependency ? I want no notifications when bacula server is down. Thank you. -- Jeronimo Zucco LPIC-1 Linux Professional Institute Certified Núcleo de Processamento de Dados Universidade de Caxias do Sul http://jczucco.blogspot.com - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Performance issues, too
Hi! Recently I have run into the very same performance issues as Daniel Meyer (or so it seems). However, I'm not quite sure about it. Here's the gist of it. Currently, service check latency slowly creeps up. As it is now, it starts out at a little over 1s and after about 12 hours it's in the area of about 90s. It keeps climbing after that. Here's the output of nagios -s: Nagios 2.6 Copyright (c) 1999-2006 Ethan Galstad (http://www.nagios.org) Last Modified: 11-27-2006 License: GPL Warning: Contact group 'Singles-Truppe' is not used in any host/service definitions or host/service escalations! Projected scheduling information for host and service checks is listed below. This information assumes that you are going to start running Nagios with your current config files. HOST SCHEDULING INFORMATION --- Total hosts: 330 Total scheduled hosts: 0 Host inter-check delay method: SMART Average host check interval: 0.00 sec Host inter-check delay: 0.00 sec Max host check spread: 10 min First scheduled check: N/A Last scheduled check:N/A SERVICE SCHEDULING INFORMATION --- Total services: 2836 Total scheduled services: 2836 Service inter-check delay method: SMART Average service check interval: 2225.56 sec Inter-check delay: 0.21 sec Interleave factor method: SMART Average services per host: 8.59 Service interleave factor: 9 Max service check spread: 10 min First scheduled check: Tue Dec 19 11:21:45 2006 Last scheduled check: Tue Dec 19 11:31:47 2006 CHECK PROCESSING INFORMATION Service check reaper interval: 5 sec Max concurrent service checks: Unlimited PERFORMANCE SUGGESTIONS --- I have no suggestions - things look okay. This all looks peachy - I think. What I don't get is this line: Average service check interval: 2225.56 sec It seems to me that this is either a skewed value, stemming from my history of looong latencies (at one point we were beyonf 9000 seconds). *Or* it is indicative of a misconfiguration on my part. If the latter is the case, I'd be eager, nay ecstatic to hear what I did wrong. Here are a few of the config vars that might influence this: sleep_time=0.25 service_reaper_frequency=5 max_concurrent_checks=0 max_host_check_spread=10 host_inter_check_delay_method=s service_interleave_factor=s command_check_interval=1 obsess_over_services=0 aggregate_status_updates=1 status_update_interval=20 Also, here's the output from nagiostats: Nagios Stats 2.6 Copyright (c) 2003-2005 Ethan Galstad (www.nagios.org) Last Modified: 11-27-2006 License: GPL CURRENT STATUS DATA Status File: /var/nagios/status.dat Status File Age: 0d 0h 0m 3s Status File Version: 2.6 Program Running Time: 0d 1h 59m 5s Total Services: 2836 Services Checked: 2836 Services Scheduled: 2758 Active Service Checks:2836 Passive Service Checks: 0 Total Service State Change: 0.000 / 12.370 / 0.007 % Active Service Latency: 0.006 / 10.237 / 0.906 sec Active Service Execution Time:0.047 / 10.159 / 0.180 sec Active Service State Change: 0.000 / 12.370 / 0.007 % Active Services Last 1/5/15/60 min: 477 / 2678 / 2745 / 2754 Passive Service State Change: 0.000 / 0.000 / 0.000 % Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 Services Ok/Warn/Unk/Crit:2814 / 6 / 0 / 16 Services Flapping:0 Services In Downtime: 0 Total Hosts: 330 Hosts Checked:330 Hosts Scheduled: 0 Active Host Checks: 330 Passive Host Checks: 0 Total Host State Change: 0.000 / 0.000 / 0.000 % Active Host Latency: 0.000 / 1.000 / 0.888 sec Active Host Execution Time: 0.030 / 4.059 / 0.112 sec Active Host State Change: 0.000 / 0.000 / 0.000 % Active Hosts Last 1/5/15/60 min: 0 / 12 / 12 / 12 Passive Host State Change:0.000 / 0.000 / 0.000 % Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Hosts Up/Down/Unreach:329 / 1 / 0 Hosts Flapping: 0 Hosts In Downtime:0 Hardware is a dual-2.8GHz Xeon, 2G RAM and a 100 FDX interface. LoadAvg is around 1.6, sometimes gets to 1.9. CPUs are both around 40% idle most of the time. I see about 300 context switches and 500 interrupts per second. The network load is neglible, ditto the packet rate. The way these figures look I don't see a performance problem per se, but maybe I
[Nagios-users] Questions about scheduling
Hi! I have a few questions about scheduling in Nagios. - How does the scheduling queue work? From the docs it seems the whole queue is held up as soon as a host check is necessary. As far as I know, Nagios parallelizes checks, so my question is if the current checking thread is held up only or if all of the checks are stopped immediately? - If the whole set of workers is stopped, this would mean that a failing check would result in an immeidate host check which in turn holds up all the queues until it is complete. Does it really work this way? - How much performance overhead do service dependencies generate? I have quite a few NRPE checks and all of them depend on the NRPE dummy check I always define. Does this stall checking by any considerable amount? Thansk for any help, Tobias - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios monitoring bacula - service dependencies
Jeronimo Zucco wrote: NOBODY KNOWS ??? Lots of people know. The answer is ofcourse that you should read and understand the manual before asking questions here. The Nagios config documentation really is quite extensive. Perhaps you missed this particular snippet when configuring? ---%---%---%--- One important thing to note is that by default, Nagios will use the most current hard state of the service(s) that is/are being depended upon when it does the dependeny checks. If you want Nagios to use the most current state of the services (regardless of whether its a soft or hard state), enable the soft_service_dependencies option. ---%---%---%--- There are lots of other goodies in the docs. Jeronimo Zucco escreveu: Hi, all. I have problem using service dependencies. I have a backup server (bacula), and I check status of backup through a mysql query ( http://darcs.complete.org/debian/bacula.upstream/examples/nagios.txt ) on server bacula using nrpe in server bacula. I have many others servers with I do backups in bacula (like server1 below), then I check status of backup of this server, with this definition: define service{ use generic-service ; Name of service template to use nameTEMPLATE-REMOTE-BACULA-BACKUP service_description Backup is_volatile 0 check_period24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval1 notification_optionsw,u,c,r notification_interval 0 notification_period 24x7 register0 } define service{ use TEMPLATE-REMOTE-BACULA-BACKUP host_nameserver1 contact_groupsadmins register1 check_command check_remote_bacula_server1Job } define servicedependency{ host_name server1 service_description Backup dependent_host_name bacula dependent_service_description PING execution_failure_criteria u,c notification_failure_criteria w,u,c } I configure service dependency because when I shutdown bacula/mysql server, I got many notifications about backups. What is wrong in my definition of service dependency ? I want no notifications when bacula server is down. Thank you. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Questions about scheduling
On Tue, 19 Dec 2006, Tobias Klausmann wrote: - How does the scheduling queue work? From the docs it seems the whole queue is held up as soon as a host check is necessary. As far as I know, Nagios parallelizes checks, so my question is if the current checking thread is held up only or if all of the checks are stopped immediately? - If the whole set of workers is stopped, this would mean that a failing check would result in an immeidate host check which in turn holds up all the queues until it is complete. Does it really work this way? Judging from the past months traffic on this mailinglist I am inclined to say: yes. I think you will find it interresting to delve a bit into the archives of this mailinglist. Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Performance issues, too
Thanks for an excellently detailed problem report, missing only the Nagios version and system type/version info. I've got some comments and followup questions. See below. Tobias Klausmann wrote: Hi! Recently I have run into the very same performance issues as Daniel Meyer (or so it seems). However, I'm not quite sure about it. Here's the gist of it. Currently, service check latency slowly creeps up. As it is now, it starts out at a little over 1s and after about 12 hours it's in the area of about 90s. It keeps climbing after that. Here's the output of nagios -s: HOST SCHEDULING INFORMATION --- Total hosts: 330 Total scheduled hosts: 0 No scheduled host-checks. That's good, cause they interfere with normal operations in Nagios. Host inter-check delay method: SMART Average host check interval: 0.00 sec Host inter-check delay: 0.00 sec Max host check spread: 10 min First scheduled check: N/A Last scheduled check:N/A SERVICE SCHEDULING INFORMATION --- Total services: 2836 Total scheduled services: 2836 Service inter-check delay method: SMART Average service check interval: 2225.56 sec This is, as you point out below, quite odd. What's your _longest_ normal_check_interval for services? Inter-check delay: 0.21 sec Interleave factor method: SMART Average services per host: 8.59 Service interleave factor: 9 Max service check spread: 10 min First scheduled check: Tue Dec 19 11:21:45 2006 Last scheduled check: Tue Dec 19 11:31:47 2006 CHECK PROCESSING INFORMATION Service check reaper interval: 5 sec You could lower this to 2 seconds. I've done so on any number of installations and it has no negative impact what so ever, but seems to make Nagios a bit more responsive. Max concurrent service checks: Unlimited I assume you aren't running in to hardware limits on this machine. What's the normal load when you're running nagios? If it's NUM_CPUS then you most likely don't have beefy enough hardware. That's hardly ever the case though, so don't bother looking into it unless all else fails. Nvm, question answered below. Hardware resources should be no problem what so ever. This all looks peachy - I think. What I don't get is this line: Average service check interval: 2225.56 sec It seems to me that this is either a skewed value, stemming from my history of looong latencies (at one point we were beyonf 9000 seconds). Nopes. Nagios doesn't bother reading logfiles when it calculates the scheduling numbers. *Or* it is indicative of a misconfiguration on my part. If the latter is the case, I'd be eager, nay ecstatic to hear what I did wrong. Here are a few of the config vars that might influence this: There has been a slight thinko in Nagios. I don't know if it's still there in recent CVS versions. The thinko is that it (used to?) calculate average service check interval by adding up all normal_check_interval values and dividing it by the number of services configured (or something along those lines), which leads to long latencies. This normally didn't make those latencies increase though. Humm... sleep_time=0.25 service_reaper_frequency=5 max_concurrent_checks=0 max_host_check_spread=10 host_inter_check_delay_method=s service_interleave_factor=s command_check_interval=1 obsess_over_services=0 aggregate_status_updates=1 status_update_interval=20 Also, here's the output from nagiostats: Nagios Stats 2.6 Copyright (c) 2003-2005 Ethan Galstad (www.nagios.org) Last Modified: 11-27-2006 License: GPL CURRENT STATUS DATA Status File: /var/nagios/status.dat Status File Age: 0d 0h 0m 3s Status File Version: 2.6 Program Running Time: 0d 1h 59m 5s Total Services: 2836 Services Checked: 2836 Services Scheduled: 2758 Active Service Checks:2836 Passive Service Checks: 0 All services aren't being scheduled, but you have no passive service checks. Have you disabled checks of 78 services? Total Service State Change: 0.000 / 12.370 / 0.007 % Active Service Latency: 0.006 / 10.237 / 0.906 sec Active Service Execution Time:0.047 / 10.159 / 0.180 sec Active Service State Change: 0.000 / 12.370 / 0.007 % Active Services Last 1/5/15/60 min: 477 / 2678 / 2745 / 2754 Passive Service State Change: 0.000 / 0.000 / 0.000 % Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 Services Ok/Warn/Unk/Crit:2814 / 6 / 0 / 16 Services Flapping:
Re: [Nagios-users] Questions about scheduling
Tobias Klausmann wrote: Hi! I have a few questions about scheduling in Nagios. - How does the scheduling queue work? From the docs it seems the whole queue is held up as soon as a host check is necessary. As far as I know, Nagios parallelizes checks, so my question is if the current checking thread is held up only or if all of the checks are stopped immediately? All the checks are stopped immediately. This is to prevent sending service notifications when hosts go down, so it's sort of inevitable. Nagios 3.0 will support asynchronous host checks, but we're not there yet. - If the whole set of workers is stopped, this would mean that a failing check would result in an immeidate host check which in turn holds up all the queues until it is complete. Does it really work this way? Yes, for reasons stated above. It gets slightly worse if you have a largely linear network (many hosts only have one child), since it also has to check parent hosts until it finds the closest possible up to determine where a possible network outage is happening. - How much performance overhead do service dependencies generate? I have quite a few NRPE checks and all of them depend on the NRPE dummy check I always define. Does this stall checking by any considerable amount? Not much, and not really, respectively :). You might want to enable soft_service_dependencies though. The dependencies only add a couple of internal checks along the lines of if (service-state dependent_state-notification_failure_criteria) /* don't send a notification */ ; -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Performance issues, too
Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: Thanks for an excellently detailed problem report, missing only the Nagios version and system type/version info. I've got some comments and followup questions. See below. I'm running 2.6 now but I had the troubles with 2.5 initially. OS is a Gentoo Linux, Kernel 2.6.15.5 initially, upgrade to 2.6.19 today. --- Total hosts: 330 Total scheduled hosts: 0 No scheduled host-checks. That's good, cause they interfere with normal operations in Nagios. I've read as much. In my seperate mail I had a few questions about it, let's keep them (and the answers there ;) Host inter-check delay method: SMART Average host check interval: 0.00 sec Host inter-check delay: 0.00 sec Max host check spread: 10 min First scheduled check: N/A Last scheduled check:N/A SERVICE SCHEDULING INFORMATION --- Total services: 2836 Total scheduled services: 2836 Service inter-check delay method: SMART Average service check interval: 2225.56 sec This is, as you point out below, quite odd. What's your _longest_ normal_check_interval for services? The longest check_interval is 86400 seconds. It's a SSL cert freshness check. I figured it wasn't necesseary to check that more often than once a day. I also have check_intervals of 3, 5, 15, 20, 30 and 1440 seconds. The latter is also a cert freshness check which is lower because the customer wanted it to be that short. CHECK PROCESSING INFORMATION Service check reaper interval: 5 sec You could lower this to 2 seconds. I've done so on any number of installations and it has no negative impact what so ever, but seems to make Nagios a bit more responsive. I'll give that a try. Max concurrent service checks: Unlimited I assume you aren't running in to hardware limits on this machine. What's the normal load when you're running nagios? If it's NUM_CPUS then you most likely don't have beefy enough hardware. That's hardly ever the case though, so don't bother looking into it unless all else fails. Nvm, question answered below. Hardware resources should be no problem what so ever. I also noticed that HT was disabled on the machine. I've changed that (and added support for it to the kernel) when I did the kernel upgrade today. I'll keep an eye on check latency. *Or* it is indicative of a misconfiguration on my part. If the latter is the case, I'd be eager, nay ecstatic to hear what I did wrong. Here are a few of the config vars that might influence this: There has been a slight thinko in Nagios. I don't know if it's still there in recent CVS versions. The thinko is that it (used to?) calculate average service check interval by adding up all normal_check_interval values and dividing it by the number of services configured (or something along those lines), which leads to long latencies. This normally didn't make those latencies increase though. Humm... Well, the numbers sure do get whacky after a restart: first it skyrockets for about five minutes, then plummets to 1s. From there it works its way up the way I described. Total Services: 2836 Services Checked: 2836 Services Scheduled: 2758 Active Service Checks:2836 Passive Service Checks: 0 All services aren't being scheduled, but you have no passive service checks. Have you disabled checks of 78 services? Oops, forgot to mention that. Yes, a server farm is being rebuilt currently. As I didn't want all the host check timeouts to make matters much, much, worse, I disabled them entirely. Hardware is a dual-2.8GHz Xeon, 2G RAM and a 100 FDX interface. LoadAvg is around 1.6, sometimes gets to 1.9. CPUs are both around 40% idle most of the time. I see about 300 context switches and 500 interrupts per second. The network load is neglible, ditto the packet rate. The way these figures look I don't see a performance problem per se, but maybe I have overlooked a metric that descirbes the usual bottleneck of installations. Are the CPU's 64 bit ones running in 32-bit emulation mode? For intel cpu's, that causes up to 60% performance loss (yes, it really is that bad). Sheesh. Yes, it is a 32-bit installation. I only ever bothered with 64-bit installs on Opteron hardware. I might look into migrating to 64 bits, then. I'm puzzled. Please let me know if you find the answer to this problem. I'll help you debug it as best I can, but please continue posting on-list. Thanks. Sure. I'll first check if the processor upgrade and kernel update helped anything, then try lowering the reaper interval to 2. I'll post the results as soon as I have them. Regards Thanks, Tobias -- Never
Re: [Nagios-users] Questions about scheduling
Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: - How does the scheduling queue work? From the docs it seems the whole queue is held up as soon as a host check is necessary. As far as I know, Nagios parallelizes checks, so my question is if the current checking thread is held up only or if all of the checks are stopped immediately? All the checks are stopped immediately. This is to prevent sending service notifications when hosts go down, so it's sort of inevitable. Nagios 3.0 will support asynchronous host checks, but we're not there yet. Good, good, at least there's a solution on the horizon. - If the whole set of workers is stopped, this would mean that a failing check would result in an immeidate host check which in turn holds up all the queues until it is complete. Does it really work this way? Yes, for reasons stated above. It gets slightly worse if you have a largely linear network (many hosts only have one child), since it also has to check parent hosts until it finds the closest possible up to determine where a possible network outage is happening. I have nearly no parent/child relationships in my setup, as I don't monitor network equipment just hosts (and their services) As such, I only have service deps (more deps than services, actually). - How much performance overhead do service dependencies generate? I have quite a few NRPE checks and all of them depend on the NRPE dummy check I always define. Does this stall checking by any considerable amount? Not much, and not really, respectively :). You might want to enable soft_service_dependencies though. The dependencies only add a couple of internal checks along the lines of if (service-state dependent_state-notification_failure_criteria) /* don't send a notification */ ; Ok, I'll refrain from dropping them, then :) Thanks, Tobias PS: Sorry for hijacking the thread, I missed deleting the in-reply-to-line. Thinking how easy that mistake was, I'll probably be a little less annoyed next time I see someone do it :) -- Never touch a burning system. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Questions about scheduling
On Tue, 19 Dec 2006, Andreas Ericsson wrote: Yes, for reasons stated above. It gets slightly worse if you have a largely linear network (many hosts only have one child), since it also has to check parent hosts until it finds the closest possible up to determine where a possible network outage is happening. Just curious. How will this work if you have something like 5 hosts in line in a parent-child relation? The fastest way would be starting from nagios and work your way to the downed host as the average latency on a check on a live host is much faster then the timeout you get on downed hosts. Considere the map as shown on http://hvdkooij.xs4all.nl/statusmap-20061219.png If nagios detects the ipv6 router in the lab to be down and it has to work it's way up it has to deal with the timeouts on nlams04 and nlams05. If it starts polling the other way around it only has to deal with the host check latency of the switch and the timeout of nlams05. Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Performance issues, too
Tobias Klausmann wrote: Hi! On Tue, 19 Dec 2006, Andreas Ericsson wrote: Thanks for an excellently detailed problem report, missing only the Nagios version and system type/version info. I've got some comments and followup questions. See below. I'm running 2.6 now but I had the troubles with 2.5 initially. OS is a Gentoo Linux, Kernel 2.6.15.5 initially, upgrade to 2.6.19 today. --- Total hosts: 330 Total scheduled hosts: 0 No scheduled host-checks. That's good, cause they interfere with normal operations in Nagios. I've read as much. In my seperate mail I had a few questions about it, let's keep them (and the answers there ;) Host inter-check delay method: SMART Average host check interval: 0.00 sec Host inter-check delay: 0.00 sec Max host check spread: 10 min First scheduled check: N/A Last scheduled check:N/A SERVICE SCHEDULING INFORMATION --- Total services: 2836 Total scheduled services: 2836 Service inter-check delay method: SMART Average service check interval: 2225.56 sec This is, as you point out below, quite odd. What's your _longest_ normal_check_interval for services? The longest check_interval is 86400 seconds. It's a SSL cert freshness check. I figured it wasn't necesseary to check that more often than once a day. I also have check_intervals of 3, 5, 15, 20, 30 and 1440 seconds. The latter is also a cert freshness check which is lower because the customer wanted it to be that short. Try changing the really long intervals to something shorter or commenting them out completely and see what happens. Checking a certificate is not a particularly heavy operation so it doesn't matter much if you run it ever 5 minutes. On the server side it just gets handed out from cache, so it's not heave there either. If you have the various normal_check_interval's specified in templates, try setting them all to 5 minutes and let Nagios run over-night. If this interferes with some fragile services on the network (webservers whose sessions don't expire, fe), disable active checks for those services during the testing period. (yes, this might seem braindead, but I really need to know if this bug is still in Nagios). *Or* it is indicative of a misconfiguration on my part. If the latter is the case, I'd be eager, nay ecstatic to hear what I did wrong. Here are a few of the config vars that might influence this: There has been a slight thinko in Nagios. I don't know if it's still there in recent CVS versions. The thinko is that it (used to?) calculate average service check interval by adding up all normal_check_interval values and dividing it by the number of services configured (or something along those lines), which leads to long latencies. This normally didn't make those latencies increase though. Humm... Well, the numbers sure do get whacky after a restart: first it skyrockets for about five minutes, then plummets to 1s. From there it works its way up the way I described. Are the first checks of things being scheduled with unreasonably long delays? Fe, a check with 3 minute normal_check_interval being scheduled an hour or so into the future. Total Services: 2836 Services Checked: 2836 Services Scheduled: 2758 Active Service Checks:2836 Passive Service Checks: 0 All services aren't being scheduled, but you have no passive service checks. Have you disabled checks of 78 services? Oops, forgot to mention that. Yes, a server farm is being rebuilt currently. As I didn't want all the host check timeouts to make matters much, much, worse, I disabled them entirely. Ah, that explains it then. It shouldn't matter, but unless the experiment I suggested above turns up anything useful, would you mind commenting them out and testing that? Hardware is a dual-2.8GHz Xeon, 2G RAM and a 100 FDX interface. LoadAvg is around 1.6, sometimes gets to 1.9. CPUs are both around 40% idle most of the time. I see about 300 context switches and 500 interrupts per second. The network load is neglible, ditto the packet rate. The way these figures look I don't see a performance problem per se, but maybe I have overlooked a metric that descirbes the usual bottleneck of installations. Are the CPU's 64 bit ones running in 32-bit emulation mode? For intel cpu's, that causes up to 60% performance loss (yes, it really is that bad). Sheesh. Yes, it is a 32-bit installation. I only ever bothered with 64-bit installs on Opteron hardware. I might look into migrating to 64 bits, then. So the CPU's are 64-bits? Humm... 64-bit mode would boost available resources quite a bit, but as you just enabled HT you should now have 3 extra CPU's (Xeon's are dualcore AFAIR) which
Re: [Nagios-users] Performance issues, too
On Tue, 19 Dec 2006, Tobias Klausmann wrote: I'm running 2.6 now but I had the troubles with 2.5 initially. OS is a Gentoo Linux, Kernel 2.6.15.5 initially, upgrade to 2.6.19 today. Same here. Latency-Problems with both 2.5 and 2.6, but on CentOS 4.4 (good that you use gentoo, saves me the time to try it on a heavy optimized gentoo box :) You could lower this to 2 seconds. I've done so on any number of installations and it has no negative impact what so ever, but seems to make Nagios a bit more responsive. I'll give that a try. I've tried that but had some failing checks when i did that. Very strange... I also noticed that HT was disabled on the machine. I've changed that (and added support for it to the kernel) when I did the kernel upgrade today. I'll keep an eye on check latency. I have HT enabled, no effect on the nagios latency problems. Danny -- Q: Gentoo is too hard to install =http://www.cyberdelia.de and I feel like whining. = [EMAIL PROTECTED] A: Please see /dev/null. = (from the gentoo installer FAQ) = \o/ - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Questions about scheduling
Hugo van der Kooij wrote: On Tue, 19 Dec 2006, Andreas Ericsson wrote: Yes, for reasons stated above. It gets slightly worse if you have a largely linear network (many hosts only have one child), since it also has to check parent hosts until it finds the closest possible up to determine where a possible network outage is happening. Just curious. How will this work if you have something like 5 hosts in line in a parent-child relation? The fastest way would be starting from nagios and work your way to the downed host as the average latency on a check on a live host is much faster then the timeout you get on downed hosts. Considere the map as shown on http://hvdkooij.xs4all.nl/statusmap-20061219.png If nagios detects the ipv6 router in the lab to be down and it has to work it's way up it has to deal with the timeouts on nlams04 and nlams05. If it starts polling the other way around it only has to deal with the host check latency of the switch and the timeout of nlams05. In the case you posted on your map, it would indeed be faster to start walking in - out. However, if the closest parent had been up it would have been the other way around. Anyways, I *think* nagios checks in-out. Either way, it's important for a host check to return OK *immediately* when it finds that the host it's checking actually *is* ok, which is why I wrote check_icmp and let it have a check_host mode which does just that. The original default hostcheck (I think it's still the default, btw) would wait a minimum of 5 seconds no matter if the first ping came back ok after 5ms. Since all other checks are stopped, this causes quite a bit of slowdown. When I think about it, it would indeed (especially with check_host) be faster to start unreachability checks with root-hosts and then following children down to the targeted host since we would that way encounter a minimum of host check timeouts. It's programmatically slightly trickier though, as you'd have to walk it backwards from the problem host, push each parent host to a stack and then pop them from that stack when you do the actual checking. I'll look in this and see if a patch is necessary and, if so, if I can come up with one. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Questions about scheduling
On Tue, 19 Dec 2006, Hugo van der Kooij wrote: Just curious. How will this work if you have something like 5 hosts in line in a parent-child relation? The fastest way would be starting from nagios and work your way to the downed host as the average latency on a check on a live host is much faster then the timeout you get on downed hosts. i think it works its way from the failing host up to the nagios server, which would be logical from the point of view that nagios knows the parents of the failed server. Dont think that nagios uses that information to rebuild it into a child list... That might be the solution to why did nagios send me so many notifications today: consider those host relations: nagios - router - router - switch - server1 to server n (up) (up) (down) (dummy) (up...) I had a failed leased line today. The switch in that scenario was up due to a dummy check which always replies ok, the router was down. I got a host down notification for each of the servers Must be because nagios started to check from the servers to the nagios host, and found the switch to be up and runnnig. Danny -- Q: Gentoo is too hard to install =http://www.cyberdelia.de and I feel like whining. = [EMAIL PROTECTED] A: Please see /dev/null. = (from the gentoo installer FAQ) = \o/ - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] start nagios
hello, I configured about manual but at start time Nagios send the following msgs: Permiso denegado. I try start at /etc/rc.d/init.d/nagios start. Thank you for any idea. AC __ Correo Yahoo! Espacio para todos tus mensajes, antivirus y antispam ¡gratis! ¡Abrí tu cuenta ya! - http://correo.yahoo.com.ar- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Performance issues, too
Hi! On Tue, 19 Dec 2006, Daniel Meyer wrote: You could lower this to 2 seconds. I've done so on any number of installations and it has no negative impact what so ever, but seems to make Nagios a bit more responsive. I'll give that a try. I've tried that but had some failing checks when i did that. Very strange... I'm still waiting how the kernel change will work out. I also noticed that HT was disabled on the machine. I've changed that (and added support for it to the kernel) when I did the kernel upgrade today. I'll keep an eye on check latency. I have HT enabled, no effect on the nagios latency problems. I've now setup a little script that puts host and service check latency in an RRD file every five minutes. So far, the curve looks very inconclusive. Regards, Tobias -- Never touch a burning system. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] start nagios
On Tue, 19 Dec 2006, Alicia Calvo wrote: I configured about manual but at start time Nagios send the following msgs: Permiso denegado. I try start at /etc/rc.d/init.d/nagios start. Try looking for permission denied and nagios with your favorite search engine. You did not include any relevant details like: - Nagios version - Install method - Started as user . Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage
Message to all -- I am sorry about the repeated posts. I was simply desperate for an answer, hence my 'panic'. I did not intend to irritate anybody on this matter, and if I have please accept my apologies and assurances that it was completely accidental. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Hugo van der Kooij Sent: Monday, December 18, 2006 7:11 PM To: Nagios Users mailinglist Subject: Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage On Mon, 18 Dec 2006, Marc Powell wrote: -Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Kaplan, Andrew H. Sent: Monday, December 18, 2006 3:18 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage I don't have the answer but frequent and repeated postings generally just irritate people enough to not want to help. It's not like we could have missed it the first, second or third times today to necessitate this fourth. Your urgent problem is not our urgent problem unfortunately. You generally want to allow at least 24 hours before reposting to give everyone a chance to see it. I would suggest more like 72 hours at least. And even then I think one should question if a verbatim repost is usefull. If no one reacted the first time it well be because a posting was unclear in it's intend. If one is in dire need one needs to pick up the tab and hire some nagios consultant. Open Source by no means you get days of support for free. Most of the people around here have mortgages or rent to pay and propably the need for groceries and so on. Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Server/Router with more than 1 interface
Dear all, I have some servers and routers that have more than 1 interface, and each interface has IP Address. I would like define a host is alive if all of its interface ip address are available via check_ping. So I define as following, but if I shut down the interface with ip 10.254.8.2. Nagios still think the host as alive, since Nagios just detect the IP 10.16.20.120 as the indicator of the host. Any workaround to make Nagios aware of both IP Addresses/IP Address ? Any suggestion will be highly appreciated. Thank you. define host{ use generic-host; Name of host template to use host_name NAS-TMMIN-HO alias FTP Server address 10.16.20.120,10.254.8.2 parents HO_Core_SW max_check_attempts 2 check_command check-host-alive contact_groups admins notification_interval 120 notification_optionsd,u,r notification_period 24x7 flap_detection_enabled 1 } - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Server/Router with more than 1 interface
On Tue, 19 Dec 2006, Herman (ISTD) wrote: I have some servers and routers that have more than 1 interface, and each interface has IP Address. I would like define a host is alive if all of its interface ip address are available via check_ping. So I define as following, but if I shut down the interface with ip 10.254.8.2. Nagios still think the host as alive, since Nagios just detect the IP 10.16.20.120 as the indicator of the host. Any workaround to make Nagios aware of both IP Addresses/IP Address ? Any suggestion will be highly appreciated. This is one of those rare cases where host dependencies may come into play. But I would add the interfaces as PING service to the host or use SNMP to check the real interface status. Something like: # 'check_snmp_int' command definition define command{ command_namecheck_snmp_int command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $ARG1$ -n $ARG2$ } Getting the proper interface names can be a daunting task with some routers. Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Use statement
Hi all , Just to confirm . Can I use two use statement when I am defining a service . Like a example : service http{ host_name some name use service definition 1 use service definition 2 Best Regards , Saulo Augusto Silva - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] notifications.cgi
Hello Group. i want see all notification for the special User (work with contact=User) and the Last Week (7 Days). Can notifications.cgi this or have anybody script who make this? Regards, Andreas. smime.p7s Description: S/MIME cryptographic signature - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage
The NRPE issue that has been very problematic has generated another error. I was briefly getting output back from the remote client, but now I am getting the error message: NRPE: Unable to read output. Here is a brief listing of the configuration: Nagios server: Nagios 2.6 NRPE client: 2.5.2 running on an HP-UX 11.11 system Allowed hosts: 0.0.0.0 The remote client uses inetd (without tcp wrappers) and here is the configuration in the inetd.conf file: nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg -inetd The remote client also has a startup script in the /sbin/init.d directory called nrpe. Here is the text of the script: /usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/nrpe.cfg -d I checked the log file on the remote client, and there was no indication of any connection problem. The messages and nagios.log files on the server simply reported the following: UNKNOWN;notify-by-email;NRPE: Unable to read ouput. I have confirmed there is no firewall activated on the system, and that nrpe communication is allowed on the client. From: Steve Shipway [mailto:[EMAIL PROTECTED] Sent: Monday, December 18, 2006 8:24 PM To: Kaplan, Andrew H.; nagios-users@lists.sourceforge.net Subject: RE: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage Sounds like the daemon is comparing the client's IP against the list of permitted connections, and not getting a match. For some reason the query of the source IP on the connection is returning 0.0.0.0 instead of the source IP - maybe you have some special wrapper or intervening agent. I'd suggest running under xinetd or something similar which can do the filtering instead. Steve - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Use statement
On Tue, 19 Dec 2006, Saulo Silva wrote: Just to confirm . Can I use two use statement when I am defining a service . Why do you not use nagios -v to confirm it? It is there for you to verify your config. (http://nagios.sourceforge.net/docs/2_0/verifyconfig.html) Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytesfromdaemon ErrorMessage
The remote client uses inetd (without tcp wrappers) and here is the configuration in the inetd.conf file: nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg -inetd Solaris requires the name of the server program and then the arguments, which include argument 0 (the name of the program). So that would be : nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe /usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg --inetd Not sure if HP-UX inetd.conf works the same way. The remote client also has a startup script in the /sbin/init.d directory called nrpe. Here is the text of the script: /usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/nrpe.cfg -d I'd say that you either want to run as a daemon or run under inetd, not both. Phil Phil Costelloe Technical Consultant [EMAIL PROTECTED] http://www.foundation-it.com Foundation IT Foundation Court Old Street Oare Hermitage Berkshire RG18 9SE Switch: +44 (0) 1635 203700 Helpdesk: +44 (0) 800 0121099 DDI: +44 (0) 1635 203719 Fax: +44 (0) 8700 543537 Mobile: +44 (0) 7884 236299 This message contains confidential information and is intended only for [EMAIL PROTECTED] If you are not nagios-users@lists.sourceforge.net you should not disseminate, distribute or copy this e-mail. Please notify Foundation IT immediately by e-mail if you have received this e-mail by mistake and delete this e-mail from your system. E-mail transmission cannot be guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or contain viruses. Foundation IT therefore does not accept liability for any errors or omissions in the contents of this message, which arise as a result of e-mail transmission. If verification is required please request a hard-copy version. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0bytesfromdaemon ErrorMessage
Hi there -- I checked the inetd.conf file for HP-UX and it differs from Solaris in that it does not require the name of the program after the user field. I did remove nrpe from the file to see if that made a difference, and unfortunately it did not. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Phil Costelloe Sent: Tuesday, December 19, 2006 10:21 AM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0bytesfromdaemon ErrorMessage The remote client uses inetd (without tcp wrappers) and here is the configuration in the inetd.conf file: nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg -inetd Solaris requires the name of the server program and then the arguments, which include argument 0 (the name of the program). So that would be : nrpe stream tcp nowait bin /usr/local/nagios/bin/nrpe /usr/local/nagios/bin/nrpe -c /usr/local/nagios/nrpe.cfg --inetd Not sure if HP-UX inetd.conf works the same way. The remote client also has a startup script in the /sbin/init.d directory called nrpe. Here is the text of the script: /usr/local/nagios/bin/nrpe -n -c /usr/local/nagios/nrpe.cfg -d I'd say that you either want to run as a daemon or run under inetd, not both. Phil Phil Costelloe Technical Consultant [EMAIL PROTECTED] http://www.foundation-it.com Foundation IT Foundation Court Old Street Oare Hermitage Berkshire RG18 9SE Switch: +44 (0) 1635 203700 Helpdesk: +44 (0) 800 0121099 DDI: +44 (0) 1635 203719 Fax: +44 (0) 8700 543537 Mobile: +44 (0) 7884 236299 This message contains confidential information and is intended only for [EMAIL PROTECTED] If you are not nagios-users@lists.sourceforge.net you should not disseminate, distribute or copy this e-mail. Please notify Foundation IT immediately by e-mail if you have received this e-mail by mistake and delete this e-mail from your system. E-mail transmission cannot be guaranteed to be secure or error-free as information could be intercepted, corrupted, lost, destroyed, arrive late or incomplete, or contain viruses. Foundation IT therefore does not accept liability for any errors or omissions in the contents of this message, which arise as a result of e-mail transmission. If verification is required please request a hard-copy version. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Questions about scheduling
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Daniel Meyer Sent: Tuesday, December 19, 2006 6:19 AM To: Nagios Users mailinglist Subject: Re: [Nagios-users] Questions about scheduling On Tue, 19 Dec 2006, Hugo van der Kooij wrote: Just curious. How will this work if you have something like 5 hosts in line in a parent-child relation? The fastest way would be starting from nagios and work your way to the downed host as the average latency on a check on a live host is much faster then the timeout you get on downed hosts. i think it works its way from the failing host up to the nagios server, which would be logical from the point of view that nagios knows the parents of the failed server. Dont think that nagios uses that information to rebuild it into a child list... This presentation, starting on page 19, details current and future host check logic -- http://www.netways.de/de/nagios_konferenz/archiv_2006/programm/nagios_30 _and_beyond/ -- Marc - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] MRTG link to a specific host.
Hello, I have Nagios 2.6 installed and I have mrtg 2.13.2 installed and working. I would like to create a icon or link that within Nagiso that when you click on the link that it will go to the mrtg page that I created. Thanks - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] MRTG link to a specific host.
I have Nagios 2.6 installed and I have mrtg 2.13.2 installed and working. I would like to create a icon or link that within Nagiso that when you click on the link that it will go to the mrtg page that I created. http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#hostextinfo - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Schedule Downtime w/o authentication requirement
Andreas, Thanks for your advice. But, FYI, I have done what I could before I painfully signed up and advertised my issue on this list. By the way, I am novice on Nagios (heck also with Linux) and took me a while to build the Nagios system which I am extremely proud of. What do you think? Can you pass your knowledge to me now? - Subin -Original Message- From: Andreas Ericsson [mailto:[EMAIL PROTECTED] Sent: Tuesday, December 19, 2006 1:27 AM To: Subin Shakya Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Schedule Downtime w/o authentication requirement Subin Shakya wrote: I want to configure nagios in such a way that it does not require any kind of authorization/authentication including 'guest'. How can I do that? I do not know where to go in Nagios config to make this happen. TIA, Symuser =0D Right now when I select a Host to schedule a downtime I get: =0D Sorry Dave, I can't let you do that... It seems that you have chosen to not use the authentication functionality of the CGIs. I don't want to be personally responsible for what may happen as a result of allowing unauthorized users to issue commands to Nagios,so you'll have to disable this safeguard if you are really stubborn and want to invite trouble. Read the section on CGI authentication in the HTML documentation to learn how you can enable authentication and why you should want to. Nagios is more polite than I am. RTFM and do some thinking and you'll find the answer. On a sidenote; I know the answer but I'm not interested in giving it to you when you haven't even bothered trying to find it for yourself. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 === Notice: This e-mail message, together with any attachments, contains information of Symyx Technologies, Inc. or any of its affiliates or subsidiaries that may be confidential, proprietary, copyrighted, privileged and/or protected work product, and is meant solely for the intended recipient. If you are not the intended recipient, and have received this message in error, please contact the sender immediately, permanently delete the original and any copies of this email and any attachments thereto. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] SSL handshake error
So, the daemon end of your NRPE check is a Solaris 10 server. What OS is on the client end? What if you try to connect to localhost on the Solaris 10 machine with check_nrpe ? -- Ludwig Pummer System Administrator, Copart Auto Auctions 707-639-5067 [EMAIL PROTECTED] From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Extra Special Bitter Sent: Tuesday, December 19, 2006 6:45 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] SSL handshake error Last week we installed the NRPE daemon on a Solaris 10 server but have had trouble configuring it to monitor anything more sophisticated than a basic Host Alive check. Remote checks show up in the console with a status of unknown with the error message: CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. Checking the error log on the remote server reveals the following: [EMAIL PROTECTED] dmesg | grep nrpe | tail -1 Dec 19 08:45:40 txopprd1 nrpe[27334]: [ID 813741 daemon.error] Error: Could not complete SSL handshake. 1 Any clues? Thanks in advance... - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Schedule Downtime w/o authentication requirement
Subin, have a look through cgi.cfg -mike On Dec 19, 2006, at 12:30 PM, Subin Shakya wrote: Andreas, Thanks for your advice. But, FYI, I have done what I could before I painfully signed up and advertised my issue on this list. By the way, I am novice on Nagios (heck also with Linux) and took me a while to build the Nagios system which I am extremely proud of. What do you think? Can you pass your knowledge to me now? - Subin -Original Message- From: Andreas Ericsson [mailto:[EMAIL PROTECTED] Sent: Tuesday, December 19, 2006 1:27 AM To: Subin Shakya Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Schedule Downtime w/o authentication requirement Subin Shakya wrote: I want to configure nagios in such a way that it does not require any kind of authorization/authentication including 'guest'. How can I do that? I do not know where to go in Nagios config to make this happen. TIA, Symuser =0D Right now when I select a Host to schedule a downtime I get: =0D Sorry Dave, I can't let you do that... It seems that you have chosen to not use the authentication functionality of the CGIs. I don't want to be personally responsible for what may happen as a result of allowing unauthorized users to issue commands to Nagios,so you'll have to disable this safeguard if you are really stubborn and want to invite trouble. Read the section on CGI authentication in the HTML documentation to learn how you can enable authentication and why you should want to. Nagios is more polite than I am. RTFM and do some thinking and you'll find the answer. On a sidenote; I know the answer but I'm not interested in giving it to you when you haven't even bothered trying to find it for yourself. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 === Notice: This e-mail message, together with any attachments, contains information of Symyx Technologies, Inc. or any of its affiliates or subsidiaries that may be confidential, proprietary, copyrighted, privileged and/or protected work product, and is meant solely for the intended recipient. If you are not the intended recipient, and have received this message in error, please contact the sender immediately, permanently delete the original and any copies of this email and any attachments thereto. -- --- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php? page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Schedule Downtime w/o authentication requirement
No, you didn't. The last link under Configuring Nagios in the documentation table of contents clearly says Configuring authorization for the CGIs. Read it! http://nagios.sourceforge.net/docs/2_0/cgiauth.html Authentication On Secured Web Servers If your web server is located in a secure domain (i.e., behind a firewall) or if you are using SSL, you can define a default username that can be used to access the CGIs. This is done by defining the default_user_name option in the CGI configuration file. By defining a default username that can access the CGIs, you can allow users to access the CGIs without necessarily having to authenticate to the web server.. You may want to use this to avoid having to use basic web authentication, as basic authentication transmits passwords in clear text over the Internet. Important: Do not define a default username unless you are running a secure web server and are sure that everyone who has access to the CGIs has been authenticated in some manner! If you define this variable, anyone who has not authenticated to the web server will inherit all rights you assign to this user! -f On Tue, 19 Dec 2006, Subin Shakya wrote: Date: Tue, 19 Dec 2006 10:30:06 -0800 From: Subin Shakya [EMAIL PROTECTED] To: Andreas Ericsson [EMAIL PROTECTED] Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Schedule Downtime w/o authentication requirement Andreas, Thanks for your advice. But, FYI, I have done what I could before I painfully signed up and advertised my issue on this list. By the way, I am novice on Nagios (heck also with Linux) and took me a while to build the Nagios system which I am extremely proud of. What do you think? Can you pass your knowledge to me now? - Subin -Original Message- From: Andreas Ericsson [mailto:[EMAIL PROTECTED] Sent: Tuesday, December 19, 2006 1:27 AM To: Subin Shakya Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Schedule Downtime w/o authentication requirement Subin Shakya wrote: I want to configure nagios in such a way that it does not require any kind of authorization/authentication including 'guest'. How can I do that? I do not know where to go in Nagios config to make this happen. TIA, Symuser =0D Right now when I select a Host to schedule a downtime I get: =0D Sorry Dave, I can't let you do that... It seems that you have chosen to not use the authentication functionality of the CGIs. I don't want to be personally responsible for what may happen as a result of allowing unauthorized users to issue commands to Nagios,so you'll have to disable this safeguard if you are really stubborn and want to invite trouble. Read the section on CGI authentication in the HTML documentation to learn how you can enable authentication and why you should want to. Nagios is more polite than I am. RTFM and do some thinking and you'll find the answer. On a sidenote; I know the answer but I'm not interested in giving it to you when you haven't even bothered trying to find it for yourself. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 === Notice: This e-mail message, together with any attachments, contains information of Symyx Technologies, Inc. or any of its affiliates or subsidiaries that may be confidential, proprietary, copyrighted, privileged and/or protected work product, and is meant solely for the intended recipient. If you are not the intended recipient, and have received this message in error, please contact the sender immediately, permanently delete the original and any copies of this email and any attachments thereto. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net
Re: [Nagios-users] SSL handshake error
Ludwig, Thanks for the reply. As it turns out, the error went away when I changed my check command from check_unix_disk_no-ssl to check_unix_disk. Who knew? Paul On 12/19/06, Ludwig Pummer [EMAIL PROTECTED] wrote: So, the daemon end of your NRPE check is a Solaris 10 server. What OS is on the client end? What if you try to connect to localhost on the Solaris 10 machine with check_nrpe ? -- Ludwig Pummer System Administrator, Copart Auto Auctions 707-639-5067 [EMAIL PROTECTED] -- *From:* [EMAIL PROTECTED] [mailto: [EMAIL PROTECTED] *On Behalf Of *Extra Special Bitter *Sent:* Tuesday, December 19, 2006 6:45 AM *To:* nagios-users@lists.sourceforge.net *Subject:* [Nagios-users] SSL handshake error Last week we installed the NRPE daemon on a Solaris 10 server but have had trouble configuring it to monitor anything more sophisticated than a basic Host Alive check. Remote checks show up in the console with a status of unknown with the error message: CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. Checking the error log on the remote server reveals the following: [EMAIL PROTECTED] dmesg | grep nrpe | tail -1 Dec 19 08:45:40 txopprd1 nrpe[27334]: [ID 813741 daemon.error] Error: Could not complete SSL handshake. 1 Any clues? Thanks in advance... - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Getting pie charts in host's history
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 My boss wants to be able to look at pie charts when ever we are looking at history for various services. So far I have only been able to get a bar graph that is showing the uptime and down time, but not I have not been able to get a pie chart to display. I am using ubuntu 6.10, with nagios 2.4 Any help or though would be appreciated. Craig -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.5 (MingW32) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFFiFotAOTIJ89W4sIRAgbGAJ4geQOzBNgIurspehrapp3THi8xbgCfUGR9 UaNr+RnEzArd6nsNTasN7fM= =S0Pg -END PGP SIGNATURE- - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Getting pie charts in host's history
Craig Van Tassle wrote: My boss wants to be able to look at pie charts when ever we are looking at history for various services. So far I have only been able to get a bar graph that is showing the uptime and down time, but not I have not been able to get a pie chart to display. I am using ubuntu 6.10, with nagios 2.4 Any help or though would be appreciated. By bar graph I assume you mean the one on 'View Trends For This Service' (which doesnt look very bar-ish when its all one colour ;) ). There is no pie chart in the version I am running (v2.1) nor in the doco (http://nagios.sourceforge.net/docs/2_0/cgis.html). You can either export the data in CSV format from the Availability Report and process it yourself, or write your own CGI. Or do as I do... tell your boss how to get the raw data and they can spend all day making purty charts while you can spend your day doing more productive things like keeping things running. ;) - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] URGENT REPOST: CHECK_NRPE: Received 0 bytes fromdaemon ErrorMessage
I would say that you need to use inetd, OR an init.d script, but not both. If you try to then you'll get errors when they try and start up. Your 'unable to read output' error seems to indicate that you have a timeout. I don't know what plugin you are attempting to run remotely, but check to make sure that the timeout on check_nrpe (A), on the nrpe.cfg file (B) and for the remotely run plugin (C) are set as CBA. Your error message seems to be timeout A. Another possibility is that your check_nrpe is running in non-ssl mode, but your nrpe daemon is running in SSL mode... I notice that you have given the -n option (non-SSL) in your init.d definition, but not in your inetd.conf definition! Are you using -n in your check_nrpe? Pick one and use it throughout... Steve -- Steve Shipway ITSS, University of Auckland (09) 3737 599 x 86487 [EMAIL PROTECTED] - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring streaming
You can use the check_real plugin to test basic connectivity to a WMS server: ./check_real -H hostname -u /industrial.wmv On 12/14/06, Marcel Mitsuto Fucatu Sugano [EMAIL PROTECTED] wrote: Does anyone knows of a plugin to monitor windows media services streamings, by emulating a user who watches a sample movie? I mean, by connecting and evaluating if the server is responding correctly? TIA, Marcel AVISO: A informação contida neste e-mail, bem como em qualquer de seus anexos, é CONFIDENCIAL e destinada ao uso exclusivo do(s) destinatário(s) acima referido(s), podendo conter informações sigilosas e/ou legalmente protegidas. Caso você não seja o destinatário desta mensagem, informamos que qualquer divulgação, distribuição ou cópia deste e-mail e/ou de qualquer de seus anexos é absolutamente proibida. Solicitamos que o remetente seja comunicado imediatamente, respondendo esta mensagem, e que o original desta mensagem e de seus anexos, bem como toda e qualquer cópia e/ou impressão realizada a partir destes, sejam permanentemente apagados e/ou destruídos. Informações adicionais sobre nossa empresa podem ser obtidas no site http://sobre.uol.com.br/. NOTICE: The information contained in this e-mail and any attachments thereto is CONFIDENTIAL and is intended only for use by the recipient named herein and may contain legally privileged and/or secret information. If you are not the e-mail´s intended recipient, you are hereby notified that any dissemination, distribution or copy of this e-mail, and/or any attachments thereto, is strictly prohibited. Please immediately notify the sender replying to the above mentioned e-mail address, and permanently delete and/or destroy the original and any copy of this e-mail and/or its attachments, as well as any printout thereof. Additional information about our company may be obtained through the site http://www.uol.com.br/ir/. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Using $ in password
I ran into a similar issue when using the check_snmp plugin and my ultimate solution was to write a small wrapper shell script with the password hard coded in it and call that as your check command. Example one line script called check_httpp check_http -p $1 -I $2 -t 120 -a username:passsword then define your check command: define command{ command_namecheck_httpp command_line$USER1$/check_httpp $ARG1$ $HOSTADDRESS$ } then define your service: define service{ use generic-service host_name service_description YYY contact_groups ZZZ check_commandcheck_httpp!61002 } On 12/13/06, Kareem Mahgoub [EMAIL PROTECTED] wrote: Thanks again for every one replied to me. I tired all what was suggested, yet nothing is working. I want to add something, when I add quotes on the password and run the command and it worked but when nagios parse the config file it didn't work, showing on the web interface HTTP WARNING: HTTP/1.0 401 Unauthorized Changing the password will be my last option cause there are hundreds of devices with password having the $ character Thanks all for your help - Original Message - From: Mike Holloway [EMAIL PROTECTED] To: nagios-users@lists.sourceforge.net Sent: Wednesday, December 13, 2006 12:07 AM Subject: Re: [Nagios-users] Using $ in password In checkcommands.cfg, try wrapping ARG3 in single quotes on the command_line. From experience, you are better off changing the password to not use a $ character. # 'check_httpp' command definition define command{ command_namecheck_httpp command_line$USER1$/check_http -p $ARG1$ -I $ARG2$ -t 120 -a '$ARG3$' -mike On Dec 12, 2006, at 1:49 PM, Kareem Mahgoub wrote: Thanks for all who replied to me, but unfortunately the three solutions didn't work. Kindly if anyone has other ideas, post it. Regards - Original Message - From: [EMAIL PROTECTED] To: nagios-users@lists.sourceforge.net Sent: Tuesday, December 12, 2006 9:28 PM Subject: Re: [Nagios-users] Using $ in password Normally, doubling the $ sign and eclosing the statement in single quotes works fine. check_command check_httpp!61002!192.168.1.4!:'33$$77' __ __ ___ Jason Shein Network Administrator – Linux Systems Iovate Health Sciences Inc. 5100 Spectrum Way Mississauga, ON L4W 5S2 ( 905 ) - 678 - 3119 x 3136 1 - 888 - 334 - 4448,x 3136 (toll-free) [EMAIL PROTECTED] Customer Service. Collaboration. Innovation. Efficiency. Iovate's Information Technology Team __ __ ___ CONFIDENTIALITY NOTICE: THIS ELECTRONIC MAIL TRANSMISSION IS PRIVILEGED AND CONFIDENTIAL AND IS INTENDED ONLY FOR THE REVIEW OF THE PARTY TO WHOM IT IS ADDRESSED. THE INFORMATION CONTAINED IN THIS E-MAIL IS CONFIDENTIAL AND IS DISCLOSED TO YOU UNDER THE EXPRESS UNDERSTANDING THAT YOU WILL NOT DISCLOSE IT OR ITS CONTENTS TO ANY THIRD PARTY WITHOUT THE EXPRESS WRITTEN CONSENT OF AN AUTHORIZED OFFICER OF IOVATE HEALTH SCIENCES SERVICES INC. IF YOU HAVE RECEIVED THIS TRANSMISSION IN ERROR, PLEASE IMMEDIATELY RETURN IT TO THE SENDER. __ __ ___ Kareem Mahgoub [EMAIL PROTECTED] Sent by: [EMAIL PROTECTED] 12/12/2006 02:10 PM To nagios-users@lists.sourceforge.net cc Subject [Nagios-users] Using $ in password Dear All, I have a simple question. What should I do to include the character $ as one of the password characters in the check command? Here are some required information: 1- OS FC 5 2- kernel 2.6.18-1 3- Nagios version 2.5 4- The command definition # 'check_httpp' command definition define command{ command_namecheck_httpp command_line$USER1$/check_http -p $ARG1$ -I $ARG2$ -t 120 -a $ARG3$ } 5- The service configuration define service{ use generic-service host_name service_description YYY contact_groups ZZZ check_command check_httpp!61002!192.168.1.4!:33$77 } When I execute the command manually it works. Any help will be much appreciated Regards - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?
Re: [Nagios-users] Server/Router with more than 1 interface
Hi Hugo, thank you very much for your suggestion. Currently, I do the same thing with : define command{ command_namecheck_interface command_line$USER1$/check_snmp_cisco_ifstatus -H $HOSTADDRESS$ -C $ARG1$ -i $ARG2$ } However, is it possible to change HOST status in Status Map to DOWN if one of the ip address could not be contacted ? Best Regards, Herman -Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Hugo van der Kooij Sent: Tuesday, December 19, 2006 9:01 PM To: Nagios Users mailinglist Subject: Re: [Nagios-users] Server/Router with more than 1 interface On Tue, 19 Dec 2006, Herman (ISTD) wrote: I have some servers and routers that have more than 1 interface, and each interface has IP Address. I would like define a host is alive if all of its interface ip address are available via check_ping. So I define as following, but if I shut down the interface with ip 10.254.8.2. Nagios still think the host as alive, since Nagios just detect the IP 10.16.20.120 as the indicator of the host. Any workaround to make Nagios aware of both IP Addresses/IP Address ? Any suggestion will be highly appreciated. This is one of those rare cases where host dependencies may come into play. But I would add the interfaces as PING service to the host or use SNMP to check the real interface status. Something like: # 'check_snmp_int' command definition define command{ command_namecheck_snmp_int command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $ARG1$ -n $ARG2$ } Getting the proper interface names can be a daunting task with some routers. Hugo. -- [EMAIL PROTECTED] http://hvdkooij.xs4all.nl/ This message is using 100% recycled electrons. - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDE V ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] New condiguration not loading
Hi all !!! Just saw something very weird for the first time since I have installed my distributed Nagios system ( 2.4 ). ( Running on CentOS4.0 ) I needed to take out 2 hosts. But after taking them out of my minimal.cfg and restarting nagios, I still see that these hosts are been scanned ! Do I need to remove another file ?? Thanks ! John - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null