Re: [Nagios-users-br] CFG maintenance in graphical mode
NagioSQL seemed like a good idea to me. Nagat is older, but I didn't like it much...

Regards,
Edgar

On 14/02/07, Jose Oliveira [EMAIL PROTECTED] wrote:

Folks,

Have you been using any tool for maintaining the Nagios configuration files in graphical mode (web-based or not) that actually works?

Thanks
--
Regards,
JGeraldo

--
Nagios-users-br@lists.sourceforge.net mailing list
https://lists.sourceforge.net/lists/listinfo/nagios-users-br
Wiki: http://nagios-br.sf.net/wiki
Re: [Nagios-users] Checking Windows counter via nagios, nsclient++, and nrpe
Does anyone have the source code for checkcounter.exe, or know where it came from?

-Robert

Carsten Koop wrote:

Hi all,

If you're using NSClient++, you can check any Windows performance counter with the following command:

./check_nt -H 127.0.0.1 -p 1234 -v COUNTER -l '\\Terminaldienste\\Sitzungen Insgesamt'

(You have to look up the correct counter name with the Windows Performance Monitor.)

Best regards,
Carsten Koop

James Whittington [EMAIL PROTECTED] wrote (14.02.2007 17:00, to Robert Wolfe and Nagios-users@lists.sourceforge.net):
Subject: Re: [Nagios-users] Checking Windows counter via nagios, nsclient++, and nrpe

checkCounter can check any performance counter object in Windows, as far as I know. I found that if you are using NSClient++, the command "nsclient++ CheckSystem listpdh" dumps all the counters it finds. I looked for the process I wanted to watch to find the right syntax for the process and instance.

James Whittington
Systems Engineer
VC3, Inc.

From: Robert Wolfe
Sent: Wednesday, February 14, 2007 10:34 AM
To: James Whittington; [EMAIL PROTECTED]
Subject: RE: [Nagios-users] Checking Windows counter via nagios, nsclient++, and nrpe

Can this also be used to monitor memory use by a process/program?

From: [EMAIL PROTECTED] On Behalf Of James Whittington
Sent: Wednesday, February 14, 2007 10:30 AM
To: [EMAIL PROTECTED]
Subject: [Nagios-users] Checking Windows counter via nagios, nsclient++, and nrpe

Hello all,

I have a Nagios 2 system used for monitoring several application environments, and I'm looking for the best way to monitor a Windows 2003 process. I already have NSClient++ in place, and I'm using the checkCounter command through NRPE. The main issue at this point is that the counter in question returns a value in scientific notation, which isn't very readable for non-scientists.

Here's the command and response:

./check_nrpe -H 192.168.197.183 -p 5666 -c checkCounter -a "Counter:PrivBytes=\\Process(ProductATasksService)\\Private Bytes" ShowAll
OK: PrivBytes: 2.38641e+08

Is there an option to convert the format to something human-readable? If not, I might have to write something to convert it, which might be a bad idea anyway, since I don't see performance data coming from the plugin either, and I write the performance data to a SQL backend. I figured I would toss the question out to the list to see if others have run into this before.

Thanks,
James Whittington

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
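One way to deal with the scientific-notation value is to post-process the plugin output in a small wrapper. The following is only a sketch, not a feature of NSClient++ or check_nrpe: the function names human_readable and rewrite_output are hypothetical, and it assumes the value is a byte count, as in the PrivBytes example above. It also appends the raw value as Nagios performance data after the "|" separator, since the thread notes that none is emitted.

```python
import re

UNITS = ["B", "KiB", "MiB", "GiB", "TiB"]


def human_readable(num_bytes: float) -> str:
    """Format a byte count using binary (1024-based) units."""
    value = float(num_bytes)
    for unit in UNITS:
        # Stop at the first unit that keeps the number below 1024,
        # or at the largest unit we know about.
        if abs(value) < 1024.0 or unit == UNITS[-1]:
            return f"{value:.1f} {unit}"
        value /= 1024.0


def rewrite_output(line: str) -> str:
    """Turn 'OK: PrivBytes: 2.38641e+08' into readable text plus perfdata."""
    match = re.match(r"^(\w+): (\w+): ([0-9.eE+-]+)$", line.strip())
    if match is None:
        return line  # leave anything unexpected untouched
    status, label, raw = match.groups()
    value = float(raw)
    # Text before '|' is for humans; 'label=valueB' after it is perfdata.
    return f"{status}: {label}: {human_readable(value)} | {label}={value:.0f}B"


print(rewrite_output("OK: PrivBytes: 2.38641e+08"))
```

A wrapper like this would sit between check_nrpe and Nagios and would have to pass the plugin's exit code through unchanged; that part is left out here.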
[Nagios-users] FW: Reports using NDOutils?
-----Original Message-----
From: nagios-users On Behalf Of Marcel Mitsuto Fucatu Sugano
Sent: 14 February 2007 19:27

On Wed, 2007-02-14 at 09:58 -0800, Trask wrote:

Are there any projects, addons, or home-made scripts out there that people are using that pull data from the NDO output and create reports? I've done a good bit of searching and haven't found anything, but it seems like such a logical thing to have that I figure someone has done this already.

I am waiting for the same sort of thing as well. I am doing some research around NDOUtils too, because I'll need Nagios to watch over Service Level Agreement thresholds.

I have a PHP script which gets information from the NDOUtils MySQL tables to display machine status on a web page (we have a home-grown script which provides a single-page display of our farm of 800+ servers). I also plan to write scripts that will get plain-text output from the MySQL tables for use when administrators do not have access to the Nagios web pages.

(snip)

2) Is it crazy to think I can keep *all* the NDO data forever? (~500 hosts / 6000+ srvcs)

Well, considering that only state changes matter, it isn't that crazy.

The only place where I have had to do anything is the logentries table, which (in our case) has written more records than MySQL allows and sometimes generates MySQL errors. Deleting old entries solves the problem (I have a script that deletes entries more than 6 weeks old).

(snip)

Jonathan Wheeler
e-Science Centre
Rutherford Appleton Laboratory
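The six-week purge described above could be sketched roughly as follows. This is an illustration under stated assumptions, not the script mentioned in the message: the table name logentries and the column name logentry_time are assumptions (check the actual NDOUtils schema, which normally uses a table prefix), and an in-memory SQLite database stands in for MySQL so that the example is self-contained.

```python
import sqlite3
from datetime import datetime, timedelta

TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def purge_old_logentries(conn, now, max_age_days=42):
    """Delete log entries older than max_age_days; return how many were removed."""
    cutoff = (now - timedelta(days=max_age_days)).strftime(TIME_FORMAT)
    # Table and column names are assumptions; adapt them to the real schema.
    cur = conn.execute("DELETE FROM logentries WHERE logentry_time < ?", (cutoff,))
    conn.commit()
    return cur.rowcount


# Demo: an in-memory SQLite table standing in for the real MySQL table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE logentries (logentry_time TEXT, logentry_data TEXT)")
now = datetime(2007, 2, 15, 12, 0, 0)
rows = [
    ((now - timedelta(days=50)).strftime(TIME_FORMAT), "old entry"),
    ((now - timedelta(days=1)).strftime(TIME_FORMAT), "recent entry"),
]
conn.executemany("INSERT INTO logentries VALUES (?, ?)", rows)
removed = purge_old_logentries(conn, now)
print(removed)
```

Against MySQL the same DELETE would go through a MySQL client library, whose placeholder syntax may differ from SQLite's "?"; running it from cron keeps the table bounded.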
[Nagios-users] Plugin time out
Hi guys

I'm fairly new to Nagios and in need of some assistance with the following: how would I go about extending the plugin timeout to 5 minutes? There is a lot of traffic on our clients' lines, which makes the 10-second timeout too short for them to respond.

Following is the mail output from Nagios:

***** Nagios *****
Notification Type: PROBLEM
Host: xxx.xxx.co.za
State: DOWN
Address: x.x.x.x
Info: CRITICAL - Plugin timed out after 10 seconds

Any help will be much appreciated.
Re: [Nagios-users] Plugin time out
Hi Sean,

If it's a standard plugin (from nagios-plugins), you can normally use the -t option in your check command definition to set the timeout to 300 seconds (i.e. 5 minutes). E.g.

/usr/local/nagios/libexec/check_ping -H [IP address] -w 1000,20% -c 2000,50% -t 300

Andy.

Sean Thomas wrote:

(snip) How would I go about extending the plugin time for this to 5 min. (snip)

--
Andy Shellam
NetServe Support Team
[Nagios-users] Nagios and DHCP
Hi guys

Here is an example of my define host entries:

define host{
    use                     generic-host
    host_name               random.random.org
    alias                   Random Server
    address                 xxx.xxx.xxx.xxx
    parents                 random
    check_command           check-host-alive
    max_check_attempts      20
    notification_interval   60
    notification_period     24x7
    notification_options    d,u,r
}

How do I configure the address line to work with a client that is given an address via DHCP?
Re: [Nagios-users] Nagios and DHCP
On Thu, 2007-02-15 at 14:10 +0200, Sean Thomas wrote:

(snip) How do I configure the address line to work with a client that is given an address via DHCP?

I believe the host would have to have a static IP to get this to work. Otherwise, you would have to change these entries each time the IP address changed. I have a router at home and run Nagios, and have it set so that anything at 192.168.1.100 and above is a dynamic IP and everything below it is static. The two boxes that I monitor have static IP addresses so that Nagios (amongst other things) can work properly.
Re: [Nagios-users] Nagios and DHCP
Sean Thomas wrote:

(snip) How do I configure the address line to work with a client that is given an address via DHCP?

We use DNS together with DHCP for our clients (and actually for some of the servers as well). The idea is that the DHCP client reports its hostname so that our own DNS server is updated. In the host's address field we use the host name, not the IP address. In DHCP/DNS lingo this is called Dynamic DNS Update (DDNS).

Hope this helps
/Lars
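To illustrate the DDNS approach, the small sketch below renders a host definition whose address directive holds a DNS name instead of an IP address. The helper render_host is hypothetical and only generates text; the directive values are taken from the example host definition in this thread. It assumes the name resolves on the Nagios server when the check plugin runs, so a DNS outage would make the host check fail.

```python
def render_host(host_name, alias, address, template="generic-host"):
    """Render a Nagios host definition; 'address' may be a DNS name."""
    directives = [
        ("use", template),
        ("host_name", host_name),
        ("alias", alias),
        ("address", address),  # a name kept current by DDNS, not a fixed IP
        ("check_command", "check-host-alive"),
        ("max_check_attempts", "20"),
    ]
    body = "\n".join(f"    {key:<24}{value}" for key, value in directives)
    return "define host{\n" + body + "\n}"


definition = render_host(
    host_name="random.random.org",
    alias="Random Server",
    address="random.random.org",
)
print(definition)
```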
Re: [Nagios-users] AIX installation
Hi Akash,

I have compiled and installed Nagios on AIX 5.1L in the past, though from memory I believe we had to retrieve all the header files to do it.

Thanks,
Gavin

Akash Jhamb wrote:

Hi

I want to install Nagios on AIX. I understand that it was designed for Linux but is compatible with UNIX variants as well. I want to know specifically whether it will work with AIX version 5.2.0.0.

Many thanks
Akash
Re: [Nagios-users] check_smtp socket timeout problem
Please always respond on list so that others benefit from your experience and to maximize your chances of finding a solution.

-----Original Message-----
From: Lane, Jim [EMAIL PROTECTED]
Sent: Wednesday, February 14, 2007 9:39 AM
To: Marc Powell
Subject: RE: [Nagios-users] check_smtp socket timeout problem

Marc: thanks for this. I've tried it as the nagios user and I get the same result. Also, I can telnet to port 25 and quit as quickly as I can type. It can't be anything like greet_pause, because I don't know enough about sendmail to have enabled anything like that.

Are you using the latest version of the plugins? There was a bug a long time ago that prevented the plugin from successfully quitting in some circumstances. If you're familiar with the usage of strace or truss, the output around where the plugin sends the QUIT command might be informative.

--
Marc
Re: [Nagios-users] check_smtp socket timeout problem
It now seems the problem must have been with sendmail on the monitored system. I was modifying debug options and restarting the sendmail daemon in hopes of getting useful info. After a couple of restarts, all of a sudden the plug-in started to work properly. Beats me!

-Jim

-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Marc Powell
Sent: February 15, 2007 9:28 AM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] check_smtp socket timeout problem

(snip)
[Nagios-users] R: R: packet loss -25% with check_ping
I do not have fping installed on my HP-UX server; I use the standard ping. Any other ideas?

Marco

-----Original Message-----
From: Michael Gale [EMAIL PROTECTED]
Sent: Wednesday, 14 February 2007 19:06
To: Marco Borsani
Cc: 'Morris, Patrick'; 'Ben O'Hara'; nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] R: packet loss -25% with check_ping

Hey,

Do you have fping installed? I remember some requirement about having fping; if you are missing it, maybe that is your problem.

Michael

Marco Borsani wrote:

I have this problem:
1) only with that client (I am monitoring about 300 servers)
2) sometimes, not always

Marco

-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Morris, Patrick
Sent: Tuesday, 13 February 2007 18:52
To: Ben O'Hara
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] packet loss -25% with check_ping

Ah, good point. This has come up on the list before. I believe some versions of the check_ping plugin had a bug that would send an extra ping. I believe it's been fixed in the latest plugin release.

-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Ben O'Hara
Sent: Tuesday, February 13, 2007 9:09 AM
To: Morris, Patrick
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] packet loss -25% with check_ping

On 2/13/07, Morris, Patrick [EMAIL PROTECTED] wrote:

This is the situation:
- Nagios version 1.4
- Nagios plug-in version 1.3.1

Performing a check_ping against a Red Hat Enterprise Linux 4.0 host, sometimes the answer is:

PING UNKNOWN - Packet loss = -25%, RTA = 14.00 ms

Normally that server (like the others) answers correctly. Do you have any idea?

You have some packet loss on your network.

Or is that packet generation ;-) Notice it's -25% loss.

Ben

--
Michael Gale
Red Hat Certified Engineer
Network Administrator
Pason Systems Corp.
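The negative figure is consistent with the "extra ping" explanation given in the thread: if loss is computed as (sent - received) / sent, one reply more than expected yields a negative percentage. The sketch below works through that arithmetic; the packet counts (4 expected, 5 counted) are an assumption chosen to reproduce -25%, not values taken from the check_ping source.

```python
def packet_loss_percent(sent: int, received: int) -> float:
    """Packet loss as a percentage of the packets assumed to have been sent."""
    if sent <= 0:
        raise ValueError("sent must be positive")
    return (sent - received) / sent * 100.0


# Normal cases: every reply arrives, or one reply is lost.
print(packet_loss_percent(4, 4))
print(packet_loss_percent(4, 3))

# Assumed buggy case: the plugin expects 4 replies but counts 5.
print(packet_loss_percent(4, 5))
```

A value below zero can therefore be treated as a sign that the sent and received counts disagree, rather than as a real network condition.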
[Nagios-users] NSClient - ERROR:Malformed request or internal error.
Hi all

I am monitoring about 300 servers and 800 services with Nagios 1.4.1 and NSClient 2.0.1.

check_nt with CPULOAD, on ONLY one client (Windows 2000), gives me the following answer:

NSClient - ERROR:Malformed request or internal error. Check EventLog:None25

Do you have any idea?

Marco Borsani
Technical Operation
ITnet S.r.l.
[Nagios-users] Passive checks
Hi All...

I seem to have a problem with passive checks. When I look at the Service Detail window in Nagios, all my passive checks are pending, with a status of "Service is not scheduled to be checked". When I manually submit a trap using submit_check_result, the event shows in the log, but the status doesn't change.

Here is what I submit:

/usr/local/nagios/libexec/eventhandlers/submit_check_result (servername) Short term alloc failed 1 $3 requests for short term memory could not be granted on server $1

Below is an example of my service:

define service{
    host_name               (server name)
    service_description     Short term alloc failed
    is_volatile             1
    max_check_attempts      1
    normal_check_interval   1
    retry_check_interval    1
    passive_checks_enabled  1
    check_command           check-host-alive
    check_period            none
    notification_interval   31536000
    notification_period     24x7
    notification_options    w,u,c,r
    contact_groups          it-dept
}

Any help would be greatly appreciated.

Thanks,
Kevin
[Nagios-users] Scheduling queue falling behind.
I'm not sure this is the right place to ask this, but I've been trying to figure out why the scheduling queue on my Nagios server is always 20-30 minutes behind the current time. What I mean by that is that the next check item is 20-30 minutes past the current time.

I have 49 servers and 3 checks per server (ping, a process check, and a passive syslog-ng check). I've tried tweaking all the settings for scheduling checks and only seem to make it worse. I'm currently running version 2.5. I didn't think it would run into problems with this few checks, since I've seen installations with 300+ servers and 1000+ service checks. Am I just missing something?

Thanks,
--
Robert Stafford
Systems Engineer
[EMAIL PROTECTED]
[Nagios-users] Quick Question
I have been working with Nagios for a few years now, but I have never really understood how Nagios decides to retry host checks. I understand that service checks have a check and retry interval, and that you can hard-code a check interval for host checks (if you want to kill your performance...), but when Nagios detects that a host has entered a problem state, how often does it retry the host check between each check attempt?

Currently I am running Nagios 2.5 on CentOS 4.4.

For my real-world issue, I have some Nortel VPNs that seem somewhat slow to renegotiate their connections. I currently have all hosts using a template that has 5 max check attempts. It seems that the only way to delay a hard-down notification for these VPNs would be to increase max_check_attempts for these hosts. The only problem is that I have no way of correlating the retry check interval with max check attempts to determine my acceptable detected downtime for these hosts before issuing notifications.

The other option would be to remove the host check command for these VPNs and just alert on a check_icmp service that is configured the way I want it.

-Doug
Re: [Nagios-users] Passive checks.... Got It
When executing the submit_check_result, quotes need to be around the trap name and plugin output. Like so... /usr/local/nagios/libexec/eventhandlers/submit_check_result (servername) Short term alloc failed 1 $3 requests for short term memory could not be granted on server $1 Nagios then recognized the command and changed the status and sent a notification. Kevin Andy Shellam (Mailing Lists) [EMAIL PROTECTED] 2/15/2007 11:40 AM Assuming this is Nagios 2.x, you don't have active_checks_enabled 1 in your service definition (active checks are not enabled.) Andy. Kevin DaSilva wrote: Hi All... I seem to have a problem with passive checks. When I look at the Services Detail window in Nagios, all my passive checks are pending, with a status of Service is not scheduled to be checked. I manually submit a trap using the submit_check_result, the event shows in the log, but the status doesn't change. Here is what I submit : /usr/local/nagios/libexec/eventhandlers/submit_check_result (servername) Short term alloc failed 1 $3 requests for short term memory could not be granted on server $1 Below is an example of my service: define service{ host_name (server name) service_description Short term alloc failed is_volatile 1 max_check_attempts 1 normal_check_interval 1 retry_check_interval1 passive_checks_enabled 1 check_command check-host-alive check_periodnone notification_interval 31536000 notification_period 24x7 notification_optionsw,u,c,r contact_groups it-dept } Any help would be greatly appreciated. Thanks, Kevin - Take Surveys. Earn Cash. 
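The quoting matters because the helper script receives its arguments by position, and Nagios itself expects a single semicolon-separated line in its external command file. A minimal Python sketch of that line format follows. It assumes the standard PROCESS_SERVICE_CHECK_RESULT external command; the function names and the default command-file path are illustrative assumptions, not part of Kevin's setup, so substitute the command_file value from your own nagios.cfg.

```python
import time


def passive_result_line(host, service, return_code, output, now=None):
    """Format one PROCESS_SERVICE_CHECK_RESULT line for the Nagios command file."""
    if return_code not in (0, 1, 2, 3):
        raise ValueError("return code must be 0 (OK), 1 (WARNING), 2 (CRITICAL) or 3 (UNKNOWN)")
    timestamp = int(time.time() if now is None else now)
    # A newline would end the command early, so flatten the plugin output.
    output = output.replace("\n", " ")
    # Fields are separated by semicolons; spaces inside a field are fine here,
    # which is why the shell wrapper needs each field passed as ONE quoted argument.
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s\n" % (
        timestamp, host, service, return_code, output)


def submit(line, command_file="/usr/local/nagios/var/spool/nagios.cmd"):
    """Append the line to the command file (assumed path; it is a named pipe)."""
    with open(command_file, "a") as pipe:
        pipe.write(line)


if __name__ == "__main__":
    print(passive_result_line(
        "server1", "Short term alloc failed", 1,
        "3 requests for short term memory could not be granted on server server1"), end="")
```

Without the quotes, the shell would split "Short term alloc failed" into four separate arguments, so the service description and return code would land in the wrong positions.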
Re: [Nagios-users] Quick Question
-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Doug Woodgate
Sent: Thursday, February 15, 2007 11:13 AM
To: Nagios Users
Subject: [Nagios-users] Quick Question

> I have been working with nagios for a few years now, but I have never really understood how Nagios decides to retry host checks. I understand that service checks have a check and retry interval and you can hard code a check interval for host checks (if you want to kill your performance...), but when nagios detects that a host has entered a problem state, how often does it retry the host check between each check attempt.

Immediately after the previous attempt, up to max_check_attempts. There is no significant delay between checks.

> Currently I am running Nagios 2.5 on CentOS 4.4.
>
> For my real-world issue, I have some Nortel VPNs that seem somewhat slow to re-negotiate their connections. I currently have all hosts using a template that has 5 max check attempts. It seems that the only way to delay a Hard-Down notification for these VPNs would be to increase the max check attempts for these hosts. The only problem is that I have no way of correlating the retry check interval with max check attempts to determine my acceptable detected downtime for these hosts before issuing notifications.

Escalations might be useful here. If the host has been down for some amount of time, an escalated notification could be sent out. You probably don't want to just increase max_check_attempts, as that only keeps Nagios from doing other things for a longer time.

> The other option would be to remove the host-check command for these VPNs and just alert on a service for check_icmp that is configured the way I want it.

That's what we do.

--
Marc
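For the service-based approach, the time before a hard state (and so before a notification) can be estimated from the service's own directives. The sketch below is plain arithmetic, assuming the usual soft/hard state logic: the first failed check counts as attempt 1, and each later attempt follows after retry_check_interval. The function names are illustrative, and the estimate ignores check execution time and scheduling latency.

```python
def seconds_until_hard_state(max_check_attempts, retry_check_interval, interval_length=60):
    """Time from the first failed check to the hard state, in seconds.

    interval_length is the nagios.cfg setting that converts interval units
    to seconds (60 by default, so intervals are in minutes).
    """
    if max_check_attempts < 1:
        raise ValueError("max_check_attempts must be at least 1")
    return (max_check_attempts - 1) * retry_check_interval * interval_length


def worst_case_detection_seconds(max_check_attempts, retry_check_interval,
                                 normal_check_interval, interval_length=60):
    """Add the longest wait for the first failing check after the outage begins."""
    return (normal_check_interval * interval_length
            + seconds_until_hard_state(max_check_attempts, retry_check_interval, interval_length))


if __name__ == "__main__":
    # Example values only: 5 attempts, 1-minute retries, 5-minute normal interval.
    print(seconds_until_hard_state(5, 1))
    print(worst_case_detection_seconds(5, 1, 5))
```

With these example values, a VPN that renegotiates within four minutes of the first failed check would recover before the service reaches a hard state.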
Re: [Nagios-users] Scheduling queue falling behind.
On 2/15/07, Robert Stafford [EMAIL PROTECTED] wrote:
> I'm not sure this is the right place to ask this, but I've been trying to figure out why the scheduling queue on my nagios server is always 20-30 minutes behind the current time. What I mean by that is the next check item is 20-30 minutes past the current time. I have 49 servers and 3 checks per server (ping, a process check, and syslog-ng (passive)). I've tried tweaking all the settings for scheduling checks and only seem to make it worse. I'm currently running version 2.5. I didn't think that it would run into problems with this few since I've seen 300+ servers with 1000+ service checks. Am I just missing something?

Are you reporting these checks passively to another server?
Re: [Nagios-users] Scheduling queue falling behind.
On 2/15/07, Trask [EMAIL PROTECTED] wrote:
> Are you reporting these checks passively to another server?

Nope. Everything is on one box. I do have syslog-ng installed and doing passive checks to Nagios on the machine. I've tried setting max_concurrent_checks to 0 and changing it up through 100, but it doesn't seem to make a difference.

--
Robert Stafford
Systems Engineer - Xpressbet
Re: [Nagios-users] Scheduling queue falling behind.
> Nope. Everything is on one box. I do have syslog-ng installed and doing passive to nagios on the machine. I've tried setting max_concurrent_checks to 0 and changing it up through 100 but it doesn't seem to make a difference.

Did Nagios ever run on time, or has it always been like this?

Verify the time on the Nagios machine.

What sort of latencies are you seeing when you look at Performance Info?

What load are you seeing (run top)?

Run something like the following command to see how many Nagios-related processes are running (this assumes the directory where Nagios is installed includes the word "nagios"). If you don't have watch, just run the command a bunch of times manually or put it in a never-ending while loop. Watch it for a few minutes (or get fancy and capture/graph the output) and get a feel for the numbers and any patterns you see.

    watch -n 1 'ps auxw | grep nagios | wc -l'

Reply with your nagios.cfg.
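If watch is not available, the same sampling can be sketched in Python. This is only an illustration of the ps/grep/wc pipeline above: the function names are made up for the example, and unlike the shell pipeline it skips the ps header and any grep helper process, so its counts will usually be one lower than what the pipeline reports.

```python
import subprocess
import time


def count_matching(ps_output, needle="nagios"):
    """Count ps lines that mention needle, skipping the header and grep itself."""
    count = 0
    for line in ps_output.splitlines()[1:]:  # the first line is the ps column header
        if needle in line and "grep" not in line:
            count += 1
    return count


def sample_once():
    """Run ps once and return the number of matching processes."""
    result = subprocess.run(["ps", "auxw"], capture_output=True, text=True, check=True)
    return count_matching(result.stdout)


if __name__ == "__main__":
    # Print one timestamped sample per second until interrupted with Ctrl-C.
    while True:
        print(time.strftime("%H:%M:%S"), sample_once())
        time.sleep(1)
```

Redirecting the output to a file gives a simple series that can be graphed later to spot bursts of check processes.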
Re: [Nagios-users] Scheduling queue falling behind.
-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Trask
Sent: Thursday, February 15, 2007 12:45 PM
To: Robert Stafford
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Scheduling queue falling behind.

[suggestions removed]

> Reply with your nagios.cfg.

The output of

    /path/to/bin/nagios -s /path/to/etc/nagios.cfg

will also be informative.

--
Marc
Re: [Nagios-users] Scheduling queue falling behind.
Hey folks, this is Trevor here.

Back when I had a lot of Nagios deployments for clients, I ran into these issues more than once. I had about 25-30 servers, and all the checks on these servers would be way behind the actual scheduled time. The delay would vary from a few minutes to hours. I never got around to solving the problem. I thought I had a bad config, but I had the same settings as Robert here. I was baffled as to why Nagios was delaying the checks. The box wasn't topping out and the network wasn't an issue either.

I have seen such issues when running Nagios with a large number of metrics being monitored, 400-500 in total.

I'm not sure if this helps, but this is precisely what I went through too.

Thanks and Regards
Trevor

On 2/15/07, Trask [EMAIL PROTECTED] wrote:
> On 2/15/07, Robert Stafford [EMAIL PROTECTED] wrote:
> > I'm not sure this is the right place to ask this, but I've been trying to figure out why the scheduling queue on my nagios server is always 20-30 minutes behind the current time.
>
> Are you reporting these checks passively to another server?
Re: [Nagios-users] Scheduling queue falling behind.
> I'm not sure this is the right place to ask this, but I've been trying to figure out why the scheduling queue on my nagios server is always 20-30 minutes behind the current time. What I mean by that is the next check item is 20-30 minutes past the current time. I have 49 servers and 3 checks per server (ping, a process check, and syslog-ng (passive)). I've tried tweaking all the settings for scheduling checks and only seem to make it worse. I'm currently running version 2.5. I didn't think that it would run into problems with this few since I've seen 300+ servers with 1000+ service checks. Am I just missing something?

You don't have a lot of hosts down, do you? That can also cause backups in service checks.

Otherwise, you're right: that number of checks shouldn't be an issue. I'm looking at a box right now that's monitoring over 1,000 hosts with about 11,000 service checks, and the next scheduled check is always shown at the current time. It really does sound like the time on your Nagios box may just be out of sync.
[Nagios-users] Fwd: Scheduling queue falling behind.
also

    /usr/local/nagios/bin/nagios -s /usr/local/nagios/etc/nagios.cfg

    Nagios 2.5
    Copyright (c) 1999-2006 Ethan Galstad (http://www.nagios.org)
    Last Modified: 07-13-2006
    License: GPL

    Warning: Contact group 'Adam' is not used in any host/service definitions or host/service escalations!
    Warning: Contact group 'Elaine' is not used in any host/service definitions or host/service escalations!
    Warning: Contact group 'SA_Page' is not used in any host/service definitions or host/service escalations!
    Warning: Contact group 'admins' is not used in any host/service definitions or host/service escalations!

    Projected scheduling information for host and service checks is listed below.
    This information assumes that you are going to start running Nagios with your
    current config files.

    HOST SCHEDULING INFORMATION
    ---------------------------
    Total hosts:                     49
    Total scheduled hosts:           0
    Host inter-check delay method:   SMART
    Average host check interval:     0.00 sec
    Host inter-check delay:          0.00 sec
    Max host check spread:           5 min
    First scheduled check:           N/A
    Last scheduled check:            N/A

    SERVICE SCHEDULING INFORMATION
    ------------------------------
    Total services:                     142
    Total scheduled services:           107
    Service inter-check delay method:   SMART
    Average service check interval:     339.25 sec
    Inter-check delay:                  2.80 sec
    Interleave factor method:           SMART
    Average services per host:          2.90
    Service interleave factor:          3
    Max service check spread:           5 min
    First scheduled check:              Thu Feb 15 11:22:52 2007
    Last scheduled check:               Thu Feb 15 11:27:50 2007

    CHECK PROCESSING INFORMATION
    ----------------------------
    Service check reaper interval:      10 sec
    Max concurrent service checks:      100

    PERFORMANCE SUGGESTIONS
    -----------------------
    I have no suggestions - things look okay.

---------- Forwarded message ----------
From: Robert Stafford [EMAIL PROTECTED]
Date: Feb 15, 2007 11:18 AM
Subject: Re: [Nagios-users] Scheduling queue falling behind.
To: Trask [EMAIL PROTECTED]

Performance-wise the system isn't hit very hard (2-processor Xeon system). Load averages over the past few days float between 0.25 and 0.65.

Time matches up with all my servers and with the machine time in Nagios itself.

Checking on how many Nagios processes are running, it seems to stay around 5-8 processes related to Nagios. I see mainly 1 nagios server process and 3 or so send-syslog services. If I spam-refresh top I'll see 2-5 nsca processes pop up every once in a while, but they last less than half a second.

Wow, the latencies are bad:

    Metric                  Min.        Max.         Average
    Check Execution Time:   0.00 sec    0.56 sec     0.133 sec
    Check Latency:          0.00 sec    761.00 sec   716.944 sec
    Percent State Change:   0.00%       0.00%        0.00%

nagios.cfg (took out all commented-out fields):

    log_file=/usr/local/nagios/var/nagios.log
    cfg_file=/usr/local/nagios/etc/checkcommands.cfg
    cfg_file=/usr/local/nagios/etc/contact_templates.cfg
    cfg_file=/usr/local/nagios/etc/contactgroups.cfg
    cfg_file=/usr/local/nagios/etc/contacts.cfg
    cfg_file=/usr/local/nagios/etc/escalations.cfg
    cfg_file=/usr/local/nagios/etc/extended_host_info.cfg
    cfg_file=/usr/local/nagios/etc/extended_host_info_templates.cfg
    cfg_file=/usr/local/nagios/etc/host_templates.cfg
    cfg_file=/usr/local/nagios/etc/hostgroups.cfg
    cfg_file=/usr/local/nagios/etc/hosts_hosts.cfg
    cfg_file=/usr/local/nagios/etc/misccommands.cfg
    cfg_file=/usr/local/nagios/etc/service_templates.cfg
    cfg_file=/usr/local/nagios/etc/servicegroups.cfg
    cfg_file=/usr/local/nagios/etc/services.cfg
    cfg_file=/usr/local/nagios/etc/time_periods.cfg
    resource_file=/usr/local/nagios/etc/private/resource.cfg
    temp_file=/usr/local/nagios/var/nagios.tmp
    status_file=/usr/local/nagios/var/status.log
    aggregate_status_updates=1
    status_update_interval=15
    nagios_user=nagios
    nagios_group=nagios
    enable_notifications=1
    execute_service_checks=1
    accept_passive_service_checks=1
    execute_host_checks=0
    accept_passive_host_checks=0
    enable_event_handlers=1
    log_rotation_method=d
    log_archive_path=/usr/local/nagios/var/archives
    check_external_commands=1
    command_check_interval=-1
    command_file=/usr/local/nagios/var/spool/nagios.cmd
    downtime_file=/usr/local/nagios/var/nagiosdowntime.log
    comment_file=/usr/local/nagios/var/nagioscomment.log
    lock_file=/tmp/nagios.lock
    retain_state_information=1
    state_retention_file=/usr/local/nagios/var/nagiosstatus.sav
    retention_update_interval=60
    use_retained_program_state=1
    use_retained_scheduling_info=0
    use_syslog=1
    log_notifications=1
    log_service_retries=1
    log_host_retries=1
    log_event_handlers=1
    log_initial_states=0
    log_external_commands=1
    log_passive_checks=1
    sleep_time=1
    service_inter_check_delay_method=s
    max_service_check_spread=5
    service_interleave_factor=s
    max_concurrent_checks=100
    service_reaper_frequency=10
    max_host_check_spread=5
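The 2.80 sec inter-check delay reported by nagios -s can be reproduced from the other figures in that output. The sketch below is a reading of how the "smart" method appears to behave, not a copy of the Nagios source: it assumes the delay is the average check interval divided by the number of scheduled services, capped so that all initial checks fit inside max_service_check_spread. The function name is illustrative.

```python
def smart_inter_check_delay(avg_interval_sec, scheduled_services, max_spread_min):
    """Estimate the 'smart' service inter-check delay in seconds."""
    if scheduled_services <= 0:
        return 0.0
    # Spread one full check cycle evenly across all scheduled services...
    smart = avg_interval_sec / scheduled_services
    # ...but never so thinly that the initial checks exceed the max spread window.
    cap = max_spread_min * 60.0 / scheduled_services
    return min(smart, cap)


if __name__ == "__main__":
    # Figures taken from the nagios -s output above.
    delay = smart_inter_check_delay(339.25, 107, 5)
    print("%.2f sec" % delay)
```

Here the uncapped value would be about 3.17 sec, and the 5-minute spread limit brings it down to the reported figure. A delay of under 3 seconds between checks is far from the 700+ seconds of observed latency, which suggests the scheduling parameters themselves are not the bottleneck.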
Re: [Nagios-users] Scheduling queue falling behind.
Morris, Patrick wrote:
> I'm not sure this is the right place to ask this, but I've been trying to figure out why the scheduling queue on my nagios server is always 20-30 minutes behind the current time.

I ran into something like this not that long ago: host and service check latencies were very high, around 15-20 minutes.

After some research, the suggested fix was to modify check_host_alive and related commands so that they only use a single packet per check. Add -p 1 as a flag to your ping-type check commands.

Once I followed this advice, my average latency went from 900 seconds or so to usually less than a second. We monitor about 250-300 devices; 95% of them have only a single service check (usually SNMP).

--
Jason Byrns
Production Manager
System Administrator
http://www.MicroLnk.com/
402-328-8600 ext. 653
Re: [Nagios-users] Scheduling queue falling behind.
I had this problem, and Jason describes one workaround for it (making host checks complete as fast as possible). My solution was to disable all host checks (my latency was still too high even with fast host checks).

My understanding is that Nagios must complete the host checks of a down host before ANY service checks on any host are performed. If a host is down and your checks are configured to run over X number of minutes Y times, then you are stalled for that long. If you have multiple hosts down, then you can just multiply your latency.

Prove this to yourself by going to each configured host that is down and disabling its host checks. Your service checks should come right back up to time.

Randal

-----Original Message-----
From: [EMAIL PROTECTED] On Behalf Of Jason Byrns
Sent: Thursday, February 15, 2007 11:36 AM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Scheduling queue falling behind.

> After some research, the suggested fix was to modify the check_host_alive and related commands. Change them so they only use a single packet to check. Add -p 1 as a flag to your ping-type check commands.
>
> Once I followed this advice, my average latency went from 900 seconds or so, to usually less than a second.
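Randal's "multiply your latency" description can be turned into a rough estimate. The sketch below only models what he describes, assuming host checks run one after another and block service checks while they run, as reported for Nagios 2.x in this thread. The per-attempt durations in the example are hypothetical, chosen to contrast a multi-packet ping with a single-packet one.

```python
def host_check_stall_seconds(down_hosts, max_check_attempts, seconds_per_attempt):
    """Rough time spent in serial host checks for hosts that are down."""
    if min(down_hosts, max_check_attempts) < 0 or seconds_per_attempt < 0:
        raise ValueError("inputs must be non-negative")
    return down_hosts * max_check_attempts * seconds_per_attempt


if __name__ == "__main__":
    # Hypothetical timings: 10 s per attempt with several packets, 2 s with one.
    slow = host_check_stall_seconds(down_hosts=3, max_check_attempts=5, seconds_per_attempt=10)
    fast = host_check_stall_seconds(down_hosts=3, max_check_attempts=5, seconds_per_attempt=2)
    print("multi-packet stall: %d s, single-packet stall: %d s" % (slow, fast))
```

The stall recurs each time a service on a down host triggers a host check, so even a modest per-pass figure can add up to minutes of latency over a check cycle.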
Re: [Nagios-users] Scheduling queue falling behind.
Hello,

You have probably read this page many times already, but it's always smart to read it again and find ways to do things better.

http://nagios.sourceforge.net/docs/2_0/tuning.html

Sjaak

Robert Stafford wrote:
> I'm not sure this is the right place to ask this, but I've been trying to figure out why the scheduling queue on my nagios server is always 20-30 minutes behind the current time. What I mean by that is the next check item is 20-30 minutes past the current time.
[Nagios-users] spending time with nagios to finetune
Hello Nagios users.

We run about 50 hosts with approximately 400 services to check (web service, apache, mail, mysql, mysql_replica, ftp, load, procs, disks, mailq, temp, backups). We spend a lot of time fine-tuning Nagios, nearly 15-60 minutes every day, and we still get 5 to 20 notifications a day.

I have some questions about how you run Nagios:

How much time do you spend per day or week running Nagios?

How many notifications a day do you get?

Must every notification be a real alert, or are 80% of yours also just an annoying mobile phone ring?

It would be interesting to hear your ideas on how you run Nagios.

Nagios has helped me a lot with uptime on my servers. No more disk-full errors, no more unknown downtime, and happier customers. It's better to spend time on Nagios than on fixing problems.

Thanks in advance.
Sjaak
Re: [Nagios-users] Scheduling queue falling behind.
Now I feel like a newbie!! Parallelizing checks was disabled.

On 2/15/07, Sjaak Nabuurs [EMAIL PROTECTED] wrote:
> Hello
>
> You did read this page maybe many times but it's always smart to read again and again and find ways to do things smarter.
>
> http://nagios.sourceforge.net/docs/2_0/tuning.html
>
> Sjaak

--
Robert Stafford
Systems Engineer - Xpressbet
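Robert does not say which setting was involved. Assuming it was the per-service parallelize_check directive from Nagios 2.x object definitions, a small script can list the service definitions that turn it off. This is a minimal sketch with illustrative function names; it reads each define service block on its own and does not resolve template inheritance through "use".

```python
import re

# Matches "define service{ ... }" but not servicegroup, serviceescalation, etc.
DEFINE_SERVICE = re.compile(r"define\s+service\s*\{(.*?)\}", re.DOTALL)


def parse_services(text):
    """Return one dict of directive -> value per service definition."""
    services = []
    for body in DEFINE_SERVICE.findall(text):
        directives = {}
        for raw in body.splitlines():
            line = raw.split(";", 1)[0].strip()  # drop trailing ';' comments
            if not line or line.startswith("#"):
                continue
            parts = line.split(None, 1)
            directives[parts[0]] = parts[1].strip() if len(parts) > 1 else ""
        services.append(directives)
    return services


def non_parallel_services(text):
    """Names of services (or templates) that set parallelize_check to 0."""
    return [s.get("service_description") or s.get("name", "?")
            for s in parse_services(text)
            if s.get("parallelize_check") == "0"]


if __name__ == "__main__":
    import sys
    for path in sys.argv[1:]:
        with open(path) as handle:
            for name in non_parallel_services(handle.read()):
                print("%s: %s" % (path, name))
```

Run it against service_templates.cfg and services.cfg; a hit on a template matters most, since every service that uses the template inherits the setting.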
Re: [Nagios-users] Request new functionality: Off Hours state.
> What we need / would like is a per-service and per-host configuration option that allows a host or service to enter an "Off hours" state in the CGI displays. (Or perhaps there should also be a global option for this?)

The way we do this is via a small modification to the status.cgi script, so that there is an extra filter option which only displays things currently within their alerting window (it's an extra flag in the bitfield options to serviceprops and hostprops).

So, with this flag set, the screen will only show items which are currently able to send out alerts.

Works well for us. I can send a copy of the modified status.c if you want, but we're using Nagios 1.4...

Steve

--
Steve Shipway
ITSS, University of Auckland
(09) 3737 599 x 86487
[EMAIL PROTECTED]
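The idea of an extra bit in the serviceprops filter can be illustrated with a short Python sketch. It is not Steve's patch and not the real status.cgi code: the flag value, the function names, and the simplified hour-based alerting window are all hypothetical, chosen only to show how such a filter bit could hide items that are outside their notification period.

```python
# Hypothetical flag value; the real CGI defines its own bit assignments.
IN_NOTIFICATION_WINDOW = 1 << 20


def in_window(hour, start_hour, end_hour):
    """True if hour falls inside [start_hour, end_hour), allowing wrap past midnight."""
    if start_hour <= end_hour:
        return start_hour <= hour < end_hour
    return hour >= start_hour or hour < end_hour


def visible_services(services, props, hour):
    """Apply the filter: with the flag set, keep only services able to alert now."""
    if not props & IN_NOTIFICATION_WINDOW:
        return list(services)
    return [s for s in services if in_window(hour, s["start"], s["end"])]


if __name__ == "__main__":
    services = [
        {"name": "nightly-backup", "start": 22, "end": 6},   # alerts overnight only
        {"name": "web", "start": 0, "end": 24},              # alerts around the clock
    ]
    for s in visible_services(services, IN_NOTIFICATION_WINDOW, hour=12):
        print(s["name"])
```

At noon with the flag set, only the round-the-clock service is listed; without the flag, both are shown, which matches the default behaviour of the status display.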