Re: [Nagios-users] Nagios-3.x Solaris 9 Build Problem
On Tue, Oct 09, 2007 at 09:16:00PM -0400, Richard Aseltine wrote: > # ls -l /opt/nagios/bin/nagios > -rwxrwxr-- 1 nagios nagios 1268772 Oct 9 18:41 /opt/nagios/bin/nagios > > > truss: cannot execute program: /opt/nagios/bin/nagios Unless you're 'nagios' or in the 'nagios' group, you don't have execute permission. 'others' only have read permission, so you can't run it (and truss will give that messagee). Can you run/truss it as nagios? -- Darren Dunham [EMAIL PROTECTED] Senior Technical Consultant TAOShttp://www.taos.com/ Got some Dr Pepper? San Francisco, CA bay area < This line left intentionally blank to confuse you. > - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios-3.x Solaris 9 Build Problem
hmm, what about file /opt/nagios/bin/nagios On Tue, Oct 09, 2007 at 09:16:00PM -0400, Richard Aseltine wrote: > > That's what I was suppose to show instead of the startup > > # ls -l /opt/nagios/bin/nagios > -rwxrwxr-- 1 nagios nagios 1268772 Oct 9 18:41 /opt/nagios/bin/nagios > > > Date: Tue, 9 Oct 2007 20:22:47 -0400 > > From: [EMAIL PROTECTED] > > To: nagios-users@lists.sourceforge.net > > Subject: Re: [Nagios-users] Nagios-3.x Solaris 9 Build Problem > > > > what is the output of > > > > ls -l /opt/nagios/bin/nagios > > > > On Tue, Oct 09, 2007 at 03:46:50PM -0400, Richard Aseltine wrote: > > > > > > Has anyone successfully build Nagios-3.x on Solaris 2.9? I have build > > > previous releases without any problems. I hope someone can point me in > > > the right direction. Here is what happens when I try to start the damon. > > > > > > # /etc/init.d/nagios start > > > -n Starting nagios: > > > Killed > > > CONFIG ERROR! Start aborted. Check your Nagios configuration. > > > > > > # truss /opt/nagios/bin/nagios -v /opt/nagios/etc/nagios.cfg > > > truss: cannot execute program: /opt/nagios/bin/nagios > > > > > > -rwxr-xr-x 1 root root5163 Oct 9 18:42 /etc/init.d/nagios > > > > > > Thanks > > > > > > _ > > > Peek-a-boo FREE Tricks & Treats for You! > > > http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us > > > - > > > This SF.net email is sponsored by: Splunk Inc. > > > Still grepping through log files to find problems? Stop. > > > Now Search log events and configuration files using AJAX and a browser. > > > Download your FREE copy of Splunk now >> http://get.splunk.com/ > > > ___ > > > Nagios-users mailing list > > > Nagios-users@lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > ::: Please include Nagios version, plugin version (-v) and OS when > > > reporting any issue. > > > ::: Messages without supporting info will risk being sent to /dev/null > > > > - > > This SF.net email is sponsored by: Splunk Inc. > > Still grepping through log files to find problems? Stop. > > Now Search log events and configuration files using AJAX and a browser. > > Download your FREE copy of Splunk now >> http://get.splunk.com/ > > ___ > > Nagios-users mailing list > > Nagios-users@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > _ > Climb to the top of the charts! Play Star Shuffle: the word scramble > challenge with star power. > http://club.live.com/star_shuffle.aspx?icid=starshuffle_wlmailtextlink_oct - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios-3.x Solaris 9 Build Problem
That's what I was suppose to show instead of the startup # ls -l /opt/nagios/bin/nagios -rwxrwxr-- 1 nagios nagios 1268772 Oct 9 18:41 /opt/nagios/bin/nagios > Date: Tue, 9 Oct 2007 20:22:47 -0400 > From: [EMAIL PROTECTED] > To: nagios-users@lists.sourceforge.net > Subject: Re: [Nagios-users] Nagios-3.x Solaris 9 Build Problem > > what is the output of > > ls -l /opt/nagios/bin/nagios > > On Tue, Oct 09, 2007 at 03:46:50PM -0400, Richard Aseltine wrote: > > > > Has anyone successfully build Nagios-3.x on Solaris 2.9? I have build > > previous releases without any problems. I hope someone can point me in > > the right direction. Here is what happens when I try to start the damon. > > > > # /etc/init.d/nagios start > > -n Starting nagios: > > Killed > > CONFIG ERROR! Start aborted. Check your Nagios configuration. > > > > # truss /opt/nagios/bin/nagios -v /opt/nagios/etc/nagios.cfg > > truss: cannot execute program: /opt/nagios/bin/nagios > > > > -rwxr-xr-x 1 root root5163 Oct 9 18:42 /etc/init.d/nagios > > > > Thanks > > > > _ > > Peek-a-boo FREE Tricks & Treats for You! > > http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us > > - > > This SF.net email is sponsored by: Splunk Inc. > > Still grepping through log files to find problems? Stop. > > Now Search log events and configuration files using AJAX and a browser. > > Download your FREE copy of Splunk now >> http://get.splunk.com/ > > ___ > > Nagios-users mailing list > > Nagios-users@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > - > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null _ Climb to the top of the charts! Play Star Shuffle: the word scramble challenge with star power. http://club.live.com/star_shuffle.aspx?icid=starshuffle_wlmailtextlink_oct- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios-3.x Solaris 9 Build Problem
what is the output of ls -l /opt/nagios/bin/nagios On Tue, Oct 09, 2007 at 03:46:50PM -0400, Richard Aseltine wrote: > > Has anyone successfully build Nagios-3.x on Solaris 2.9? I have build > previous releases without any problems. I hope someone can point me in > the right direction. Here is what happens when I try to start the damon. > > # /etc/init.d/nagios start > -n Starting nagios: > Killed > CONFIG ERROR! Start aborted. Check your Nagios configuration. > > # truss /opt/nagios/bin/nagios -v /opt/nagios/etc/nagios.cfg > truss: cannot execute program: /opt/nagios/bin/nagios > > -rwxr-xr-x 1 root root5163 Oct 9 18:42 /etc/init.d/nagios > > Thanks > > _ > Peek-a-boo FREE Tricks & Treats for You! > http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us > - > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios 3.0b5 - ePN and perl caching
The problem I am having is with subroutines inside of perl scripts ran under the ePN. If a subroutine is defined the next time the script or similar script is executed the ePN compile fails with a duplicate subroutine reported as the problem. I believe this problem does not exist under the 2.x code. Larry Low 4150 N Drinkwater Blvd., 5th Floor Scottsdale, AZ 85251 E-mail: [EMAIL PROTECTED]' Telesphere Networks, Inc - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_traffic
Hi all!! Im triying to use the check_traffic plugin but the results that im getting are not realistic For example im checking a Giga interfaces that has a lot of traffic going trough and the result is the following: Total RX Bytes: 0.00 MB, Total TX Bytes: 0.00 MB Average Traffic: 0.00 B/s (0.0%) in, 0.00 B/s (0.0%) out Is anyboby using this plugin? Any help? -- Danilo Perdomo Ingeniero de Operaciones NOC VP Operaciones Técnicas Inter Barquisimeto - MSO Ext.: 5393 Tlf.: 58-251-3355393 58-251-9355393 Cel.: 58-414-5750938 www.inter.com.ve - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] escalation.cfg
> -Original Message- > From: [EMAIL PROTECTED] [mailto:nagios-users- > [EMAIL PROTECTED] On Behalf Of Siwei Zhang > Sent: Tuesday, October 09, 2007 5:39 PM > To: nagios-users > Subject: [Nagios-users] escalation.cfg > > Hi there, > > Is it possible to specify the hosts and services using '*' to represent > all hosts and services? > It takes too much time to configure each individual host and service in > escalation.cfg. Any help will be greatly appreciated. http://nagios.sourceforge.net/docs/2_0/templatetricks.html -- Marc - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] escalation.cfg
Hi there, Is it possible to specify the hosts and services using '*' to represent all hosts and services? It takes too much time to configure each individual host and service in escalation.cfg. Any help will be greatly appreciated. Kevin SZ - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] HP SNMP
[EMAIL PROTECTED] wrote: > I am trying to use the check_hpjd plugin from the standard plugins > to monitor a couple of hp laserjet printers > > I am having some difficulty with the jetdirect cards that a few of them > are hooked too. I seem to be walking its MIB tree, and not that of the > printer. Is there a way to kind of "bypass" the jetdirect card of these > printers, so that I can accurately poll the correct snmp data? >From a network point of view the jetdirect card is THE printer. Without the card there is no network attached to the printer. If you can not obtain from the printer what you want to know this way I suggest you drop a feature request at the HP doorstep. Hugo. -- [EMAIL PROTECTED] http://hugo.vanderkooij.org/ Don't meddle in the affairs of sysadmins, for they are subtle and quick to anger. - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] HP SNMP
Hello all, I am trying to use the check_hpjd plugin from the standard plugins to monitor a couple of hp laserjet printers I am having some difficulty with the jetdirect cards that a few of them are hooked too. I seem to be walking its MIB tree, and not that of the printer. Is there a way to kind of "bypass" the jetdirect card of these printers, so that I can accurately poll the correct snmp data? Stephen Valdinger For some reason the email I have subscribed to this list on is no longer allowing me to send messages to it, however I receive messages from it. I will post new messages to the list from now on using this address. begin:vcard n:VALDINGER;STEPHEN fn:STEPHEN VALDINGER tel;cell:740-491-0958 version:2.1 email;internet:[EMAIL PROTECTED] end:vcard - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios-3.x Solaris 9 Build Problem
Has anyone successfully build Nagios-3.x on Solaris 2.9? I have build previous releases without any problems. I hope someone can point me in the right direction. Here is what happens when I try to start the damon. # /etc/init.d/nagios start -n Starting nagios: Killed CONFIG ERROR! Start aborted. Check your Nagios configuration. # truss /opt/nagios/bin/nagios -v /opt/nagios/etc/nagios.cfg truss: cannot execute program: /opt/nagios/bin/nagios -rwxr-xr-x 1 root root5163 Oct 9 18:42 /etc/init.d/nagios Thanks _ Peek-a-boo FREE Tricks & Treats for You! http://www.reallivemoms.com?ocid=TXT_TAGHM&loc=us- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] (no subject)
Has anyone successfully build Nagios-3.x on Solaris 2.9? I have build previous releases without any problems. I hope someone can point me in the right direction. Here is what happens when I try to start the damon. # /etc/init.d/nagios start -n Starting nagios: Killed CONFIG ERROR! Start aborted. Check your Nagios configuration. # truss /opt/nagios/bin/nagios -v /opt/nagios/etc/nagios.cfg truss: cannot execute program: /opt/nagios/bin/nagios -rwxr-xr-x 1 root root5163 Oct 9 18:42 /etc/init.d/nagios Thanks _ Boo! Scare away worms, viruses and so much more! Try Windows Live OneCare! http://onecare.live.com/standard/en-us/purchase/trial.aspx?s_cid=wl_hotmailnews- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] notify contact only once
Terry wrote: > On 10/8/07, Terry <[EMAIL PROTECTED]> wrote: >> On 10/5/07, Aidan Anderson <[EMAIL PROTECTED]> wrote: >>> Terry wrote: Thanks for the reply. Let me be more specific: version: 2.9 OS: centos 5 I have regular contacts set up, me for example. I want to get notified every 30 minutes indefinitely if a service is in a hard state of warning or critical. However, I want another contact to only get notified one time when that hard state is achieved.That's it. >From what I can tell, I can only achieve this through the notification_interval which is only set at the host/service level, not the contact level. If this is true, I will need to create 2 services, each with a different notification_interval and of course apply the different contact groups to each service. Am I correct or is there another way around this? Thanks! On 10/5/07, Aidan Anderson <[EMAIL PROTECTED]> wrote: > Terry wrote: > >> I have a contact that I only want to receive one notification. How >> can I set this up? >> >> >> >>> Hi Terry, >>> >>> I've just posted you another message before seeing this one. You want >>> to use host or service escalations to achieve this. I've briefly >>> explained in the previous post but if you need more help, just shout. >>> >>> Aidan >>> >> Thanks for your reply. It looks like for me to achieve this, I will >> need to create 2 escalations for every service. One for the single >> notification and the other for standard notifications. That seems >> like a lot of configuration. >> > > Anyone have any additional thoughts on this? I am considering writing > a custom email handler that looks for the first notification and > handles it appropriately. That way I don't have to have an > escalation for every single service. > Seems like a sane solution. Besides, it's really not hard to do just that. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New Nagios Plugins website
-Original Message- From: Andreas Ericsson [mailto:[EMAIL PROTECTED] Sent: 09 October 2007 19:54 To: Ton Voon Cc: Welsh,T,Tom,DLS C; [EMAIL PROTECTED]; nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] New Nagios Plugins website Ton Voon wrote: > Thanks Tom, > > We were thinking that the --help pages are different from the man > pages. For instance, dpkg --help or rpm --help gives a list of > options, whereas man dpkg or man rpm gives more commentary and > examples. However, that requires more work. > > If --help on the site is sufficient, we could setup a job that takes > the latest code and populate the web site with it. > I should think --help output as a header for a wiki page on each plugin could be handy. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Hi Guys, That's why I originally created the doc. I just cut and pasted it into our confluence wiki. One page per plugin. Regards Tom - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New Nagios Plugins website
Ton Voon wrote: > Thanks Tom, > > We were thinking that the --help pages are different from the man > pages. For instance, dpkg --help or rpm --help gives a list of > options, whereas man dpkg or man rpm gives more commentary and > examples. However, that requires more work. > > If --help on the site is sufficient, we could setup a job that takes > the latest code and populate the web site with it. > I should think --help output as a header for a wiki page on each plugin could be handy. -- Andreas Ericsson [EMAIL PROTECTED] OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_qmailq.pl or check_mailq
On Tue, 9 Oct 2007, zack kenton wrote: > unfortunately that didnt work...i dont think it needs sudo > privileges...it still returns OK locally, but nrpe gives a critical... See Marc's reply. > on a side note, did you translate the script out of german? Have a look: --- /usr/lib/nagios/plugins/contrib/check_qmailq.pl 2006-12-12 09:03:29.0 -0800 +++ check_qmailq.pl 2007-10-09 10:29:18.0 -0700 @@ -27,11 +27,11 @@ ### CONFIGURATION SECTION my $statcommand = "/var/qmail/bin/qmail-qstat"; -my $queuewarn = 5; # Warning, if more than x mail in Queue -my $queuecrit = 10; # Critical if "--" -my $prewarn = 1;# Warning, if more than x unhandled mails +my $queuewarn = 20; # Warning, if more than x mail in Queue +my $queuecrit = 30; # Critical if "--" +my $prewarn = 5;# Warning, if more than x unhandled mails # (not in Queue -my $precrit = 5;# Critical, if "--" +my $precrit = 10;# Critical, if "--" ### CONFIURATION SECTION END @@ -56,12 +56,12 @@ #sub usage { # printf "\nMissing arguments!\n"; # printf "\n"; -# printf "Printer Server Queue Nagios Plugin\n"; -# printf "monitors jobs in lpr queues\n"; +# printf "qmail Queue Nagios Plugin\n"; +# printf "monitors qmail queue\n"; # printf "usage: \n"; -# printf "check_lpq.pl \n"; +# printf "check_qmailq.pl \n"; # printf "Copyright (C) 2000 Benjamin Schmid\n"; -# printf "check_lpq.pl comes with ABSOLUTELY NO WARRANTY\n"; +# printf "check_qmailq.pl comes with ABSOLUTELY NO WARRANTY\n"; # printf "This programm is licensed under the terms of the "; # printf "GNU General Public License\n(check source code for details)\n"; # printf "\n\n"; @@ -70,7 +70,7 @@ # Just in case of problems, let's not hang Nagios $SIG{'ALRM'} = sub { - print ("ERROR: check_lpq.pl Time-Out $TIMEOUT s \n"); + print ("ERROR: check_qmailq.pl Time-Out $TIMEOUT s \n"); exit $ERRORS{"UNKNOWN"}; }; alarm($TIMEOUT); @@ -100,18 +100,21 @@ $state='CRITICAL' if ($anzq >= $queuecrit); } else { $state='CRITICAL'; - $answer="Keine gueltigte Antwort (Zeile #1) von $statcommand\n"; +# $answer="Keine gueltigte Antwort (Zeile #1) von $statcommand\n"; + $answer="No valid answer (Line #1) from $statcommand\n"; } # Unverarbeite Mails if ($lines[1]=~/^messages in queue but not yet preprocessed: (\d+)/) { my $anzp = $1; - $answer = $answer . " E-Mail(s) nicht ausgeliefert, $anzp unverarbeitet."; +# $answer = $answer . " E-Mail(s) nicht ausgeliefert, $anzp unverarbeitet."; + $answer = $answer . " E-Mail(s) not delivered, $anzp unprocessed."; $state='WARNING' if ($anzp >= $prewarn && $state eq 'UNKNOWN'); $state='CRITICAL' if ($anzp >= $precrit); } else { $state='CRITICAL'; - $answer=$answer . "Keine gueltigte Antwort (Zeile #2) von $statcommand\n"; +# $answer=$answer . "Keine gueltigte Antwort (Zeile #2) von $statcommand\n"; + $answer=$answer . "No valid answer (Line #2) from $statcommand\n"; } $state = 'OK' if ($state eq 'UNKNOWN'); - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_qmailq.pl or check_mailq
> -Original Message- > From: [EMAIL PROTECTED] [mailto:nagios-users- > [EMAIL PROTECTED] On Behalf Of zack kenton > Sent: Tuesday, October 09, 2007 12:50 PM > To: nagios-users@lists.sourceforge.net > Subject: Re: [Nagios-users] check_qmailq.pl or check_mailq > > Hi Jeremy > > unfortunately that didnt work...i dont think it needs sudo privileges...it > still returns OK locally, but nrpe gives a critical...on a side note, did I'll bet it does. For security reasons, mail queues are not usually readable by normal users, only root and/or the mail user. Are you performing your test as the user that NRPE runs as? I suspect that you're testing as root. -- Marc - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] separate webfront
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Donnell Lewis wrote: > Or you could use the distributed monitoring method: > > have a master nagios server (running the web front-end) running at one > locationa and then have another (or multiple) nagios daemons running on > the other boxes that actually check the local machines at that colo and > send passive check results via nsca to the master hosting the frontend. > > -Don You shouldn't even have to be that elaborate, just use an external web server as a proxy (using mod_rewrite and mod_proxy) and restrict things at the external portal (i.e., no commands, namespace exclusions, etc.). Cheers, - -=Tom Nail -BEGIN PGP SIGNATURE- Version: GnuPG v1.2.1 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFHC8FF1zsLRjt/NQ0RAk/zAJwPzYz6k+VyejM8qvD3+56ToI6w1ACfVnS8 1CS0HfTNcCWwku9Vro28Q/U= =7ArF -END PGP SIGNATURE- - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Comments between versions 2 and 3 of Nagios
Hi Martine, This is a known issue between Nagios 2 and 3 - it's also the case for downtime. See the following package for how-to migrate the comments from one system to another: http://nagios.sourceforge.net/docs/3_0/upgrading.html#nagios2x I don't know if anyone's written a script for it, but it's a 10-second job to do manually as I did it on my test Nagios 3 system. Andy Martine Carannante wrote: > If we update a nagios 2 version into a nagios3.0b4, the comments set on > hosts and services are lost. > In nagios 2, the comments are saved in a file comments.dat . > In nagios 3, the comments are saved in status.dat. > Is there a script to keep comments between versions 2 and 3 of Nagios > > Thanks for your help > Martine > > - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_qmailq.pl or check_mailq
Hi Jeremy unfortunately that didnt work...i dont think it needs sudo privileges...it still returns OK locally, but nrpe gives a critical...on a side note, did you translate the script out of german? - zack kenton (zkenton) --- This thread is located in the archive at this URL: http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi1[showUid]=5215 - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_qmailq.pl or check_mailq
On Tue, 9 Oct 2007, zack kenton wrote: > So I apologize for the duplicate posts, not sure why it was happening. > However, this will be my last posting if in fact duplicate posts occur. > Now to my problem; the standard check_mailq and the check_qmailq (found > in the contrib directory of plugins) both run on the local hosts and > return mail queue ok, no queued emails, etc...but when running from the > nagios machine with nrpe, they both return errors, along the lines of: > Error code 111 returned from /usr/bin/mailq or CRITICAL: Keine gueltigte > Antwort (Zeile #1) von /var/qmail/bin/qmail-qstat (its a german plugin > apparently) but translated means bad answer from line 1 in > qmail-qstat...has anyone run into these same problems...and forgive the > duplicates ahead of time...I wont post with this email after this I use check_qmailq.pl. I run it with sudo like: command[check_qmailq]=sudo /usr/lib/nagios/plugins/contrib/check_qmailq.pl and my sudoers is setup so my nrpe user can run that specific script. Jeremy C. Reed - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] check_qmailq.pl or check_mailq
Hi list So I apologize for the duplicate posts, not sure why it was happening. However, this will be my last posting if in fact duplicate posts occur. Now to my problem; the standard check_mailq and the check_qmailq (found in the contrib directory of plugins) both run on the local hosts and return mail queue ok, no queued emails, etc...but when running from the nagios machine with nrpe, they both return errors, along the lines of: Error code 111 returned from /usr/bin/mailq or CRITICAL: Keine gueltigte Antwort (Zeile #1) von /var/qmail/bin/qmail-qstat (its a german plugin apparently) but translated means bad answer from line 1 in qmail-qstat...has anyone run into these same problems...and forgive the duplicates ahead of time...I wont post with this email after this - zack kenton (zkenton) --- The mailing list archive is found here: http://www.nagiosexchange.org/nagios-users.34.0.html - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Timeouts for send_nsca program
Make sure NSCA is still accepting the results from send_nsca. to do this send over a result from the command line. reviewing your submit_check_result.sh should give you the connection details of the destination nsca, and the send_nsca --help should assist with the commanad line, This is assuming that it used to work, and would only detect that it is currently broken. but not why it broke nor how to fix it. if it is intermittent, you can trial and error to find a good Timeout value using the command line. other people would be better at diagnosing the cause of the issue. Tony (author of NC_Net) On 10/9/07, Wheeler, JF (Jonathan) <[EMAIL PROTECTED]> wrote: > In /var/log/nagios/nagios.log on (at least) one of my slave servers, I > am seeing messages like: > > [1191905959] Warning: OCSP command > '/usr/lib/nagios/plugins/tier1/submit_check_result.sh "HOST" > "SERVICE_CHECK" OK "MESSAGE" for service "SERVICE NAME" on host "HOST" > timed out after 5 seconds > > There have been 712 occurrences today (so far). Can anyone offer an > explanation ? As far as I can tell there is no configuration to > increase the timeout limit (can it be increased by installing from > source ?), but perhaps the message indicates another problem (network ?) > > Jonathan Wheeler > e-Science Centre > Rutherford Appleton Laboratory > > - > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Timeouts for send_nsca program
> -Original Message- > From: [EMAIL PROTECTED] [mailto:nagios-users- > [EMAIL PROTECTED] On Behalf Of Wheeler, JF (Jonathan) > Sent: Tuesday, October 09, 2007 10:24 AM > To: nagios-users@lists.sourceforge.net > Subject: [Nagios-users] Timeouts for send_nsca program > > In /var/log/nagios/nagios.log on (at least) one of my slave servers, I > am seeing messages like: > > [1191905959] Warning: OCSP command > '/usr/lib/nagios/plugins/tier1/submit_check_result.sh "HOST" > "SERVICE_CHECK" OK "MESSAGE" for service "SERVICE NAME" on host "HOST" > timed out after 5 seconds > > There have been 712 occurrences today (so far). Can anyone offer an > explanation ? Something is causing a delay when your submit_check_result.sh script presumably attempts to use send_nsca to send a small bit of data to another host. Can you replicate it from the command line? You should be able to. > As far as I can tell there is no configuration to > increase the timeout limit (can it be increased by installing from ocsp_timeout in nagios.cfg. > source ?), but perhaps the message indicates another problem (network ?) Probably. If it's close to the documented submit_check_result, it's sending only a small bit of data and should only take a fraction of a second. I can use it to send data to two hosts in 0.011s. Maybe network problems or load on either end are causing a delay in the data transmission. -- Marc - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] notify contact only once
On 10/8/07, Terry <[EMAIL PROTECTED]> wrote: > On 10/5/07, Aidan Anderson <[EMAIL PROTECTED]> wrote: > > Terry wrote: > > > Thanks for the reply. Let me be more specific: > > > > > > version: 2.9 > > > OS: centos 5 > > > > > > I have regular contacts set up, me for example. I want to get > > > notified every 30 minutes indefinitely if a service is in a hard state > > > of warning or critical. However, I want another contact to only get > > > notified one time when that hard state is achieved.That's it. > > > >From what I can tell, I can only achieve this through the > > > notification_interval which is only set at the host/service level, not > > > the contact level. If this is true, I will need to create 2 services, > > > each with a different notification_interval and of course apply the > > > different contact groups to each service. Am I correct or is there > > > another way around this? > > > > > > Thanks! > > > > > > On 10/5/07, Aidan Anderson <[EMAIL PROTECTED]> wrote: > > > > > >> Terry wrote: > > >> > > >>> I have a contact that I only want to receive one notification. How > > >>> can I set this up? > > >>> > > >>> > > >>> > > Hi Terry, > > > > I've just posted you another message before seeing this one. You want > > to use host or service escalations to achieve this. I've briefly > > explained in the previous post but if you need more help, just shout. > > > > Aidan > > > > Thanks for your reply. It looks like for me to achieve this, I will > need to create 2 escalations for every service. One for the single > notification and the other for standard notifications. That seems > like a lot of configuration. > Anyone have any additional thoughts on this? I am considering writing a custom email handler that looks for the first notification and handles it appropriately. That way I don't have to have an escalation for every single service. - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Timeouts for send_nsca program
In /var/log/nagios/nagios.log on (at least) one of my slave servers, I am seeing messages like: [1191905959] Warning: OCSP command '/usr/lib/nagios/plugins/tier1/submit_check_result.sh "HOST" "SERVICE_CHECK" OK "MESSAGE" for service "SERVICE NAME" on host "HOST" timed out after 5 seconds There have been 712 occurrences today (so far). Can anyone offer an explanation ? As far as I can tell there is no configuration to increase the timeout limit (can it be increased by installing from source ?), but perhaps the message indicates another problem (network ?) Jonathan Wheeler e-Science Centre Rutherford Appleton Laboratory - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios slow restarting/reloading possible NDOUtilsissue?
> -Original Message- > From: [EMAIL PROTECTED] [mailto:nagios-users- > [EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED] > Sent: Tuesday, October 09, 2007 9:12 AM > To: nagios-users@lists.sourceforge.net > Subject: [Nagios-users] Nagios slow restarting/reloading possible > NDOUtilsissue? > When I reload nagios (/etc/init.d/nagios reload) it takes upwards of a > full minute to bring everything back up for viewing. In the meantime a > message is displayed concerning the status of the nagios daemon: > > Error: Could not read host and service status information! > > This goes away in between 90 second and two minutes and everything begins > to function normally and the system goes on that way until the next time I > have to reload the configuration. Nagios would appear to be doing other things like reading config files, parsing associations, etc, before writing the cache file read by the cgi's. How large is the file pointed to by object_cache_file when everything is working? Does it exist during the non-working window? Do you have retention enabled? It may help if you do not. > 2. Always verify configuration options using the -v command-line option > before starting or restarting Nagios! > > Done and nagios is actually running What is that output? > 3. Make sure you've compiled the main program and the CGIs to use the same > status data storage options (i.e. text file or database). If the main > program is storing status data in a text file and the CGIs are trying to > read status data from a database, you'll have problems. > > This is where I suspect the issue might be hiding. I am using NDOUtils and Actually, this is a holdover from the days of nagios-1 when database support could be integrated directly into the nagios core if compiled that way. It is very different than the current ndomod support. The main program and the CGI's always use the same data storage options with 2.x+ (text file). In any event, your logs show your ndo module successfully connecting to the database immediately after restart. If you have lots of hosts/services (10's of thousands) or complex relationships between them, then that delay is likely normal for 2.x. If not, you really need to determine what nagios is doing during that window. I would recompile nagios with debugging enabled (./configure --enable-DEBUGALL) and run the daemon as a foreground process. If that wasn't interesting, I'd resort to strace to see what the process was doing at a system level. -- Marc - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios slow restarting/reloading possible NDOUtils issue?
I have gone over the documents again this morning and don't think I missed anything. Here is all the relevant information I can think of as well: OS: SLES 9.3 Nagios: 2.9 Plugins: 1.4.10 NRPE: 2.9 NDOUtils: 1.4.9 Addons: NagiosQL, NagVis This system is in the testing phase at this time so I can do anything that needs to be done. When I reload nagios (/etc/init.d/nagios reload) it takes upwards of a full minute to bring everything back up for viewing. In the meantime a message is displayed concerning the status of the nagios daemon: Error: Could not read host and service status information! This goes away in between 90 second and two minutes and everything begins to function normally and the system goes on that way until the next time I have to reload the configuration. The suggestion on this page are: 1. Check the Nagios log file for messages relating to startup or status data errors. [1191938804] Caught SIGHUP, restarting... [1191938804] ndomod: Shutdown complete. [1191938804] Event broker module '/usr/local/nagios/bin/ndomod.o' deinitialized successfully. [1191938804] Nagios 2.9 starting... (PID=22953) [1191938804] LOG VERSION: 2.0 [1191938804] ndomod: NDOMOD 1.4b6 (09-27-2007) Copyright (c) 2005-2007 Ethan Galstad ([EMAIL PROTECTED]) [1191938804] ndomod: Successfully connected to data sink. 0 queued items to flush. [1191938804] Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully. [1191938804] Warning: Contact group 'infrasyalerts' is not used in any host/service definitions or host/service esc alations! [1191938804] Warning: Contact group 'notesadmins' is not used in any host/service definitions or host/service escal ations! [1191938804] Warning: Host performance file processing command 'process-host-perfdata-file' was not found - host pe rformance data file will not be processed! [1191938804] Warning: Service performance file processing command 'process-service-perfdata-file' was not found - s ervice performance data file will not be processed! 2. Always verify configuration options using the -v command-line option before starting or restarting Nagios! Done and nagios is actually running 3. Make sure you've compiled the main program and the CGIs to use the same status data storage options (i.e. text file or database). If the main program is storing status data in a text file and the CGIs are trying to read status data from a database, you'll have problems. This is where I suspect the issue might be hiding. I am using NDOUtils and did not compile nagios is any special manner. I suspect, if this is actually an error, that it is somewhere in the text files vs. DB issue. I could be insane and this just could be how long it takes nagios to HUP when using NDOUtils. No matter the issue I suspect this list is where I will find my answer. Thanks in advance for everyone's time. Best Regards, Mark L. Potter Systems Engineer Academy Sports & Outdoors - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios and cluster setup...few questions
Lacking the time and knowledge to understand monitoring clusters, I concocted a home brew web page that has the concept of a virtual service, and virtual service groups. It's all configured in an XML file. I have a virtual service called "clipcache 01", one called "clipcache 02", and so on. Easch service has two checks pulled from the nagios status file, using hostname and service description. If both are OK, then the virtual service is ok, otherwise it's critical. I then have a group called "Clipcaches", which looks at the number of virtual services, and is critical if none ar running, warning if 1 is running, or OK if 3 or more are running. A group called "live system" monitors the clipcaches group, amongst others, and exports it's status back up. It also looks for shceduled downtime and acknowlegments. Another group that's a member of "live system" is "midtier", which consists of checks of virtual service "Main midtier" (which monitors a certain process on one of 3 machines, must be on one, and one only, to be OK), "Search instance", which is OK if 2 instances are found on one of 4 machines, warning if 1, and critical if 0, and a few other checks. The program then displays this as a tree on a webpage, expanding branches with problems, it gives a quick comforting overview of the whole system, while nagios' "Service Problems" page gives a list of things to fix (which might not be of immediate importance to the overall health of the system, but need fixing anyway) It does it using Perl's "Nagios::StatusLog" module. No idea how well it scales, and I'm sure there's a better way of doing it. It's definatly a work in progress, and has made me think a lot more about defining system health. -- Paul Weaver Systems Development Engineer News Production Facilities, BBC News Work: 020 822 58109 Room 1244 Television Centre, Wood Lane, London, W12 7RJ > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf > Of Tarak Patel > Sent: 09 October 2007 14:32 > To: nagios-users@lists.sourceforge.net > Subject: [Nagios-users] Nagios and cluster setup...few questions > > > Hi all, > > Here is a quick background of my current setup for monitoring: > > I have an in-house tool monitoring clusters. The tool simply > uses ssh to > launch perl scripts on remote machines and grab all of the output to > stores it on a central location in a logfile. This output is > parsed and > for any pre-defined tags (WARNING/CRITICAL/ERROR). If any of > these tags > are noticed the message is logged using syslog. The scripts > residing on > remote hosts is a collection of perl functions. Each one is > executed one > after another. Some of these functions utilize a status file from > previous run to verify if state of items changed from last > time. Some of > these functions can be given a special argument to set the > current state > as default state for next iteration of checks. > > Cluster are monitored from the head nodes since not all nodes are > accessible from central location. Head node checks contain a special > function that simply use DSH to launch checks on all nodes. > > After looking at nagios and its check_cluster plugins I > realized I would > really like to monitor each of the nodes individually since I > want to be > able to disable a particular check on a particular node. Also > I want to > be able to use status files for some of the checks. As of now > I have yet > to find any plugin that utilizes a status file to monitor hosts. All > plugin simply use current output from commands to verify the status. > > I will be using active checks on the clusters therefore I > will configure > nrpe on all nodes. My plan of attack was to simply use head node as a > gateway and all nodes and services to be defined on the head node > (under nrpe). From central location I can simply execute a check_nrpe > type script to verify backend nodes. > > I still haven't figured out how I can use status files from each > iteration of checks to validate status. I'd appreciate some > inputs as to > what are the best options in monitoring clusters where > backend nodes are > hidden from the central monitoring server. Also some help with use of > state files. > > Thanks all, > > TP. > > > -- > --- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and > a browser. Download your FREE copy of Splunk now >> > http://get.splunk.com/ ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ht
[Nagios-users] Nagios and cluster setup...few questions
Hi all, Here is a quick background of my current setup for monitoring: I have an in-house tool monitoring clusters. The tool simply uses ssh to launch perl scripts on remote machines and grab all of the output to stores it on a central location in a logfile. This output is parsed and for any pre-defined tags (WARNING/CRITICAL/ERROR). If any of these tags are noticed the message is logged using syslog. The scripts residing on remote hosts is a collection of perl functions. Each one is executed one after another. Some of these functions utilize a status file from previous run to verify if state of items changed from last time. Some of these functions can be given a special argument to set the current state as default state for next iteration of checks. Cluster are monitored from the head nodes since not all nodes are accessible from central location. Head node checks contain a special function that simply use DSH to launch checks on all nodes. After looking at nagios and its check_cluster plugins I realized I would really like to monitor each of the nodes individually since I want to be able to disable a particular check on a particular node. Also I want to be able to use status files for some of the checks. As of now I have yet to find any plugin that utilizes a status file to monitor hosts. All plugin simply use current output from commands to verify the status. I will be using active checks on the clusters therefore I will configure nrpe on all nodes. My plan of attack was to simply use head node as a gateway and all nodes and services to be defined on the head node (under nrpe). From central location I can simply execute a check_nrpe type script to verify backend nodes. I still haven't figured out how I can use status files from each iteration of checks to validate status. I'd appreciate some inputs as to what are the best options in monitoring clusters where backend nodes are hidden from the central monitoring server. Also some help with use of state files. Thanks all, TP. - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Comments between versions 2 and 3 of Nagios
If we update a nagios 2 version into a nagios3.0b4, the comments set on hosts and services are lost. In nagios 2, the comments are saved in a file comments.dat . In nagios 3, the comments are saved in status.dat. Is there a script to keep comments between versions 2 and 3 of Nagios Thanks for your help Martine -- Martine Carannante System Software Development R&D Bull, Architect of an Open World TM Tél : (+33) 1 30 80 71 87 http://www.bull.com - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Plugins for health-check of IBM servers
Thanks for the help. I am in the process of configuring the check_ipmi_chassis plugin. In my earlier email I mentioned that I use the check_cciss plugin for health-check of HP servers (by mistake). Actually the plugin used is check_hpasm. Regards Yogesh On 10/8/07, Gerhard Lausser <[EMAIL PROTECTED]> wrote: > > Have a look at the numerous check_ipmi[tool] plugins or google for > "nagios ipmitool". > > Gerhard > > -- > *Von:* [EMAIL PROTECTED] [mailto: > [EMAIL PROTECTED] *Im Auftrag von *Yogesh > Hasabnis > *Gesendet:* Montag, 8. Oktober 2007 14:02 > *An:* nagios-users@lists.sourceforge.net > *Betreff:* [Nagios-users] Plugins for health-check of IBM servers > > Hi, > > I have been using check_cciss plugin for monitoring the health of our HP > Proliant server. Would like to know about a similar plugin for monitoring > IBM X-series servers. > > Thanks in advance. > > Yogesh > > - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check_http
> -Original Message- > From: [EMAIL PROTECTED] [mailto:nagios-users- > [EMAIL PROTECTED] On Behalf Of alexus > Sent: Monday, October 08, 2007 9:55 PM > To: nagios-users@lists.sourceforge.net > Subject: [Nagios-users] check_http > > how can i create a command/template using check_http and as paramert1 > pass virtual host (website address) > > sounds too easy? Yes ;) > here is the "big" problem... > > i want to be able to check several websites that resides on 1 IP, so > i'm going be using same template,check_command to query multiple Not so much of a problem. You'll want to use the -I and -H parameters to check_http and pass the hostname from the service definition as an argument -- define command{ command_name http_vhost Command_line $USER1$/check_http -I $HOSTADDRESS$ -H $ARG1$ } define host{ host_name web_server address 192.168.1.2 ... } define service{ host_name webserver service_description web site 1 check_command http_vhost!www.site1.example ... } define service{ define service{ host_name webserver service_description web site 2 check_command http_vhost!www.site2.example ... } -- Marc - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] french nagios site
Hi! Information in english : Like the nagios plugin team and his new website, the french speaking nagios community has also a new website on http://nagios-fr.org. Here you'll find at the moment news and technical reference about nagios in french. You can meet me at the nagios konferenz about questions on this project. Information in french : J'ai démarré un nouveau site autour de Nagios et de la supervision sur http://nagios-fr.org et j'espère qu'il deviendra le rendez-vous de la communauté francophone nagios. Vous y trouverez pour l'instant un blog que je maintiens et un wiki. Ce domaine espérant devenir le vôtre, vous pouvez vous enregistrer et poster des commentaires dans le blog ainsi que ajouter,modifier des articles dans le wiki. Ceux qui souhaiterait prendre un rôle dans la tenue de nagios-fr.org de façon plus formelle peuvent me contacter par le formulaire de contact du site. Vous pouvez aussi me rencontrer lors de la Nagios Konferenz à laquelle j'assisterai cette semaine. A bientôt sur nagios-fr.org See you soon on nagios-fr.org Olivier Jan - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] New Nagios Plugins website
Thanks Tom, We were thinking that the --help pages are different from the man pages. For instance, dpkg --help or rpm --help gives a list of options, whereas man dpkg or man rpm gives more commentary and examples. However, that requires more work. If --help on the site is sufficient, we could setup a job that takes the latest code and populate the web site with it. Ton On 9 Oct 2007, at 10:54, <[EMAIL PROTECTED]> <[EMAIL PROTECTED]> wrote: > Hi Ton, > > I notice you have a Manuals page with the help files for each > plugin. I > went and collected a whole bunch off stuff recently. I have > attached a > MS word doc with what I have. Hope this doc saves you or some others > some work > > Regards > > Tom > > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of Ton > Voon > Sent: 09 October 2007 10:44 > To: Nagios Users mailinglist; [EMAIL PROTECTED] > Subject: [Nagios-users] New Nagios Plugins website > > Hi! > > Just to let you all know that a new website for the Nagios Plugins > project has just been launched: http://nagiosplugins.org > > We'll be using this as a way of keeping you all up to date on the > latest > information about the plugins. > > But it is your site too. We've enabled comments so you can add your > information to various pages. We're especially looking forward to some > innovative ways you use the plugins. > http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nagios script quits with unknown error
Hi again, Sorry about not writing these in the previous email. Yes, the values are correct in script. KR, Cem From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of hindrek murdsalu Sent: Tuesday, October 09, 2007 11:25 AM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] nagios script quits with unknown error Are you using the appropriate return codes? Const intCritical = 2 Const intWarning = 1 Const intOK = 0 Const intUnknown = 3 From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Keles, Cem Sent: 9. oktoober 2007. a. 10:57 To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] nagios script quits with unknown error Hi, I use Nagios 2.9. I have a vbscript running on windows servers that checks the cpu usage and outputs with performance data. Here is the output of script when executed on command prompt. Processor Time=41% |PerProc=41;60;90;0;100 The last lines of script are like this: If PercentProcessorTime > warnperc Then if PercentProcessorTime > critperc Then Wscript.Quit(2) 'Quit with critical Else Wscript.Quit(1) 'Quit with warning End If End If Wscript.Quit(0) 'Quit with OK So the script quits with warning or critical if those criterias in if statements are met and sure it should at least quit with ok. But most of the time I get these notifications from nagios. -- * Nagios * Notification Type: PROBLEM Service: CPU load Host: hostname Address: xx.xx.xx.xx State: UNKNOWN Date/Time: Tue Oct 9 09:25:10 CEST 2007 Additional Info: Processor Time=52% --- You see the output is ok, that it gives Processor Time=52%. My question is: Why do you think nagios gives unknown even script executes well? Any help will be greatly appreciated. KR, Cem The information contained in this message or any of its attachments may be confidential and is intended for the exclusive use of the addressee(s). Any disclosure, reproduction, distribution or other dissemination or use of this communication is strictly prohibited without the express permission of the sender. The views expressed in this email are those of the individual and not necessarily those of Sony or Sony affiliated companies. Sony email is for business use only. This email and any response may be monitored by Sony to be in compliance with Sony's global policies and standards The information contained in this message or any of its attachments may be confidential and is intended for the exclusive use of the addressee(s). Any disclosure, reproduction, distribution or other dissemination or use of this communication is strictly prohibited without the express permission of the sender. The views expressed in this email are those of the individual and not necessarily those of Sony or Sony affiliated companies. Sony email is for business use only. This email and any response may be monitored by Sony to be in compliance with Sonys global policies and standards- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] New Nagios Plugins website
Hi! Just to let you all know that a new website for the Nagios Plugins project has just been launched: http://nagiosplugins.org We'll be using this as a way of keeping you all up to date on the latest information about the plugins. But it is your site too. We've enabled comments so you can add your information to various pages. We're especially looking forward to some innovative ways you use the plugins. See you at the site! Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nagios script quits with unknown error
Are you using the appropriate return codes? Const intCritical = 2 Const intWarning = 1 Const intOK = 0 Const intUnknown = 3 From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Keles, Cem Sent: 9. oktoober 2007. a. 10:57 To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] nagios script quits with unknown error Hi, I use Nagios 2.9. I have a vbscript running on windows servers that checks the cpu usage and outputs with performance data. Here is the output of script when executed on command prompt. Processor Time=41% |PerProc=41;60;90;0;100 The last lines of script are like this: If PercentProcessorTime > warnperc Then if PercentProcessorTime > critperc Then Wscript.Quit(2) 'Quit with critical Else Wscript.Quit(1) 'Quit with warning End If End If Wscript.Quit(0) 'Quit with OK So the script quits with warning or critical if those criterias in if statements are met and sure it should at least quit with ok. But most of the time I get these notifications from nagios. -- * Nagios * Notification Type: PROBLEM Service: CPU load Host: hostname Address: xx.xx.xx.xx State: UNKNOWN Date/Time: Tue Oct 9 09:25:10 CEST 2007 Additional Info: Processor Time=52% --- You see the output is ok, that it gives Processor Time=52%. My question is: Why do you think nagios gives unknown even script executes well? Any help will be greatly appreciated. KR, Cem The information contained in this message or any of its attachments may be confidential and is intended for the exclusive use of the addressee(s). Any disclosure, reproduction, distribution or other dissemination or use of this communication is strictly prohibited without the express permission of the sender. The views expressed in this email are those of the individual and not necessarily those of Sony or Sony affiliated companies. Sony email is for business use only. This email and any response may be monitored by Sony to be in compliance with Sony's global policies and standards - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] nagios script quits with unknown error
Hi, I use Nagios 2.9. I have a vbscript running on windows servers that checks the cpu usage and outputs with performance data. Here is the output of script when executed on command prompt. Processor Time=41% |PerProc=41;60;90;0;100 The last lines of script are like this: If PercentProcessorTime > warnperc Then if PercentProcessorTime > critperc Then Wscript.Quit(2) 'Quit with critical Else Wscript.Quit(1) 'Quit with warning End If End If Wscript.Quit(0) 'Quit with OK So the script quits with warning or critical if those criterias in if statements are met and sure it should at least quit with ok. But most of the time I get these notifications from nagios. -- * Nagios * Notification Type: PROBLEM Service: CPU load Host: hostname Address: xx.xx.xx.xx State: UNKNOWN Date/Time: Tue Oct 9 09:25:10 CEST 2007 Additional Info: Processor Time=52% --- You see the output is ok, that it gives Processor Time=52%. My question is: Why do you think nagios gives unknown even script executes well? Any help will be greatly appreciated. KR, Cem The information contained in this message or any of its attachments may be confidential and is intended for the exclusive use of the addressee(s). Any disclosure, reproduction, distribution or other dissemination or use of this communication is strictly prohibited without the express permission of the sender. The views expressed in this email are those of the individual and not necessarily those of Sony or Sony affiliated companies. Sony email is for business use only. This email and any response may be monitored by Sony to be in compliance with Sonys global policies and standards- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] escalation
First, you don't really want to add r to escalation_options as you only want it to escalate when the status is critical, not when it's critical and recovery. Second, it doesn't make much sense as an "escalation" as the notification interval is supposed to increase when you define your escalation, you do exactly the opposite, although it probably is not the cause, you could add a bigger notification_interval to service than to service escalation. (forgot to change the "To..." address to Nagios-user, now the original poster gets 2 mails, but oh well) From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Siwei Zhang Sent: 8. oktoober 2007. a. 22:30 To: nagios-users Subject: [Nagios-users] escalation Hi there, I am trying to configure the escalation but never received the escalation email. What might be wrong? The followings are my configurations. Thanks. Kevin SZ define service{ use generic-service hostgroup_name host_name service_description check-host-alive check_command check-host-alive is_volatile 0 check_period24x7 max_check_attempts 1 contact_groups g1,g2 normal_check_interval 5 retry_check_interval5 process_perf_data 1 notification_interval 0 notification_period 24x7 notification_optionsw,u,c,r notifications_enabled 1 } define serviceescalation{ host_name service_description check-host-alive contact_groups g1,g2 first_notification 2 last_notification 0 notification_interval 300 escalation_options c,r } - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Availability reports & Retrospective Scheduled Downtime
Hi there, Is there anyway to retrospectively add in scheduled downtime so that the downtime does not appear as unscheduled in the availability report in Nagios 2.9? At present, if I add downtime after the outage (e.g. if it turns that an outage was due to a customer doing maintanence rather than a fault in our service) it doesn't change the report, it still appears as unscheduled downtime. Kind Regards, Alan -- Alan Cooper Lumison T: 0131 514 4033 F: 0845 119 9901 PS. Do you know that we have opened a new datacentre in Croydon? Click https://www.lumison.net/services/pdfs/colo_croydon.pdf or give me a call if you want to know more. -- This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error please notify the sender. Any offers or quotation of service are subject to formal specification. Errors and omissions excepted. Please note that any views or opinions presented in this email are solely those of the author and do not necessarily represent those of Lumison, nplusone or lightershade ltd. Finally, the recipient should check this email and any attachments for the presence of viruses. Lumison, nplusone and lightershade ltd accepts no liability for any damage caused by any virus transmitted by this email. -- -- Virus scanned by Lumison. - This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null