Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Marc Powell
On May 19, 2010, at 12:08 PM, Michael Maxwell wrote: > Well, I didn't exactly leave this out. Remember I mentioned that it's a > clean install from the Quickstart document. I've changed nothing in it > other than the admin contact in contacts.cfg. So whatever's there is there > in the default

Re: [Nagios-users] check_openmanage weirdness

2010-05-19 Thread Trond Hasle Amundsen
Greg Etling writes: > I have just started implementing some check_openmanage checks on my > servers, and have run into some odd behavior with the combination of > Windows 2003, OM 6.2 and the SNMP check. It appears that this > combination is having issues with the drive/controller reporting.

Re: [Nagios-users] NRPE: Unable to read output (after upgrading sudo to 1.7.2p1-6.el5_5)

2010-05-19 Thread Luc I. Suryo
1) do : uname -n the output must match the host defined, in your case ds2 JUST in case put a space between NOPASSWD: and /usr/ 2) make sure nagios has a correct/auth shell (/etc/shells) 3) add the following lines in the sudo file Cmnd_Alias LIST= /usr/bin/sudo -l ALL

[Nagios-users] check_openmanage weirdness

2010-05-19 Thread Greg Etling
I have just started implementing some check_openmanage checks on my servers, and have run into some odd behavior with the combination of Windows 2003, OM 6.2 and the SNMP check. It appears that this combination is having issues with the drive/controller reporting. Initially things worked fine u

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread Corey Hickey
On 2010-05-19 11:47, Corey Hickey wrote: > On 2010-05-18 18:39, Mathieu Gagné wrote: >> On 2010-05-18 21:29, Corey Hickey wrote: >>> Hello, >>> >>> I have inherited maintenance of a medium-sized Nagios installation. We >>> currently have 649 hosts and 5415 services. Our setup works nicely, with >>>

Re: [Nagios-users] NRPE: Unable to read output (after upgrading sudo to 1.7.2p1-6.el5_5)

2010-05-19 Thread Ben O'Hara
I found the same problem with sudo after an update. Seems the comment line in the sudoers file causes a synax with #comment, you need a space, so # comment works ok. Ben On Wed, May 19, 2010 at 5:48 PM, Yungwei Chen wrote: > Hi, > >        I have been using a check_ldap_fd command on a nagios c

Re: [Nagios-users] NRPE with command arguments

2010-05-19 Thread shadih rahman
try nrpe call "-n" option On Wed, May 19, 2010 at 11:06 AM, Ludse Verhoeven wrote: > > I think that's fine. It all depends on what you want to pass to the command > itself (in this case the /usr/locall.nagios/libexec/check_disk-plugin). If > f.i. -w "$ARG1$" can use space-seperated values (which

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread Corey Hickey
On 2010-05-18 18:39, Mathieu Gagné wrote: > On 2010-05-18 21:29, Corey Hickey wrote: >> Hello, >> >> I have inherited maintenance of a medium-sized Nagios installation. We >> currently have 649 hosts and 5415 services. Our setup works nicely, with >> one exception: Nagios falls behind on host/servi

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Marcel
Perhaps if you're upgrading and haven't touched the resources.cfg, then you could be having issues with $USER1$ defined for your previous installation. On Wed, May 19, 2010 at 3:09 PM, Michael Maxwell wrote: > > > P.S. My apologies if I seem snippy. It's a bad day. If you provide > > the info,

[Nagios-users] Escalations - Warning to Critical, without skipping?

2010-05-19 Thread Mike Lindsey
So, here's my situation. I've got around 10k checks, Warnings do not notify, because we have historically had issues with Warning notifications (from the contact group setting) going out, then a service turning critical and the pager escalations (which only include critical) skipping directly

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Rick Mangus
On Wed, May 19, 2010 at 1:09 PM, Michael Maxwell wrote: > >> P.S.  My apologies if I seem snippy.  It's a bad day.  If you provide >> the info, I promise a speedy, polite response.  :) > > No thanks.  I'll use an earlier version.  I'm not exactly having a great day > either.  A little civility wou

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Michael Maxwell
> P.S. My apologies if I seem snippy. It's a bad day. If you provide > the info, I promise a speedy, polite response. :) No thanks. I'll use an earlier version. I'm not exactly having a great day either. A little civility would get a better response from me as I've been nothing but civil a

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Trisha Hoang
We used nagios-3.2.1 and Centos5.2 and experienced somewhat the same problem as Michael described, but I don't see the same problem after moving to nagios-3.2.0. -- ___ Nagios-us

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Rick Mangus
Michael: What check are you using for the host check? The line we're looking for is under your host definition and looks like so: check_command Then, copy/paste the command definition for that command. Patrick asked the vital question, and you blew him off. The Quickstart page d

Re: [Nagios-users] Full Throttle Nagios

2010-05-19 Thread Kyle Bader
>> There are several tweaks one can do to make Nagios more or less saturate the >> CPU, but none of them are enough if the number of checks rise above a >> certain >> level. We've noticed that we can run about 60K service-checks so long as we >> don't have many state-changes in the network. That's

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Michael Maxwell
On 5/19/10 9:33 AM, "Lacayo, Luis F" wrote: > What happens when you run it from the command line as the Nagios user? > I had a similar issue that required the check to run as root. Everything works fine when running from commandline as nagios user or as root. It even works fine within Nagios *e

Re: [Nagios-users] Escalation question

2010-05-19 Thread Martin Melin
On Wed, May 19, 2010 at 3:20 PM, Vladimir N. Indik wrote: > I want to send notification to three contacts in order. > o...@example.com > wait 10 minutes > t...@example.com > wait 10 minutes > th...@example.com > > My config: > define service { >        host_name                       hostname >  

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Michael Maxwell
On 5/19/10 9:42 AM, "Morris, Patrick" wrote: > You've left out what is probably the most important information: how the > check that is failing is defined. > > Take a look at your host check definition. Chances are it's either got a > typo in the configuration, is pointing at a plugin that's no

Re: [Nagios-users] Anyone? : SIGSEGV when trying to use eventhandler

2010-05-19 Thread nagios
Thanks for the reply Guy. I installed nagios from the "contrib" repo. as per the blog here...and all looks like it's working fine. This is the most recent version in the repo's. I guess I could compile it from source, but would like to be sure that's the issue before I go down this path. http:

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Michael Maxwell
On 5/19/10 8:58 AM, "Assaf Flatto" wrote: > AFAIK centos has a security mechanism called msec that can some time > change the permissions of files and cause this issue . > > check if you have the msec is working ( usually in the cron.d or > cron.hourly or cron.daily directory ) and if so - you m

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Morris, Patrick
Michael Maxwell wrote: > Hello, > > Just installed nagios 3.2.1 following the docs at > http://nagios.sourceforge.net/docs/3_0/quickstart.html > > Everything starts up normally, web interface is fine, can login, > enable/disable checks, etc... However, every 2 hours exactly, > I'm getting the foll

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Lacayo, Luis F
What happens when you run it from the command line as the Nagios user? I had a similar issue that required the check to run as root. Luis -Original Message- From: Assaf Flatto [mailto:nag...@flatto.net] Sent: Wednesday, May 19, 2010 10:59 AM To: nagios-users@lists.sourceforge.net Subjec

[Nagios-users] NRPE: Unable to read output (after upgrading sudo to 1.7.2p1-6.el5_5)

2010-05-19 Thread Yungwei Chen
Hi, I have been using a check_ldap_fd command on a nagios client without any problem. That command requires superuser permission. After upgrading sudo to 1.7.2p1-6.el5_5, the command stops working. Running the following command on my nagios server always shows the

Re: [Nagios-users] localhost DOWN messages, return code 127 is out of bounds

2010-05-19 Thread Assaf Flatto
Michael Maxwell wrote: > Hello, > > Just installed nagios 3.2.1 following the docs at > http://nagios.sourceforge.net/docs/3_0/quickstart.html > > Everything starts up normally, web interface is fine, can login, > enable/disable checks, etc... However, every 2 hours exactly, > I'm getting the foll

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread Frost, Mark {PBC}
> -Original Message- > From: Corey Hickey [mailto:bugfood...@fatooh.org] > Sent: Tuesday, May 18, 2010 9:30 PM > To: nagios-users@lists.sourceforge.net > Subject: [Nagios-users] trying to fix problem with excessive latency > > Hello, > > I have inherited maintenance of a medium-sized Na

Re: [Nagios-users] NRPE with command arguments

2010-05-19 Thread Ludse Verhoeven
I think that's fine. It all depends on what you want to pass to the command itself (in this case the /usr/locall.nagios/libexec/check_disk-plugin). If f.i. -w "$ARG1$" can use space-seperated values (which it can't, but just for arguments sake), then you have to specify (in nrpe.cfg) the (singl

Re: [Nagios-users] NRPE with command arguments

2010-05-19 Thread Enrico Zimol
On 19 May 2010 11:13, Ludse Verhoeven wrote: > Not sure if this works like it is supposed to (haven't tested it myself), > but qua syntax this seems more correct. > That worked fine :) Other question, where exactly I've to insert quotes on nrpe.cfg commands to prevent code injection? I think lik

Re: [Nagios-users] Best use of check_ntp_time plugin

2010-05-19 Thread Andreas Ericsson
On 05/19/2010 03:01 PM, Deborah Martin wrote: > Folks, > > We've had a problem recently where the timestamp has been drifting > hours ahead on the VM box running nagios. (v3.2.0 with SLES 10 SP2) > Don't run any kind of scheduling engine inside a VM. It's like begging for Mr Murphy to come kick

[Nagios-users] Best use of check_ntp_time plugin

2010-05-19 Thread Deborah Martin
Folks, We've had a problem recently where the timestamp has been drifting hours ahead on the VM box running nagios. (v3.2.0 with SLES 10 SP2) As a consequence I'm setting up the plugin check_ntp_time to ensure that the clock on the nagios monitoring box doesn't drift too far ahead (or behind.)

[Nagios-users] Escalation question

2010-05-19 Thread Vladimir N. Indik
I want to send notification to three contacts in order. o...@example.com wait 10 minutes t...@example.com wait 10 minutes th...@example.com My config: define service { host_name hostname service_description servicename notification_interval

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread Marc Powell
On May 19, 2010, at 7:00 AM, C. Bensend wrote: > Doesn't this kill the use of said macros in notifications, etc? Not if you're using them in the 'standard' way by passing them as $MACRONAME$ in the command_line of the notification command. This is how 99% of installations use them I would ex

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread C. Bensend
>> Doesn't this kill the use of said macros in notifications, etc? > > No. It just means you can't access the various macros as environment > variables, like so: > > $NAGIOS_(macroname) > > They still get swapped in at the command line, so they're still > totally useful for that particular purpose

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread Andreas Ericsson
On 05/19/2010 02:00 PM, C. Bensend wrote: > >> Try disabling "enable_environment_macros". It helped use greatly when we >> disabled this option. Nagios was spending most of its time allocating >> memory for environment variables it never used. (in our case) >> >> Host/Service checks latency droppe

Re: [Nagios-users] trying to fix problem with excessive latency

2010-05-19 Thread C. Bensend
> Try disabling "enable_environment_macros". It helped use greatly when we > disabled this option. Nagios was spending most of its time allocating > memory for environment variables it never used. (in our case) > > Host/Service checks latency dropped from ~20 minutes to 10 seconds I > would say.

Re: [Nagios-users] Full Throttle Nagios

2010-05-19 Thread Max
Really look forward to seeing this code - this is the process pool code, yes? On 5/19/10, Andreas Ericsson wrote: > There are several tweaks one can do to make Nagios more or less saturate the > CPU, but none of them are enough if the number of checks rise above a > certain > level. We've noticed

Re: [Nagios-users] Full Throttle Nagios

2010-05-19 Thread Andreas Ericsson
There are several tweaks one can do to make Nagios more or less saturate the CPU, but none of them are enough if the number of checks rise above a certain level. We've noticed that we can run about 60K service-checks so long as we don't have many state-changes in the network. That's not nearly good

Re: [Nagios-users] Anyone? : SIGSEGV when trying to use eventhandler

2010-05-19 Thread Guy Waugh
I'm definitely no expert but... * What does it say when you 'ldd' the nagios binary? Are all the libraries the binary is linked against able to be found? Are those libraries up-to-date? * Where did you get nagios from? Did you compile it or is it pre-built? If pre-built, are there any updates? * I

[Nagios-users] Anyone? : SIGSEGV when trying to use eventhandler

2010-05-19 Thread nagios
Anybody? If you need extra information, just let me know what you need to see and I'll upload it. Thanks. -Original Message- From: "nagios" To: nagios-users@lists.sourceforge.net Date: Wed, 19 May 2010 01:42:15 +1000 Subject: [Nagios-users] SIGSEGV when trying to use eventhandler Hi g

Re: [Nagios-users] NRPE with command arguments

2010-05-19 Thread Ludse Verhoeven
Hi there, Your final command to invoke check_disk is "check_nrpe!check_disk!$_HOSTDISKWARNING$!$_HOSTDISKCRITICAL$". Passing this to check_nrpe will only get check_disk as an argument passed through to nrpe (since your check_nrpe-command is "$USER1$/check_nrpe -H $HOSTADDRESS$ -p $_HOSTNRPE_PO

Re: [Nagios-users] NRPE with command arguments

2010-05-19 Thread Enrico Zimol
>after that - change the testing to work the way you want it Hello, i'm testing that but I can't understand what is wrong on my configurations. (blame_nrpe e ricompiling it's ok) On nrpe.cfg I defined this command: command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c $ARG2$ And