Re: [Nagios-users] check_load gone crazy

Marc Powell Wed, 08 Sep 2010 04:56:47 -0700

On Sep 7, 2010, at 7:39 PM, Mike Chesnut wrote:

> I'm wondering if this is a known bug, and/or if anybody else has seen 
> similar behavior...
> 
> We're using Nagios 3.2.1 on Linux, monitoring several Linux systems.  We 
> run the check_load probe against every system.  Occasionally (at 
> non-regular intervals), Nagios will freak out and alert on the load 
> average of many (sometimes *all*) systems.  When this occurs, it reports 
> the *same* load averages for each system, and the weirdest part is that 
> these load averages are completely bogus.


> Any ideas for what I can do to get to the bottom of why this happens?

What transport mechanism are you using to run check_load on the remote systems? 
check_load is not 'network aware': it only reports the load of the machine it 
runs on, so the binary must be installed on each remote machine and run there 
via some transport (check_nrpe, check_by_ssh, etc.). It seems to me that you 
are not running it on the remote machines but are instead monitoring the load 
of your Nagios machine multiple times. If you are unsure, can you show a couple 
of example service{} and corresponding command{} definitions?
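
For comparison, here is a rough sketch of the two cases, assuming NRPE as the 
transport. The host name, service template, thresholds, and plugin path are 
placeholders I made up for illustration, not anything taken from your setup:

```cfg
# Case 1: runs check_load on the Nagios server itself, whatever host the
# service is attached to. Every host ends up reporting the server's load.
define command{
        command_name    check_local_load
        command_line    $USER1$/check_load -w $ARG1$ -c $ARG2$
        }

# Case 2: asks the NRPE daemon on the remote host to run its own check_load.
define command{
        command_name    check_nrpe
        command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$
        }

define service{
        use                     generic-service   ; assumed template name
        host_name               remotehost        ; placeholder host
        service_description     Current Load
        check_command           check_nrpe!check_load
        }

# In nrpe.cfg on the remote machine (path and thresholds are assumptions):
# command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20
```

If your command{} definition looks like the first case, with no check_nrpe or 
check_by_ssh in the command_line, that would explain identical load averages 
on every host.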

--
Marc
