Re: [Nagios-users] Weird Nagios Problem

2012-12-04 Thread Jeffrey Watts
Martin, I've always used NRPE to run check_load remotely. If you use SNMP, you can also write a custom plugin to gather the values that way. There might be a plugin that someone else has written, too. Jeffrey. On Tue, Dec 4, 2012 at 8:33 AM, Martin Hugo wrote: > You are right, it was using c

Re: [Nagios-users] 2 Nagios boxes running together in different locations

2012-05-09 Thread Jeffrey Watts
This is exactly how I do things, except I have three sites. Jeffrey On Wednesday, May 9, 2012, C. Bensend wrote: > > > I've dealt with this situation before, and I've ended up > implementing two mostly standalone Nagios systems. They each > check their own site, so if their external network goes

Re: [Nagios-users] check_http issue

2012-03-12 Thread Jeffrey Watts
I wonder if it's an epoch thing... Are all of the certs that are failing ones in which the expiry year is 2038 or greater? Jeffrey. On Mon, Mar 12, 2012 at 4:04 AM, Sunny Jaisinghani < sunny_jaisingh...@symantec.com> wrote: > Hello, > > I am using the check_http plugin for checking the SSL cer

Re: [Nagios-users] Service dependency help

2012-02-07 Thread Jeffrey Watts
Anyone have any advice on this problem? Thanks again, Jeffrey. On Thu, Jan 26, 2012 at 11:56 AM, Jeffrey Watts wrote: > Hello, I'm having some trouble getting a service dependency working and I > was hoping for some help. I've read the section in Wolfgang Barth's book

Re: [Nagios-users] check_snmp OID integer /10

2012-02-03 Thread Jeffrey Watts
You can always write a script that acts as a wrapper around check_snmp if you want prettier info displayed. Jeffrey. 2012/2/1 Sánta József > Hi! > > ** ** > > I have a temperature SNMP device. > > ** ** > > snmpget 10.0.0.63 1.3.6.1.4.1.17095.4.1.3.3.0 -c public -v 1 > > iso.3.6.1.4

[Nagios-users] Service dependency help

2012-01-26 Thread Jeffrey Watts
Hello, I'm having some trouble getting a service dependency working and I was hoping for some help. I've read the section in Wolfgang Barth's book "Nagios 2nd Edition" and googled around a bit, but something's still not working right. I'm using Nagios 3.0.6. Specifically, I want to set it up so

Re: [Nagios-users] question about check_disk

2012-01-20 Thread Jeffrey Watts
You have it backwards. 'df' says 24% used, not remaining. 'check_disk' shows how much is remaining (76% and 99% in this case). I've never liked how check_disk displays its results by default, most Unix tools show as a primary metric how much of a resource is used and not how much is remaining.

Re: [Nagios-users] disk checks unreliable

2011-10-05 Thread Jeffrey Watts
The check is working correctly - /mnt/store is a valid path in both circumstances. Remember, in Unix mounted filesystems all sit on top of the / filesystem, so when you umount the filesystem on /mnt/store, that mountpoint still exists (on /). The way I've done it in the past is by using -r/-R to

Re: [Nagios-users] [RESOLVED] Return code of 9 is out of bounds when plugin is runin nagios, but return code is 0 when run from shell

2011-09-18 Thread Jeffrey Watts
I think there was something in the FAQ about not using the built-in Perl interpreter. I don't, I've had nothing but trouble with it. Jeffrey. On Sun, Sep 18, 2011 at 8:57 PM, Samuel Kidman wrote: > Hi All > > I managed to resolve this by changing the command definition. Not sure > why that was

Re: [Nagios-users] Scheduled downtime and host checks

2011-06-01 Thread Jeffrey Watts
On Wed, Jun 1, 2011 at 1:27 AM, Kumar, Ashish wrote: > >> No, scheduled downtime only affects notifications, and the stats you >> see in the availability cgi. Service and host checks run as normal >> during scheduled downtime. > > > Thanks Jim for the explanation but I do not see any rational re

Re: [Nagios-users] Distributed Nagios Configuration with Passive Checks

2011-03-21 Thread Jeffrey Watts
If I understand you correctly, are you trying to have your master server accept passive checks from other Nagios servers and check services and servers itself? If so, what I think would be easier (and is the method recommended by the docs) would be to just set up a second Nagios server (or instanc

Re: [Nagios-users] Monitoring unmounted partition

2011-02-08 Thread Jeffrey Watts
IMHO /proc/mounts is a much better place to look, as NFS problems and so forth can cause 'mount' to hang and hit the NRPE timeout. Jeffrey. On Tue, Feb 8, 2011 at 6:24 AM, dave stern - e-mail.pluribus.unum < dit.d...@gmail.com> wrote: > Write a plugin. It could search the output of the command,

Re: [Nagios-users] Monitoring temperatures on Cisco equipment

2011-01-27 Thread Jeffrey Watts
Thanks Gerald, that's exactly what I was looking for! Jeffrey. On Thu, Jan 27, 2011 at 3:51 AM, Ortner, Gerald wrote: > Hi, > > > > We use > check_cisco_envmonto > monitor our Cisco equipment. It’s us

Re: [Nagios-users] Monitoring temperatures on Cisco equipment

2011-01-27 Thread Jeffrey Watts
I think you misunderstand. Those two plugins return WARNING or CRITICAL if one of the two things occur: 1) If the ciscoEnvMonTemperatureState is not "normal". 2) If the passed -w and -c values are less than ciscoEnvMonTemperatureStatusValue. What I'm asking is why #2 is _required_. I can unders

Re: [Nagios-users] NRPE- Log files in syslog

2011-01-26 Thread Jeffrey Watts
You really ought to read up on syslog and how it works. It can do everything you're asking. The whole point of the syslog service is to abstract logging from the application - so that it can be centralized, filtered, separated, etc. :) Try (depends on which syslog you have running, ps can help

Re: [Nagios-users] Notification criteria

2011-01-26 Thread Jeffrey Watts
Nagios notifies on HARD states (or flapping, which is another matter). It's generally assumed that people don't want to be notified every time there's a failed check - they want to be notified after it's failed a certain number of times. A SOFT state is a state where non-OK checks have occurred l

[Nagios-users] Monitoring temperatures on Cisco equipment

2011-01-26 Thread Jeffrey Watts
I'm looking to monitor temperature on some various Cisco equipment (2821, 3750, 4948, 6509, etc). I've looked at the check_catalyst_temp.pl and check_env_stats.py plugins, which both look in .1.3.6.1.4.1.9.9.13.1. I see and understand the basic mechanisms of these checks. What I don't understand

Re: [Nagios-users] Problem with check_openmanage

2011-01-24 Thread Jeffrey Watts
, too. Thanks! Jeffrey. On Mon, Jan 24, 2011 at 10:55 AM, Trond Hasle Amundsen < t.h.amund...@usit.uio.no> wrote: > Jeffrey Watts writes: > > > Hello, I'm using Mr. Amundsen's excellent check_openmanage plugin, and > I'm > > getting an odd error: > > &

[Nagios-users] Problem with check_openmanage

2011-01-24 Thread Jeffrey Watts
Hello, I'm using Mr. Amundsen's excellent check_openmanage plugin, and I'm getting an odd error: $ check_openmanage -H myserver -C public Power Supply 0 [AC] needs attention: Presence detected, Failure detected, AC lost Voltage sensor 14 [PS 2 Voltage 2] is INTERNAL ERROR: Use of uninitialized val