On 3/16/06, Marc Powell <[EMAIL PROTECTED]> wrote:
> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED]] On Behalf Of Mike Linden
> Sent: Thursday, March 16, 2006 8:08 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] phantom host up messages
>
> Hi,
> I have a couple systems that have been unplugged and removed from the
> rack, yet Nagios still reports a Host UP alert for them from the
> check_ping process.
> How is this possible?
>
> ***** Nagios *****
>
> Notification Type: RECOVERY
> Host: csep1039
> State: UP
> Address: csep1039
> Info: (No output!)
>
> Date/Time: Thu Mar 16 08:56:42 EST 2006
The plugin/command you are using to check the host is providing no
command line output but is still exiting with a code of 0 (successful).
Try running the check_command, as the nagios user, exactly as it's
defined, from the command line and then 'echo $?'. The results of the
echo for a down host should be '2'. Chances are you're going to see some
additional error output from ping/check_ping.
There was also a suggestion earlier for another issue to add '2>&1' to
the end of the check_command definition to redirect error output back to
Nagios. If that works as advertised, and I believe it will, that could
be informative as well.
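
To illustrate the suggestion above, here is a minimal Python sketch that runs a check command, merges stderr into stdout (the same effect as appending '2>&1'), and reports the exit code next to the standard plugin states (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN). The helper names run_check and describe are made up for this example and are not part of Nagios or its plugins; run it as the nagios user so the environment matches.

```python
import subprocess

# Standard Nagios plugin exit codes.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def run_check(argv):
    """Run a plugin command and return (exit_code, combined_output).

    Hypothetical helper for illustration only.
    """
    proc = subprocess.run(
        argv,
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,  # same effect as appending 2>&1
        universal_newlines=True,
    )
    return proc.returncode, proc.stdout.strip()


def describe(code, output):
    """Format the result the way one would eyeball it after 'echo $?'."""
    state = STATES.get(code, "not a valid plugin exit code")
    return "exit=%d (%s) output=%r" % (code, state, output or "(No output!)")
```

For example, print(describe(*run_check(["./check_ping", "-H", "csep1039", "-w", "100.0,20%", "-c", "500.0,60%"]))) would show the exit code and any error text together. Note that when Python starts the command directly (no shell), a process killed by a signal is reported as a negative return code rather than the shell's 128+N value.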
--
Marc
Marc,
Indeed, there are other issues here.
check_ping is segfaulting and exiting with status 139:
./check_ping -H csep0602 -w 100.0,20% -c 500.0,60% ; echo $?
Segmentation fault
139
This is what results in the host UP message.
Do you have any idea why this would occur on a seemingly random basis? I manually ran the same command against other servers and got the same output.
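
For reference, the 139 printed by 'echo $?' is the shell's way of saying the process was killed by a signal: the shell reports 128 plus the signal number, and 139 - 128 = 11, which is SIGSEGV on Linux. The small Python sketch below decodes such a status; explain_status is a made-up helper name, and the sketch says nothing about how Nagios itself treats an out-of-range status.

```python
import signal

# Standard Nagios plugin exit codes.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def explain_status(status):
    """Explain a shell-reported exit status (the value of $?).

    Hypothetical helper for illustration only.
    """
    if status in STATES:
        return "plugin state %s" % STATES[status]
    if status > 128:
        # Shell convention: 128 + N means "terminated by signal N".
        signum = status - 128
        try:
            name = signal.Signals(signum).name
        except ValueError:
            name = "unknown signal"
        return "killed by signal %d (%s)" % (signum, name)
    return "exit code %d is outside the plugin range 0-3" % status
```

Calling explain_status(139) reports signal 11, so the plugin never got as far as returning a real OK or CRITICAL result.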
Thanks
Mike
--
Mike Linden
http://linden.linuxps.com