Re: [Nagios-users] alerting flakey

2007-03-07 Thread Ezra Radoff
sounds like you've seen this before?
I did as you advised. stop restart. I don't know what other processes to look 
for besides the one below. There weren't any running.

isk-nagios:/usr/local/nagios/etc # ps -ef | grep nagios
nagios   25494 1  0 14:48 ?00:00:00 /usr/local/nagios/bin/nagios -d 
/usr/local/nagios/etc/nagios.cfg
root 25501 25032  0 14:50 pts/000:00:00 grep nagios



-Original Message-
From: Santhosh Kumar A [mailto:[EMAIL PROTECTED]
Sent: Wed 3/7/2007 6:43 AM
To: Ezra Radoff; nagios-users@lists.sourceforge.net
Subject: RE: [Nagios-users] alerting flakey
 
 

 



From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Ezra
Radoff
Sent: Wednesday, March 07, 2007 1:28 PM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] alerting flakey

 

 

hello. I've been using nagios for a couple of months now pretty
successfully, but I've noticed that the alerting function is a bit
flakey. I've been over the configuration many times, but everything
seems fine. The amount of alerting it does seems to change after I
restart the service with /etc/init.d/nagios restart. It was sending
warning and criticals. Then, after a restart, it wasn't sending service
critical alerts. Then I restarted it again. It wasn't sending anything.
Then I restarted it again, and it was sending warnings.



 check whether multiple nagios daemons running or not .  

stop  nagios and ensure every nagios process is killed then do a start
(don't use restart)

Santhosh


I'm using version 2.6 which I got from the CVS tree a couple of months
ago.

Can anybody give me a little help on this one?

The alert just calls a script I wrote by hand which is referenced in the
commands.cfg . I don't use the groups or anything.

No alert attempt is showing up in the event log either.

Thanks 


-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] alerting flakey

2007-03-07 Thread Ezra Radoff

OK. It's definatly none of those. take a look.

define service{
use local-service
hostgroup_name  cisco_routers
service_description Cisco_load
check_command   
check_snmp_load_cisco!cisco!90,80,60!100,100,100
}

##

define service{
namelocal-service   ; The name of 
this service template
use generic-service ; Inherit 
default values from the generic-service definition
check_period24x7; The service 
can be checked at any time of the day
max_check_attempts  4   ; Re-check the 
service up to 4 times in order to determine its final (hard) state
normal_check_interval   5   ; Check the 
service every 5 minutes under normal conditions
retry_check_interval1   ; Re-check the 
service every minute until a hard state can be determined
contact_groups  admins  ; Notifications 
get sent out to everyone in the 'admins' group
notification_optionsw,u,c,r ; Send 
notifications about warning, unknown, critical, and recovery events
notification_interval   60  ; Re-notify 
about service problems every hour
notification_period 24x7; Notifications 
can be sent out at any time
register0   ; DONT REGISTER 
THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!

}

##


Whether a critical alert gets generated or not can depend on the
notification_options in the service definition, the host definition
and/or the contact definition.

Whether notifications are generated at all can depend on
notification_enabled in the host or service definition, on the
timeperiod in the contact definition, globally in the nagios
configuration and it can be dynamically enabled/disabled for hosts,
services and for nagios as a whole.

My guess is that it might be something quite simple in the
notification_options somewhere.  See
http://nagios.sourceforge.net/docs/2_0/notifications.html

Another option worth trying is check_for_orphaned_services in your
main nagios.cfg file.  See:
http://nagios.sourceforge.net/docs/2_0/configmain.html#check_for_orphaned_services

Cheers,

Jim

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] alerting flakey

2007-03-07 Thread Ezra Radoff
No, but I'm thinking now that it's always sending warnings and never sending 
criticals.
It's not flapping. We had a server down for hours.
It wasn't sending the warnings because it only does it after four. I think that 
part has been consistant.
In the service def it looks like all four states are configured for sending 
alerts. I don't get it.


-Original Message-
From: [EMAIL PROTECTED] on behalf of Jim Avery
Sent: Wed 3/7/2007 2:48 AM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] alerting flakey
 
On 07/03/07, Ezra Radoff <[EMAIL PROTECTED]> wrote:
> hello. I've been using nagios for a couple of months now pretty
> successfully, but I've noticed that the alerting function is a bit flakey.
> I've been over the configuration many times, but everything seems fine. The
> amount of alerting it does seems to change after I restart the service with
> /etc/init.d/nagios restart. It was sending warning and criticals. Then,
> after a restart, it wasn't sending service critical alerts. Then I restarted
> it again. It wasn't sending anything. Then I restarted it again, and it was
> sending warnings.

It's not because the hosts or services are flapping is it?

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] alerting flakey

2007-03-07 Thread Ezra Radoff


hello. I've been using nagios for a couple of months now pretty successfully, 
but I've noticed that the alerting function is a bit flakey. I've been over the 
configuration many times, but everything seems fine. The amount of alerting it 
does seems to change after I restart the service with /etc/init.d/nagios 
restart. It was sending warning and criticals. Then, after a restart, it wasn't 
sending service critical alerts. Then I restarted it again. It wasn't sending 
anything. Then I restarted it again, and it was sending warnings. 

I'm using version 2.6 which I got from the CVS tree a couple of months ago.

Can anybody give me a little help on this one?

The alert just calls a script I wrote by hand which is referenced in the 
commands.cfg . I don't use the groups or anything.

No alert attempt is showing up in the event log either.

Thanks
-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null