Re: [Nagios-users] alerting flakey
sounds like you've seen this before? I did as you advised. stop restart. I don't know what other processes to look for besides the one below. There weren't any running.

isk-nagios:/usr/local/nagios/etc # ps -ef | grep nagios
nagios   25494     1  0 14:48 ?        00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
root     25501 25032  0 14:50 pts/0    00:00:00 grep nagios

-----Original Message-----
From: Santhosh Kumar A [mailto:[EMAIL PROTECTED]
Sent: Wed 3/7/2007 6:43 AM
To: Ezra Radoff; nagios-users@lists.sourceforge.net
Subject: RE: [Nagios-users] alerting flakey

From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Ezra Radoff
Sent: Wednesday, March 07, 2007 1:28 PM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] alerting flakey

hello. I've been using nagios for a couple of months now pretty successfully, but I've noticed that the alerting function is a bit flakey. I've been over the configuration many times, but everything seems fine. The amount of alerting it does seems to change after I restart the service with /etc/init.d/nagios restart. It was sending warning and criticals. Then, after a restart, it wasn't sending service critical alerts. Then I restarted it again. It wasn't sending anything. Then I restarted it again, and it was sending warnings.

Check whether multiple nagios daemons are running or not. Stop nagios and ensure every nagios process is killed, then do a start (don't use restart).

Santhosh

I'm using version 2.6 which I got from the CVS tree a couple of months ago. Can anybody give me a little help on this one? The alert just calls a script I wrote by hand which is referenced in the commands.cfg. I don't use the groups or anything. No alert attempt is showing up in the event log either.

Thanks

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
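The check suggested in this message (stop Nagios, confirm that no nagios daemon survives, then start it) can be sketched roughly as follows. This is only an illustration, not something from the thread: the function name find_nagios_daemons and the SAMPLE text are hypothetical, the binary path is taken from the ps output above, and the parser assumes the usual eight-column `ps -ef` layout.

```python
from typing import List

# Path of the daemon binary as it appears in the ps output quoted in the thread.
NAGIOS_BINARY = "/usr/local/nagios/bin/nagios"


def find_nagios_daemons(ps_output: str, binary: str = NAGIOS_BINARY) -> List[int]:
    """Return the PIDs of processes whose command is the Nagios binary.

    Assumes `ps -ef` style columns: UID PID PPID C STIME TTY TIME CMD.
    """
    pids = []
    for line in ps_output.splitlines():
        fields = line.split(None, 7)  # keep the whole command as the last field
        if len(fields) < 8:
            continue  # blank or malformed line
        command = fields[7].split()
        if command and command[0] == binary:
            pids.append(int(fields[1]))
    return pids


# Hypothetical sample, modelled on the output pasted in the message.
SAMPLE = """\
UID        PID  PPID  C STIME TTY          TIME CMD
nagios   25494     1  0 14:48 ?        00:00:00 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg
root     25501 25032  0 14:50 pts/0    00:00:00 grep nagios
"""

if __name__ == "__main__":
    daemons = find_nagios_daemons(SAMPLE)
    # More than one PID right after a start would point at a leftover daemon.
    print(daemons)
```

Matching on the command's first word, rather than grepping for "nagios", keeps the `grep nagios` process itself out of the result. After a stop the list should be empty; after a start it should hold exactly one PID.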
Re: [Nagios-users] alerting flakey
OK. It's definitely none of those. Take a look.

define service{
        use                     local-service
        hostgroup_name          cisco_routers
        service_description     Cisco_load
        check_command           check_snmp_load_cisco!cisco!90,80,60!100,100,100
        }

##

define service{
        name                    local-service   ; The name of this service template
        use                     generic-service ; Inherit default values from the generic-service definition
        check_period            24x7            ; The service can be checked at any time of the day
        max_check_attempts      4               ; Re-check the service up to 4 times in order to determine its final (hard) state
        normal_check_interval   5               ; Check the service every 5 minutes under normal conditions
        retry_check_interval    1               ; Re-check the service every minute until a hard state can be determined
        contact_groups          admins          ; Notifications get sent out to everyone in the 'admins' group
        notification_options    w,u,c,r         ; Send notifications about warning, unknown, critical, and recovery events
        notification_interval   60              ; Re-notify about service problems every hour
        notification_period     24x7            ; Notifications can be sent out at any time
        register                0               ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!
        }

##

Whether a critical alert gets generated or not can depend on the notification_options in the service definition, the host definition and/or the contact definition. Whether notifications are generated at all can depend on notifications_enabled in the host or service definition, on the timeperiod in the contact definition, and on the global setting in the nagios configuration; they can also be dynamically enabled/disabled for hosts, services and for nagios as a whole.

My guess is that it might be something quite simple in the notification_options somewhere. See http://nagios.sourceforge.net/docs/2_0/notifications.html

Another option worth trying is check_for_orphaned_services in your main nagios.cfg file. See: http://nagios.sourceforge.net/docs/2_0/configmain.html#check_for_orphaned_services

Cheers,
Jim
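The layered filtering described in this message can be illustrated with a deliberately simplified sketch. It is not Nagios's actual logic, which also weighs time periods, flapping, downtime and host state; it only models the idea that a state has to be listed in both the service's and the contact's options. The function names are hypothetical, and the contact value "w,u,r" is an invented example, since the thread never shows the contact definition.

```python
from typing import Set


def parse_options(value: str) -> Set[str]:
    """Split a Nagios-style option string such as 'w,u,c,r' into a set of flags."""
    return {flag.strip() for flag in value.split(",") if flag.strip()}


def notification_allowed(
    state: str,
    service_options: str,
    contact_options: str,
    notifications_enabled: bool = True,
) -> bool:
    """Simplified model: the state must pass the service and the contact filter."""
    if not notifications_enabled:
        return False  # a disabled switch suppresses everything
    return state in parse_options(service_options) and state in parse_options(contact_options)


if __name__ == "__main__":
    service = "w,u,c,r"   # from the local-service template above
    contact = "w,u,r"     # hypothetical contact setting that omits 'c'
    print(notification_allowed("w", service, contact))  # True
    print(notification_allowed("c", service, contact))  # False
```

With these assumed values, warnings go out while criticals are dropped, which is the symptom reported in the thread. It suggests checking service_notification_options in the contact definition as well as notification_options in the service.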
Re: [Nagios-users] alerting flakey
No, but I'm thinking now that it's always sending warnings and never sending criticals. It's not flapping. We had a server down for hours. It wasn't sending the warnings because it only does it after four. I think that part has been consistent. In the service def it looks like all four states are configured for sending alerts. I don't get it.

-----Original Message-----
From: [EMAIL PROTECTED] on behalf of Jim Avery
Sent: Wed 3/7/2007 2:48 AM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] alerting flakey

On 07/03/07, Ezra Radoff <[EMAIL PROTECTED]> wrote:
> hello. I've been using nagios for a couple of months now pretty
> successfully, but I've noticed that the alerting function is a bit flakey.
> I've been over the configuration many times, but everything seems fine. The
> amount of alerting it does seems to change after I restart the service with
> /etc/init.d/nagios restart. It was sending warning and criticals. Then,
> after a restart, it wasn't sending service critical alerts. Then I restarted
> it again. It wasn't sending anything. Then I restarted it again, and it was
> sending warnings.

It's not because the hosts or services are flapping is it?
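The "only does it after four" remark refers to max_check_attempts: a problem normally has to persist through that many checks before it becomes a hard state and a notification is considered. A rough arithmetic sketch, assuming the values from the local-service template posted in the thread (max_check_attempts 4, retry_check_interval 1) and the default interval_length of 60 seconds; the function name is hypothetical:

```python
def minutes_until_hard_state(
    max_check_attempts: int,
    retry_check_interval: float,
    interval_length_seconds: int = 60,
) -> float:
    """Approximate delay between the first failed check and the hard state.

    The first failed check counts as attempt 1, so only the remaining
    attempts are spent waiting at the retry interval.
    """
    retries = max_check_attempts - 1
    return retries * retry_check_interval * interval_length_seconds / 60


if __name__ == "__main__":
    # Values taken from the local-service template in the thread.
    print(minutes_until_hard_state(4, 1))  # 3.0
```

Under these assumptions the first notification comes about three minutes after the first failed check, so a server that stays down for hours should be well past that threshold.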
[Nagios-users] alerting flakey
hello. I've been using nagios for a couple of months now pretty successfully, but I've noticed that the alerting function is a bit flakey. I've been over the configuration many times, but everything seems fine. The amount of alerting it does seems to change after I restart the service with /etc/init.d/nagios restart. It was sending warning and criticals. Then, after a restart, it wasn't sending service critical alerts. Then I restarted it again. It wasn't sending anything. Then I restarted it again, and it was sending warnings.

I'm using version 2.6 which I got from the CVS tree a couple of months ago. Can anybody give me a little help on this one? The alert just calls a script I wrote by hand which is referenced in the commands.cfg. I don't use the groups or anything. No alert attempt is showing up in the event log either.

Thanks