Re: How much downtime do we afford for nagios?

2008-04-27 Thread Nigel Jones
Hi, Hi, For a few days false notification of nagios reduced. But it has increased again. You sure? Looking at the /configs/system/nagios/services/template.cfg reveals that it is configured as max_check_attempt = 4 and retry_check_interval 1 for hosts and max_check_attempts = 3 and

Re: How much downtime do we afford for nagios?

2008-04-27 Thread susmit shannigrahi
So if a service or host is unreachable for 3 or 4 mins, we get a notification. (However most of the cases it is false positive, due to congestion or others). Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Nigel Jones
So if a service or host is unreachable for 3 or 4 mins, we get a notification. (However most of the cases it is false positive, due to congestion or others). Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Jeroen van Meeuwen
Nigel Jones wrote: Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all the other collateral notifications. Collateral notifications can be caught using service dependencies and parent hosts. Do we currently use any?

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Nigel Jones
On Sun, April 27, 2008 11:01 pm, Jeroen van Meeuwen wrote: Nigel Jones wrote: Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all the other collateral notifications. Collateral notifications can be caught using service

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Mike McGrath
On Sun, 27 Apr 2008, susmit shannigrahi wrote: So if a service or host is unreachable for 3 or 4 mins, we get a notification. (However most of the cases it is false positive, due to congestion or others). Looking through my email, from what I can recall there are no false

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Mike McGrath
On Sun, 27 Apr 2008, Jeroen van Meeuwen wrote: Nigel Jones wrote: Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all the other collateral notifications. Collateral notifications can be caught using service

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Mike McGrath
On Mon, 28 Apr 2008, Nigel Jones wrote: On Sun, April 27, 2008 11:01 pm, Jeroen van Meeuwen wrote: Nigel Jones wrote: Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all the other collateral notifications.

How much downtime do we afford for nagios?

2008-04-26 Thread susmit shannigrahi
Hi, For a few days false notification of nagios reduced. But it has increased again. Looking at the /configs/system/nagios/services/template.cfg reveals that it is configured as max_check_attempt = 4 and retry_check_interval 1 for hosts and max_check_attempts = 3 and retry_check_interval 1.