How much downtime do we afford for nagios?

2008-04-26 Thread susmit shannigrahi
Hi, For a few days false notification of nagios reduced. But it has increased again. Looking at the /configs/system/nagios/services/template.cfg reveals that it is configured as max_check_attempt = 4 and retry_check_interval 1 for hosts and max_check_attempts = 3 and retry_check_interval 1. S

Re: How much downtime do we afford for nagios?

2008-04-26 Thread Nigel Jones
> Hi, Hi, > > For a few days false notification of nagios reduced. But it has increased > again. You sure? > > Looking at the /configs/system/nagios/services/template.cfg reveals > that it is configured as > max_check_attempt = 4 and retry_check_interval 1 for hosts > and > max_check_attempts = 3

Re: How much downtime do we afford for nagios?

2008-04-26 Thread susmit shannigrahi
> > So if a service or host is unreachable for 3 or 4 mins, we get a > > notification. (However most of the cases it is false positive, due to > > congestion or others). > Looking through my email, from what I can recall there are no false > positives. xen6 had to be power-cycled which caused

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Nigel Jones
>> > So if a service or host is unreachable for 3 or 4 mins, we get a >> > notification. (However most of the cases it is false positive, due to >> > congestion or others). >> Looking through my email, from what I can recall there are no false >> positives. xen6 had to be power-cycled which c

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Jeroen van Meeuwen
Nigel Jones wrote: Looking through my email, from what I can recall there are no false positives. xen6 had to be power-cycled which caused all the other collateral notifications. Collateral notifications can be caught using service dependencies and parent hosts. Do we currently use any? Ki

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Nigel Jones
On Sun, April 27, 2008 11:01 pm, Jeroen van Meeuwen wrote: > Nigel Jones wrote: >> Looking through my email, from what I can recall there are no false >> positives. xen6 had to be power-cycled which caused all the other >> collateral notifications. >> > > Collateral notifications can be caught usi

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Mike McGrath
On Sun, 27 Apr 2008, susmit shannigrahi wrote: > > > So if a service or host is unreachable for 3 or 4 mins, we get a > > > notification. (However most of the cases it is false positive, due to > > > congestion or others). > > Looking through my email, from what I can recall there are no fals

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Mike McGrath
On Sun, 27 Apr 2008, Jeroen van Meeuwen wrote: > Nigel Jones wrote: > > Looking through my email, from what I can recall there are no false > > positives. xen6 had to be power-cycled which caused all the other > > collateral notifications. > > > > Collateral notifications can be caught using serv

Re: How much downtime do we afford for nagios?

2008-04-27 Thread Mike McGrath
On Mon, 28 Apr 2008, Nigel Jones wrote: > On Sun, April 27, 2008 11:01 pm, Jeroen van Meeuwen wrote: > > Nigel Jones wrote: > >> Looking through my email, from what I can recall there are no false > >> positives. xen6 had to be power-cycled which caused all the other > >> collateral notifications