Hi Ton!

On Wed, 2009-11-04 at 14:49 +0000, Ton Voon wrote:
> Hi Henry,
> 
> On 4 Nov 2009, at 13:08, Henry wrote:
> 
> > Ergo: It looks like a trivial change to Opsview:
> >  -- manage two more config lines per service:
> > - 'check_freshness'
> > - 'freshness_threshold'
> >  -- add a checkbox and an input field to the GUI
> > So why not add it? (This is a feature request)
> > 
> 
> 
> This makes sense. Would it be for active checks as well as passive
> checks?

It doesn't apply to active checks, since Nagios already keeps their
results recent (fresh) by actively running and retrying the check.

IMHO freshness checking is indispensable for passive checks.
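
To make the request concrete, here is a minimal Python sketch of the two
extra config lines Opsview would have to write per service. The function
name is hypothetical and this is not Opsview's actual code; it only
assumes that Nagios reads freshness_threshold in seconds.

```python
def render_freshness_directives(threshold_seconds: int) -> str:
    """Return the two Nagios service directives discussed in this thread.

    Hypothetical helper for illustration only, not part of Opsview.
    """
    if threshold_seconds <= 0:
        raise ValueError("freshness threshold must be a positive number of seconds")
    lines = [
        "check_freshness        1",
        f"freshness_threshold    {threshold_seconds}",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Example: a 26 hour threshold, expressed in seconds.
    print(render_freshness_directives(26 * 60 * 60))
```

The two lines would sit inside the service definition of the passive
check; everything else in that definition stays as it is today.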

> I'm thinking that the freshness_threshold should always be set to a
> number to signify the duration of the freshness interval, rather than
> letting Nagios calculate it. Your particular use case is about a time
> triggered job so you should know the frequency of this. We can have a
> checkbox for freshness, and onclick, take the check interval if set
> (error if no value set).

Exactly! (Nagios can't calculate the threshold, as it has no knowledge
of how often the remote, passive check results get sent.)
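
The checkbox behaviour you describe could look roughly like the Python
sketch below. It assumes the check interval is stored in minutes and that
the threshold is wanted in seconds; the function name is made up for
illustration and says nothing about how Opsview is implemented.

```python
from typing import Optional


def threshold_from_check_interval(check_interval_minutes: Optional[int]) -> int:
    """Derive a freshness threshold (seconds) from the check interval.

    Mirrors the idea from the thread: when the freshness checkbox is
    ticked, take the check interval, and report an error if none is set.
    Hypothetical helper; the minutes-to-seconds units are an assumption.
    """
    if check_interval_minutes is None or check_interval_minutes <= 0:
        raise ValueError("freshness checking needs a check interval to be set")
    return check_interval_minutes * 60


if __name__ == "__main__":
    print(threshold_from_check_interval(30))
```

The user could still overwrite the prefilled value, e.g. 45 minutes for a
job that runs every 30 minutes.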


> Opsview also sets an additional amount to the freshness threshold.
> This is 30 minutes by default. The reason is that when we set
> freshness values on the Opsview master, we don't want to get lots of
> alerts if a slave is down until this additional amount has passed.
> Will this be a problem?

That depends.
To illustrate with my two simplest use cases:

- A backup job that runs once a day has a threshold of 26 hours (to
avoid race conditions). An additional 30 minutes won't matter.

- A batch job that has to run every 30 minutes (60-minute SLA). We
send a passive result to Opsview just before the job exits
(successfully or not). ATM the threshold is 45 minutes, so we have
enough time to troubleshoot/restart the job if it doesn't send a
result. Here an additional 30 minutes hurts.

Hmm. We'll have to think this over.
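
The arithmetic behind the two cases, as a small Python sketch. It assumes
the additional amount is simply added on top of the configured threshold
(that is how I read your description); the names are mine, not Opsview's.

```python
OPSVIEW_EXTRA_MINUTES = 30  # default additional amount mentioned by Ton


def minutes_until_stale_alert(threshold_minutes: int,
                              extra_minutes: int = OPSVIEW_EXTRA_MINUTES) -> int:
    """Minutes after the last result before a stale alert can fire.

    Assumption: the extra amount is added to the configured threshold.
    """
    return threshold_minutes + extra_minutes


def alert_arrives_within_sla(threshold_minutes: int, sla_minutes: int,
                             extra_minutes: int = OPSVIEW_EXTRA_MINUTES) -> bool:
    """True if the stale alert fires no later than the SLA deadline."""
    return minutes_until_stale_alert(threshold_minutes, extra_minutes) <= sla_minutes


if __name__ == "__main__":
    # Daily backup: 26 h threshold, the extra 30 minutes hardly matter.
    print(minutes_until_stale_alert(26 * 60))
    # Batch job: 45 min threshold against a 60 min SLA.
    print(minutes_until_stale_alert(45), alert_arrives_within_sla(45, 60))
```

For the batch job the alert would only arrive after 75 minutes, i.e.
after the 60-minute SLA has already been missed.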

> I guess you also want to define what happens when this freshness check
> interval is reached. IE, raise a warning or critical error and
> possibly some specific text too. At least two more fields then :)

You've got me here :)


> > 
> > And maybe I can get it sponsored. But I need to know how to sponsor.
> > So please let me know how to join the party.
> > 
> 
> 
> I'm happy to discuss how this could be solved on the mailing list
> (especially as others can see whether it would work for them).
> 
> 
> When you want to proceed with sponsoring, you can send to me directly
> or visit http://opsview.org/development
> 
> 
> Ton

-- 
regards, Henry


_______________________________________________
Opsview-users mailing list
[email protected]
http://lists.opsview.org/lists/listinfo/opsview-users
