Neither the cluster manager nor the RA can know that the error is temporary.
You can only know that with the benefit of hindsight.
So what you're asking for is that the cluster ignores the first N errors...
which doesn't sound very "HA".
The better approach is write the RA in such a way that it do
Donnerstag, 19. April 2012 14:36
To: pacemaker@oss.clusterlabs.org
Subject: Re: [Pacemaker] OCF Resource agent monitor activity failed due to
temporary error
On 04/19/2012 01:59 PM, Kulovits Christian - OS ITSC wrote:
> Hi Andreas,
> Exactly this is what i want pacemaker to do when my RA i
ent: Donnerstag, 19. April 2012 13:51
> To: pacemaker@oss.clusterlabs.org
> Subject: Re: [Pacemaker] OCF Resource agent monitor activity failed due to
> temporary error
>
> Hi Christian,
>
> On 04/19/2012 01:38 PM, Kulovits Christian - OS ITSC wrote:
>> Hi, Andreas
>>
ed in every RA instead of
once in pacemaker.
Christian
-Original Message-
From: Andreas Kurz [mailto:andr...@hastexo.com]
Sent: Donnerstag, 19. April 2012 13:51
To: pacemaker@oss.clusterlabs.org
Subject: Re: [Pacemaker] OCF Resource agent monitor activity failed due to
temporary erro
/www.hastexo.com/services/remote
>
> -Original Message-
> From: Andreas Kurz [mailto:andr...@hastexo.com]
> Sent: Donnerstag, 19. April 2012 11:44
> To: pacemaker@oss.clusterlabs.org
> Subject: Re: [Pacemaker] OCF Resource agent monitor activity failed due to
> temporary error
:44
To: pacemaker@oss.clusterlabs.org
Subject: Re: [Pacemaker] OCF Resource agent monitor activity failed due to
temporary error
On 04/19/2012 11:35 AM, emmanuel segura wrote:
> on-fail attribute
well, if you ignore a monitor failure you actually can disable
monitoring completely.
The correct
On 04/19/2012 11:35 AM, emmanuel segura wrote:
> on-fail attribute
well, if you ignore a monitor failure you actually can disable
monitoring completely.
The correct way to deal with that problem is to fix the RA ... patches
are always welcome ;-)
Regards,
Andreas
--
Need help with Pacemaker?
h
on-fail attribute
Il giorno 19 aprile 2012 11:29, Kulovits Christian - OS ITSC <
christian.kulov...@austrian.com> ha scritto:
> Hi,
>
> During a monitor activity for a SRDF Resource a temporary error occurred
> and the resource agent cannot determine the state of the resource and
> returned
Hi,
During a monitor activity for a SRDF Resource a temporary error occurred and
the resource agent cannot determine the state of the resource and returned
OCF_ERR_GENERIC. The cluster restarted the resource and all depending resources
as designed. Is there a way to say that this failed monitor