Re: [Pacemaker] clear failcount when monitor is successful?

Johan Huysmans Wed, 24 Apr 2013 00:18:26 -0700

In our setup a resource can recover automatically, with no human intervention needed.

Therefore our cluster should be able to recover automatically.


gr.
Johan

On 24-04-13 08:47, Michael Schwartzkopff wrote:


On Wednesday, 24 April 2013, 08:35:29, Johan Huysmans wrote:

> I tried the failure-timeout.

> But I noticed that when the failure-timeout resets the failcount the

> resource becomes OK in the crm_mon view.

> However the resource is still failing.

>

> This shouldn't happen. Can this behaviour be changed with some setting?

>

> gr.

> Johan

>

> On 24-04-13 07:23, Andrew Beekhof wrote:

> > On 23/04/2013, at 11:24 PM, Johan Huysmans <johan.huysm...@inuits.be> wrote:


> >> Hi All,

> >>

> >> I have a cloned resource running on both of my nodes, and my on-fail is set to

> >> block. So if the resource fails on a node the failcount increases, but

> >> whenever the resource automatically recovers the failcount isn't reset.


> >>

> >> Is there a way to reset the failcount to 0, when the monitor is

> >> successful?

> >

> > No, but you can expire them after a period of time.

> > _______________________________________________

> > Pacemaker mailing list: Pacemaker@oss.clusterlabs.org

> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker

> >

> > Project Home: http://www.clusterlabs.org

> > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf


> > Bugs: http://bugs.clusterlabs.org


When a resource fails and the failcounter increases, that is a reason for an administrator to step in and look at what went wrong. On that occasion the admin can also clean the failcounter.

During the normal operation of a cluster a failure should not happen. A non-zero failcounter is always a sign of a problem.
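
To make the difference between the options discussed in this thread concrete, here is a minimal Python sketch of a failcount that either expires after a timeout or is cleaned by hand. It is a toy model only, not Pacemaker code: the class name FailCount and its methods are hypothetical, and the assumption that expiry is measured from the most recent failure is a simplification.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FailCount:
    """Toy model of a per-node failcount (hypothetical, not Pacemaker code)."""

    # Seconds after the last failure at which the count is treated as expired.
    # None means the count never expires on its own.
    failure_timeout: Optional[float] = None
    count: int = 0
    last_failure: Optional[float] = None

    def record_failure(self, now: float) -> None:
        """Register one more failure observed at time `now`."""
        self.count += 1
        self.last_failure = now

    def effective(self, now: float) -> int:
        """Return the count as seen at time `now`, honouring the timeout."""
        expired = (
            self.failure_timeout is not None
            and self.last_failure is not None
            and now - self.last_failure >= self.failure_timeout
        )
        return 0 if expired else self.count

    def cleanup(self) -> None:
        """Manual reset, as an administrator would do after investigating."""
        self.count = 0
        self.last_failure = None


if __name__ == "__main__":
    fc = FailCount(failure_timeout=60)
    fc.record_failure(now=0)
    fc.record_failure(now=10)
    print(fc.effective(now=30))  # still within the timeout window
    print(fc.effective(now=70))  # 60 s after the last failure
```

Note that in this model expiry depends only on the time since the last recorded failure, not on whether a monitor has succeeded in the meantime. Under that assumption the count can drop to zero while the resource is still unhealthy, which would be consistent with what Johan describes above.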


Greetings,

--

Dr. Michael Schwartzkopff

Guardinistr. 63

81375 München

Tel: (0163) 172 50 98


