[Pacemaker] migration-threshold causing unnecessary restart of underlying resources

2010-08-11 Thread Cnut Jansen
Hi, I'm once again experiencing (imho) strange behaviour, or rather strange decision-making, by Pacemaker, and I hope that someone can either enlighten me a little about it, its intention, and/or a possible misconfiguration, or confirm it as a possible bug. Basically I have a cluster of

Re: [Pacemaker] lrmd WARN on high IO load

2010-08-11 Thread Dejan Muhamedagic
Hi, On Wed, Aug 11, 2010 at 05:17:03PM -0300, Diego Woitasen wrote: > Hi > > 2010/8/2 Dejan Muhamedagic : > > Hi, > > > > On Mon, Jul 19, 2010 at 07:09:11PM -0300, Diego Woitasen wrote: > >> 2010/7/16 Diego Woitasen : > >> > Hi, > >> >  I've installed Heartbeat+Pacemaker (3.0.3 and 1.0.9). I have

Re: [Pacemaker] Temporarely suspending monitoring

2010-08-11 Thread Dejan Muhamedagic
Hi, On Thu, Aug 12, 2010 at 12:14:59AM +0200, Bart Coninckx wrote: > On Wednesday 11 August 2010 23:55:42 Bart Coninckx wrote: > > On Wednesday 11 August 2010 23:01:22 Vince Gabriel wrote: > > > > -Original Message- > > > > From: Bart Coninckx [mailto:bart.conin...@telenet.be] > > > > Sent

Re: [Pacemaker] Temporarely suspending monitoring

2010-08-11 Thread Bart Coninckx
On Wednesday 11 August 2010 23:55:42 Bart Coninckx wrote: > On Wednesday 11 August 2010 23:01:22 Vince Gabriel wrote: > > > -Original Message- > > > From: Bart Coninckx [mailto:bart.conin...@telenet.be] > > > Sent: Wednesday, August 11, 2010 10:49 AM > > > To: pacemaker@oss.clusterlabs.org

Re: [Pacemaker] Temporarely suspending monitoring

2010-08-11 Thread Bart Coninckx
On Wednesday 11 August 2010 23:01:22 Vince Gabriel wrote: > > -Original Message- > > From: Bart Coninckx [mailto:bart.conin...@telenet.be] > > Sent: Wednesday, August 11, 2010 10:49 AM > > To: pacemaker@oss.clusterlabs.org > > Subject: [Pacemaker] Temporarely suspending monitoring > > > > H

Re: [Pacemaker] Need help using OCFS2 with openais/pacemaker

2010-08-11 Thread Dejan Muhamedagic
On Wed, Aug 11, 2010 at 11:11:45PM +0300, Vladislav Bogdanov wrote: > 11.08.2010 22:09, patrick.ouel...@promutuel.ca wrote: > > First of all, wow guys great software I love it so far. > > > > Second, I hope im posting this at the right place or i'll get flamed. > > > > I have followed the great

Re: [Pacemaker] Temporarely suspending monitoring

2010-08-11 Thread Vince Gabriel
> -Original Message- > From: Bart Coninckx [mailto:bart.conin...@telenet.be] > Sent: Wednesday, August 11, 2010 10:49 AM > To: pacemaker@oss.clusterlabs.org > Subject: [Pacemaker] Temporarely suspending monitoring > > Hi, > > We're using the Xen resource agents with an operation "moni

[Pacemaker] IPaddr2 not failing-over

2010-08-11 Thread Vince Gabriel
Hi everyone, I have a new cluster that works exceptionally well, with the exception of failovers initiated by the IPaddr2 virtual interfaces. If the interface is downed or the cable disconnected, a failover never happens. I've attempted to incorporate pingd; however, that has not helped either. It's my un

Re: [Pacemaker] lrmd WARN on high IO load

2010-08-11 Thread Diego Woitasen
Hi 2010/8/2 Dejan Muhamedagic : > Hi, > > On Mon, Jul 19, 2010 at 07:09:11PM -0300, Diego Woitasen wrote: >> 2010/7/16 Diego Woitasen : >> > Hi, >> >  I've installed Heartbeat+Pacemaker (3.0.3 and 1.0.9). I have a >> > resource which executes an script to check the service: >> > >> > primitive kol

Re: [Pacemaker] Need help using OCFS2 with openais/pacemaker

2010-08-11 Thread Vladislav Bogdanov
11.08.2010 22:09, patrick.ouel...@promutuel.ca wrote: > First of all, wow guys great software I love it so far. > > Second, I hope im posting this at the right place or i'll get flamed. > > I have followed the great document by Andrew Cluster from scratch > but since im using more recent versio

[Pacemaker] Need help using OCFS2 with openais/pacemaker

2010-08-11 Thread Patrick.Ouellet
First of all, wow guys, great software, I love it so far. Second, I hope I'm posting this at the right place or I'll get flamed. I have followed the great document by Andrew Cluster from scratch but since I'm using a more recent version of the software I'm stuck at adding OCFS2 support to the cluster

[Pacemaker] Temporarely suspending monitoring

2010-08-11 Thread Bart Coninckx
Hi, We're using the Xen resource agents with an operation "monitor" that repeats every 60 seconds. For backing up the Xen machines, we use "xm save" and "xm restore" which takes them offline for a short amount of time (and copies the memory contents to a file). Of course, when the monitor chec

Re: [Pacemaker] Antwort: Re: stonith sbd problem

2010-08-11 Thread Dejan Muhamedagic
Hi, On Wed, Aug 11, 2010 at 11:48:17AM +0200, philipp.achmuel...@arz.at wrote: > i removed the clone, set the global cluster property for stonith-timeout. > > the nodes need about 3-5 minutes to startup after they get "shot" > > i did some more tests and found out that if the node, which runs re

[Pacemaker] Antwort: Re: stonith sbd problem

2010-08-11 Thread philipp . achmueller
i removed the clone, set the global cluster property for stonith-timeout. the nodes need about 3-5 minutes to startup after they get "shot" i did some more tests and found out that if the node, which runs resource sbd_fence, get "shot" the remaining node see the stonith resource online on both

[Pacemaker] Antwort: Re: stonith sbd problem

2010-08-11 Thread philipp . achmueller
>> any ideas on the "unrunnable" problem? >That's expected: one can't run operations on a node which is offline. i would expect a failover of the resources to node lnx0047b. since lnx0047a is stonith'ed, the resources should start on remaining node. >> any ideas on the stonith problem? > We'd ne

Re: [Pacemaker] stonith sbd problem

2010-08-11 Thread Lars Marowsky-Bree
On 2010-08-10T10:16:05, philipp.achmuel...@arz.at wrote: > primitive sbd_fence stonith:external/sbd \ > params sbd_device="/dev/hdisk-4652-38b5" stonith-timeout="60s" > clone fence sbd_fence \ > meta target-role="Started" Like Dejan said, you shouldn't run it as a clone, but this