Re: [Pacemaker] SBD fencing - stonith plugin not installed

2013-02-06 Thread Lars Marowsky-Bree
On 2013-02-06T11:21:57, "Alexandr A. Alexandrov" wrote: > I built everything from current sources (well, pacemaker is not exactly > current, see compilation problem thread), and now plugin works. Excellent. Note that there's also an updated version of sbd on http://hg.linux-ha.org/sbd/ - the bi

[Pacemaker] Node fails to rejoin cluster

2013-02-06 Thread Tal Yalon
Hi all, I have a 2-node cluster, where node-2 got fenced and now after reboot tries to rejoin the cluster but fails and gets stuck in a loop for hours and never joins back. After another reboot it managed to join, and there was no time difference between the nodes. Below is corosync/pacemaker lo

Re: [Pacemaker] Serious issue with booth for site failover

2013-02-06 Thread Lars Marowsky-Bree
On 2013-01-19T12:19:46, Yves Trudeau wrote: > Hi, > Forget this, everything is fine. An iptables rule was missing in my > failure test. Hi Yves, which iptables rule was missing, if I may ask? Regards, Lars -- Architect Storage/HA SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild

Re: [Pacemaker] Pacemaker resource migration behaviour

2013-02-06 Thread James Guthrie
A quick addendum to this message: The log files I provided actually continue until the resources do get started on the host. The trigger for that is the 6-minute failure-timeout timer that pops. As can be seen in pe-input-50, the resources conntrackd, condition, sub-ospfd and sub-ripd are in sl

Re: [Pacemaker] Serious issue with booth for site failover

2013-02-06 Thread Yves Trudeau
Hi Lars, the site supposedly down was still able to send packets to the arbitrator. That caused the ballot number to be increased on the arbitrator so the valid ballot from the surviving site was discarded in acceptor_promise. Although an operational mistake on my side, I wonder if this can
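
The ballot behaviour described above can be illustrated with a minimal sketch of a Paxos-style promise rule. This is an assumption-laden illustration, not booth's code: the `Acceptor` class, the `on_prepare` method and the ballot values 5 and 4 are all hypothetical, and only the general rule (an acceptor refuses any ballot that is not higher than one it has already promised) is taken from Paxos itself.

```python
from dataclasses import dataclass


@dataclass
class Acceptor:
    """Hypothetical Paxos-style acceptor; not booth's actual data structure."""

    promised_ballot: int = 0

    def on_prepare(self, ballot: int) -> bool:
        """Promise only if the ballot is higher than any promised so far."""
        if ballot > self.promised_ballot:
            self.promised_ballot = ballot
            return True
        return False  # stale ballot: the request is discarded


arbitrator = Acceptor()

# The site believed to be down can still reach the arbitrator,
# so its request raises the arbitrator's promised ballot.
leaked = arbitrator.on_prepare(ballot=5)

# The surviving site, unaware of that, proposes with a lower ballot.
survivor = arbitrator.on_prepare(ballot=4)

print(leaked, survivor)
```

Under these assumptions, the surviving site's request is refused even though it is the only healthy site, which matches the symptom reported in the message.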

[Pacemaker] Reboot of cluster members with heavy load on filesystem.

2013-02-06 Thread Carlos Xavier
Hi. We are running two clusters composed of two machines. We are using DRBD + OCFS2 to make the common filesystem. The versions of the components in use are: OpenSuse 12.1 pacemaker-1.1.6-2.1.2.x86_64 corosync-1.4.1-2.1.3.x86_64 openais-1.1.4-9.7.3.x86_64 ocfs2-tools-1.8.0-4.1.2.x86_64 kerne

Re: [Pacemaker] Pacemaker resource migration behaviour

2013-02-06 Thread David Vossel
- Original Message - > From: "James Guthrie" > To: "The Pacemaker cluster resource manager" > Sent: Wednesday, February 6, 2013 6:52:07 AM > Subject: Re: [Pacemaker] Pacemaker resource migration behaviour > > A quick addendum to this message: > > The log files I provided actually cont

Re: [Pacemaker] Serious issue with booth for site failover

2013-02-06 Thread Lars Marowsky-Bree
On 2013-02-06T11:24:08, Yves Trudeau wrote: > Hi Lars, > the site supposedly down was still able to send packets to the arbitrator. > That caused the ballot number to be increased on the arbitrator so the valid > ballot from the surviving site was discarded in acceptor_promise. Although > an op

Re: [Pacemaker] Reboot of cluster members with heavy load on filesystem.

2013-02-06 Thread Andrew Beekhof
On Thu, Feb 7, 2013 at 5:58 AM, Carlos Xavier wrote: > Hi. > > We are running two clusters composed of two machines. We are using DRBD + > OCFS2 to make the common filesystem. > The versions of the components in use are: > OpenSuse 12.1 > pacemaker-1.1.6-2.1.2.x86_64 > corosync-1.4.1-2.1.3.x

[Pacemaker] help install pacemaker-mgmt

2013-02-06 Thread E-Blokos
as F18 pacemaker-mgmt package doesn't exist I tried to compile from the source but I'm not sure to understand how to compile it ./ConfigureMe --prefix=/usr --sysconfdir=/etc --localstatedir=/var --mandir=/usr/share/man --disable-rpath configure Configure flags for RedHat Linux: --prefix=/usr --s

Re: [Pacemaker] help install pacemaker-mgmt

2013-02-06 Thread Gao,Yan
On 02/07/13 10:44, E-Blokos wrote: > as F18 pacemaker-mgmt package doesn't exist > I tried to compile from the source but I'm not sure > to understand how to compile it > > ./ConfigureMe --prefix=/usr --sysconfdir=/etc --localstatedir=/var > --mandir=/usr/share/man --disable-rpath configure The r

Re: [Pacemaker] help install pacemaker-mgmt

2013-02-06 Thread E-Blokos
as F18 pacemaker-mgmt package doesn't exist I tried to compile from the source but I'm not sure to understand how to compile it ./ConfigureMe --prefix=/usr --sysconfdir=/etc --localstatedir=/var --mandir=/usr/share/man --disable-rpath configure The right way is: ./ConfigureMe configure [options]

Re: [Pacemaker] Node fails to rejoin cluster

2013-02-06 Thread Andrew Beekhof
On Wed, Feb 6, 2013 at 9:11 PM, Tal Yalon wrote: > Hi all, > > I have a 2-node cluster, where node-2 got fenced and now after reboot tries > to rejoin the cluster but fails and gets stuck in a loop for hours and never > joins back. > > After another reboot it managed to join, and there was no time

Re: [Pacemaker] Fencing configuration with pcmk_host_map argument

2013-02-06 Thread Andrew Beekhof
On Wed, Feb 6, 2013 at 2:57 AM, Marek Grac wrote: > Hi, > > > On 02/05/2013 03:24 AM, Andrew Beekhof wrote: >> >> I cleared the IPMI configuration and kept only the two WTI fencing >> Primitives in my configuration to make it as simple as possible : >> >> primitive wti_fence01 stonith:fence_wti \

Re: [Pacemaker] Fencing configuration with pcmk_host_map argument

2013-02-06 Thread Andrew Beekhof
On Wed, Feb 6, 2013 at 4:42 AM, Thibaut Pouzet wrote: > Le 05/02/2013 16:57, Marek Grac a écrit : >> >> Hi, >> >> On 02/05/2013 03:24 AM, Andrew Beekhof wrote: >>> >>> I cleared the IPMI configuration and kept only the two WTI fencing >>> Primitives in my configuration to make it as simple as poss

Re: [Pacemaker] How are Scores computed regarding roles?

2013-02-06 Thread Andrew Beekhof
On Wed, Feb 6, 2013 at 10:09 AM, Jake Smith wrote: > > - Original Message - >> From: "Marcos Tiago" >> To: pacemaker@oss.clusterlabs.org >> Sent: Tuesday, February 5, 2013 3:11:13 PM >> Subject: [Pacemaker] How are Scores computed regarding roles? >> >> >> >> >> >> Guys, >> >> >> >> I am

Re: [Pacemaker] Current Pacemaker from Git does not compile

2013-02-06 Thread Vladislav Bogdanov
06.02.2013 00:47, Andrew Beekhof wrote: [...] > I thought it was supposed to be legal to do this, it's not like the > definitions are different :-/ > Grumble. The following fixes this issue for me on EL6. diff --git a/include/crm/common/ipcs.h b/include/crm/common/ipcs.h index 5202bbc..b7991ae 100644

Re: [Pacemaker] help install pacemaker-mgmt

2013-02-06 Thread Gao,Yan
On 02/07/13 12:58, E-Blokos wrote: >>> as F18 pacemaker-mgmt package doesn't exist >>> I tried to compile from the source but I'm not sure >>> to understand how to compile it >>> >>> ./ConfigureMe --prefix=/usr --sysconfdir=/etc --localstatedir=/var >>> --mandir=/usr/share/man --disable-rpath confi

Re: [Pacemaker] Reboot of cluster members with heavy load on filesystem.

2013-02-06 Thread Alexandr A. Alexandrov
Carlos, Increasing corosync timeouts and 'monitor' action timeouts in pacemaker might help, but do you have a separate leased network connection for corosync? It is better to connect your servers directly with a crossover cable (to be independent of switches/networ