Re: [Pacemaker] Patch for bugzilla 2541: Shell should warn if parameter uniqueness is violated

2011-03-25 Thread Vladislav Bogdanov
Oops, this is actually a bug in fence_ipmilan which reports all params as unique. 26.03.2011 08:28, Vladislav Bogdanov wrote: > Hi, > > it seems like it was commit d0472a26eda1 which now causes following: > > WARNING: Resources > stonith-v02-a,stonith-v02-b,stonith-v02-c,stonith-v02-d violate >

Re: [Pacemaker] Patch for bugzilla 2541: Shell should warn if parameter uniqueness is violated

2011-03-25 Thread Vladislav Bogdanov
Hi, it seems like it was commit d0472a26eda1 which now causes following: WARNING: Resources stonith-v02-a,stonith-v02-b,stonith-v02-c,stonith-v02-d violate uniqueness for parameter "action": "reboot" WARNING: Resources stonith-v02-a,stonith-v02-b,stonith-v02-c,stonith-v02-d violate uniqueness for

Re: [Pacemaker] WARN: msg_to_op(1324): failed to get the value of field lrm_opstatus from a ha_msg

2011-03-25 Thread Bob Schatz
A few more thoughts that occurred after I hit 1. This problem sees to only occur when "/etc/init.d/heartbeat start" is executed on two nodes at the same time. If I only do one at a time it does not seem to occur. (this may be related to the creation of master/slave resources in /etc/ha.d/re

Re: [Pacemaker] DRBD and pacemaker interaction

2011-03-25 Thread Lars Ellenberg
On Fri, Mar 25, 2011 at 06:39:10PM +0100, Christoph Bartoschek wrote: > Hi, > > I´ve already sent this mail to linux-ha but that list seems to be dead: What makes you think so? That you did not get a reply within 40 minutes? You make me feel sorry about having replied there. Maybe you should co

Re: [Pacemaker] IPaddr2 Netmask Bug Fix Issue

2011-03-25 Thread Pavel Levshin
25.03.2011 18:47, darren.mans...@opengi.co.uk: We configure a virtual IP on the non-arping lo interface of both servers and then configure the IPaddr2 resource with lvs_support=true. This RA will remove the duplicate IP from the lo interface when it becomes active. Grouping the VIP with ldire

Re: [Pacemaker] Is there any way to reduce the time for migration of the resource from one node to another node in a cluster on failover.

2011-03-25 Thread Rakesh K
Andrew Beekhof writes: Hi Andrew Beekhof I measured the time, when heart beat recognize the process had failed on first node to the process and VIP has created on the second node . i calculated the time using the heartbeat log files which logs the messages with time. _

[Pacemaker] DRBD and pacemaker interaction

2011-03-25 Thread Christoph Bartoschek
Hi, I´ve already sent this mail to linux-ha but that list seems to be dead: we experiment with DRBD and pacemaker and see several times that the DRBD part is degraded (One node is outdated or diskless or something similar) but crm_mon just reports that the DRBD resource runs as master and slav

[Pacemaker] IPaddr2 Netmask Bug Fix Issue

2011-03-25 Thread Darren.Mansell
Hello all. Between SLE 11 HAE and SLE 11 SP1 HAE (pacemaker 1.0.3 - pacemaker 1.1.2) the following bit has changed in the IPaddr2 RA: Old: local iface=`$IP2UTIL -o -f inet addr show | grep "\ $BASEIP/" \ | cut -d ' ' -f2 | grep -v '^ipsec[0-9][0-9]*$'` New: local ifac

Re: [Pacemaker] [RFC PATCH] Try to fix startup-fencing not happening

2011-03-25 Thread Simone Gotti
On 03/25/2011 11:10 AM, Andrew Beekhof wrote: > On Thu, Mar 17, 2011 at 11:54 PM, Simone Gotti wrote: >> Hi, >> >> When using corosync + pcmk v1 starting both corosync and pacemakerd (and >> I think also using heartbeat or anything other than cman) as quorum >> provider, at startup in the CIB will

Re: [Pacemaker] Is there any way to reduce the time for migration of the resource from one node to another node in a cluster on failover.

2011-03-25 Thread Andrew Beekhof
On Tue, Mar 22, 2011 at 12:41 PM, rakesh k wrote: > Hi All > > I am providing you the configuration I used for testing the resource > migration. > > Node-1 resource failed . > Message sent to node-2 > the log message i found in ha-debug file (pengine: [15991]: notice: > common_apply_stickiness: To

Re: [Pacemaker] [RFC PATCH] Try to fix startup-fencing not happening

2011-03-25 Thread Andrew Beekhof
On Thu, Mar 17, 2011 at 11:54 PM, Simone Gotti wrote: > Hi, > > When using corosync + pcmk v1 starting both corosync and pacemakerd (and > I think also using heartbeat or anything other than cman) as quorum > provider, at startup in the CIB will not be a entry for > the nodes that are not in clus

Re: [Pacemaker] Fencing order

2011-03-25 Thread Andrew Beekhof
On Mon, Mar 21, 2011 at 4:06 PM, Pavel Levshin wrote: > Hi. > > Today, we had a network outage. Quite a few problems suddenly arised in out > setup, including crashed corosync, known notify bug in DRBD RA and some > problem with VirtualDomain RA timeout on stop. > > But particularly strange was fe

Re: [Pacemaker] Pacemaker with Apache2...

2011-03-25 Thread Andrew Beekhof
Guessing the status URL isnt enabled in the apache config. On Wed, Mar 23, 2011 at 8:53 PM, Pavel Levshin wrote: > 23.03.2011 17:10, Yannik Nicod: > > Failed actions: >     WebSite_start_0 (node=clutest02, call=4, rc=1, status=complete): unknown > error >     WebSite_monitor_0 (node=clutest01, ca

Re: [Pacemaker] How to send email-notification on failure of resource in cluster frame work

2011-03-25 Thread Andrew Beekhof
"man crm_mon" look for the word "mail", if its not there - then whoever built the packages didnt include support for that feature On Thu, Mar 24, 2011 at 5:46 AM, Rakesh K wrote: > Hi ALL > Is there any way to send Email notifications when a resource is failure in the > cluster frame work. > >

Re: [Pacemaker] CMAN integration questions

2011-03-25 Thread Andrew Beekhof
On Thu, Mar 24, 2011 at 9:27 AM, Vladislav Bogdanov wrote: > 23.03.2011 21:38, Pavel Levshin wrote: >> 23.03.2011 15:56, Vladislav Bogdanov: >> >> >>> After 1 minute vd01-d takes over DC role. >>> >>> Mar 23 10:10:03 vd01-d crmd: [1875]: info: update_dc: Set DC to vd01-d >>> (3.0.5) >