Re: [Pacemaker] error: send_cpg_message: Sending message via cpg FAILED: (rc=6) Try again

2013-12-09 Thread Jan Friesse
Brian J. Murrell (brian) wrote: > I seem to have another instance where pacemaker fails to exit at the end > of a shutdown. Here's the log from the start of the "service pacemaker > stop": > > Dec 3 13:00:39 wtm-60vm8 crmd[14076]: notice: do_state_transition: State > transition S_POLICY_E

Re: [Pacemaker] pcs ping connectivity rule

2013-12-09 Thread Bauer, Stefan (IZLBW Extern)
May I ask what your configuration snippet looks like? Thank you, Stefan -Original Message- From: Martin Ševčík [mailto:sev...@esys.cz] Sent: Friday, 6 December 2013 12:26 To: pacemaker@oss.clusterlabs.org Subject: Re: [Pacemaker] pcs ping connectivity rule I installed crmsh

Re: [Pacemaker] configuration of stonith

2013-12-09 Thread Dvorak Andreas
Dear all, thank you for the answers. I have now created two stonith resources: pcs stonith create ipmi-fencing-sv2837 fence_ipmilan pcmk_host_list="sv2837" ipaddr=10.110.28.37 action="reboot" login=abc passwd=abc123 delay=15 op monitor interval=60s pcs stonith create ipmi-fencing-sv2836 fence_ipmilan

Re: [Pacemaker] configuration of stonith

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 07:42:59, Masopust, Christian wrote: > > >> If you're using 1.1.10+, > > >> > > >> pcs stonith create fence_pcmk1_ipmi fence_ipmilan \ > > >> > > >> pcmk_host_list="pcmk-1" ipaddr="pcmk-1.ipmi" \ > > >> action="reboot" login="admin" passwd="secret" delay=15 \

[Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Bauer, Stefan (IZLBW Extern)
Hi List, even though I followed the well-known documentation on a ping clone resource, my resources are not moving to the node with the better connectivity: 2 Nodes configured, 2 expected votes 6 Resources configured. Online: [ debian6-n2 debian6-n1 ] Resource Group: cluster1 p

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread emmanuel segura
Where is your config? 2013/12/9 Bauer, Stefan (IZLBW Extern) > Hi List, > > > > even though I followed the well-known documentation on a ping clone > resource, my resources are not moving to the node with the better > connectivity: > > > > 2 Nodes configured, 2 expected votes > > 6 Resources con

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Bauer, Stefan (IZLBW Extern)
Pardon! node debian6-n1 node debian6-n2 primitive p_alias0 ocf:heartbeat:IPaddr2 \ params ip="4.5.6.7" cidr_netmask="24" nic="eth0" \ op start interval="0" timeout="20" \ op stop interval="0" timeout="30" \ op monitor interval="20" primitive p_conntrackd lsb:conntra

Re: [Pacemaker] Pacemaker very often STONITHs other node

2013-12-09 Thread Nikita Staroverov
Hello, I still have not received any hints from you. And you are definitely my only hope before I switch to Proxmox or (even worse) some commercial stuff. At least can you tell me if mode 4 could cause trouble with Corosync? Thanks! According to your logs, posted before, the reason was: Nov

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 10:28:32, Bauer, Stefan wrote: > Pardon! > > node debian6-n1 > node debian6-n2 > primitive p_alias0 ocf:heartbeat:IPaddr2 \ > params ip="4.5.6.7" cidr_netmask="24" nic="eth0" \ > op start interval="0" timeout="20" \ > op stop interval="0" timeo

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread emmanuel segura
Your location constraint isn't enough, sorry for my English :) location mynet mygrp_or_rsc \ rule $id="-rule" -inf: not_defined pingd or pingd number:lte 300 2013/12/9 Bauer, Stefan (IZLBW Extern) > Pardon! > > > > node debian6-n1 > > node debian6-n2 > > primitive p_alias0 ocf:heartbeat:IPaddr2

Re: [Pacemaker] Pacemaker very often STONITHs other node

2013-12-09 Thread Michał Margula
On 09.12.2013 11:34, Nikita Staroverov wrote: So, what happens? :) Rivendell-B tried to stop XEN-acsystemy01, but couldn't do that because the operation timed out. A failed stop operation is fatal by default and leads to stonith. Rivendell-A caught this and fenced rivendell-B. You also have

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Bauer, Stefan (IZLBW Extern)
Hi Michael, so that means either increasing one value or lowering the other, right? Or is lowering resource_stickiness the only reasonable way? I tried a multiplier of 1000 but saw no change in the behavior. Stefan -Original Message- From: Michael Schwartzkopff [mailto:m...@sys4.de] You
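The trade-off raised in this message (raise the ping multiplier or lower resource stickiness) can be illustrated with a deliberately simplified model. This is a sketch under stated assumptions, not Pacemaker's actual placement algorithm: it assumes the pingd attribute equals the number of reachable ping hosts times the multiplier, that a rule such as `pingd: defined pingd` adds that value to the node's score, and that stickiness is counted once for the node currently running the resource. The function names and all numbers are hypothetical; in a real cluster, stickiness can accumulate per group member and other constraints also contribute.

```python
def ping_score(reachable_hosts: int, multiplier: int) -> int:
    """Assumed pingd attribute value: reachable ping hosts times multiplier."""
    return reachable_hosts * multiplier


def would_move(current_reachable: int, other_reachable: int,
               multiplier: int, stickiness: int) -> bool:
    """Return True if the other node outscores the current one in this toy model."""
    # The current node keeps its connectivity score plus the stickiness bonus.
    current_total = ping_score(current_reachable, multiplier) + stickiness
    # The candidate node only has its connectivity score.
    other_total = ping_score(other_reachable, multiplier)
    return other_total > current_total


if __name__ == "__main__":
    # Hypothetical case: current node reaches 1 ping host, the other reaches 2.
    print(would_move(1, 2, multiplier=100, stickiness=200))   # stickiness outweighs the gain
    print(would_move(1, 2, multiplier=1000, stickiness=200))  # connectivity gain outweighs stickiness
```

In this model the resource only moves when the connectivity gain (difference in reachable hosts times the multiplier) exceeds the total stickiness, so either knob can tip the balance. If the effective stickiness is larger than expected, for example because it is summed over several group members, a larger multiplier may still not be enough.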

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Lars Marowsky-Bree
On 2013-12-09T10:28:32, "Bauer, Stefan (IZLBW Extern)" wrote: > location groupwithping cluster1 \ > rule $id="groupwithping-rule" pingd: defined pingd I tend to prefer a -inf score for nodes where pingd is *not* defined or zero. (Downside is that when you lose all connectivity, all ser

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 11:27:51, Bauer, Stefan wrote: > Hi Michael, > > so that means either increasing one value or lowering the other, right? > Or is lowering resource_stickiness the only reasonable way? > > I tried a multiplier of 1000 but saw no change in the behavior. > > Stefan > > -U

Re: [Pacemaker] monitor on-fail=ignore not restarting when resource reported as stopped

2013-12-09 Thread Lars Marowsky-Bree
On 2013-12-06T16:06:09, Patrick Hemmer wrote: Hi Patrick, > > For a resource that pacemaker expects to be started, it's an error if it > > is found to be stopped. Pacemaker can't tell if it is really cleanly > > stopped, or died, or ... > Oh, and I'll quote the OCF spec on this one: > > 1 g

[Pacemaker] Reg. trigger when node failure occurs

2013-12-09 Thread ESWAR RAO
Hi All, I have a 3 node (node1, node2, node3) setup on which HB+pacemaker runs. I have resources running in clone mode on node1 and node2. Is there any way to get a trigger when a node failure occurs, i.e., can I trigger a script if node3 (on which no resource runs) fails? Thanks Eswar

Re: [Pacemaker] Pacemaker very often STONITHs other node

2013-12-09 Thread Nikita Staroverov
Hello, Thank you for your answer. I have two drbd - /dev/drbd1 and /dev/drbd2. And I use them as PVs for LVM which has one Volume Group hosting all the VMs. So should I have as many DRBDs as VMs and get rid of LVM altogether? PS. If it is not a secret, what are your recommended timeouts? Thank

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Bauer, Stefan (IZLBW Extern)
Why are some resources listed more than once in the output? What is the difference between group_color and native_color? If a resource has a value of -INFINITY is it because the cluster already decided that this resource should not run on this host or it can not run on this host due to other reas

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 13:06:04, Bauer, Stefan wrote: > Why are some resources listed more than once in the output? > What is the difference between group_color and native_color? > If a resource has a value of -INFINITY is it because the cluster already > decided that this resource should no

[Pacemaker] WG: configuration of stonith

2013-12-09 Thread Dvorak Andreas
Dear all My problem with stonith is solved. Here is what I did: pcs stonith create ipmi-fencing-sv2837 fence_ipmilan pcmk_host_list="sv2837" ipaddr=10.110.28.37 action="off" login=ipmi passwd=abc delay=15 op monitor interval=60s pcs stonith create ipmi-fencing-sv2836 fence_ipmilan pcmk_host_li

[Pacemaker] cluster heartbeat is not used

2013-12-09 Thread Dvorak Andreas
Dear all, during failover tests I found out that I can take down the heartbeat interfaces and the cluster ignores that. But if I take down bond0, fencing runs. Can somebody please help me? bond0 Link encap:Ethernet HWaddr C8:1F:66:D7:3B:FE inet addr:10.15.28.36 Bcast:10.1

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 14:40:29, Dvorak Andreas wrote: > Dear all > > My problem with stonith is solved. > > Here is what I did: > > pcs stonith create ipmi-fencing-sv2837 fence_ipmilan pcmk_host_list="sv2837" > ipaddr=10.110.28.37 action="off" login=ipmi passwd=abc delay=15 op monitor >

[Pacemaker] ocf resource agents - pre and post scripts

2013-12-09 Thread Vladimir
Hello everyone, Is there a built-in mechanism in pacemaker to trigger a pre or post script, or do the ocf resource agents provide something like that? I could also create a kind of dummy resource after a primitive resource. But I asked myself if there is another/better way. Thanks.

Re: [Pacemaker] ocf resource agents - pre and post scripts

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 15:24:06, Vladimir wrote: > Hello everyone, > > Is there a built-in mechanism in pacemaker to trigger a pre or post > script, or do the ocf resource agents provide something like that? > > I could also create a kind of dummy resource after a primitive > resource. But I

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread Masopust, Christian
Hi Andreas, as far as I can say (as a pacemaker novice), everything is fine besides the equal delays for both stoniths... I would suggest configuring only one with a delay of 15s. br, christian > -Original Message- > From: Dvorak Andreas [mailto:andreas.dvo...@baaderbank.de] > Ge

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread emmanuel segura
Please show the output of cat /proc/net/bonding/bond0 2013/12/9 Dvorak Andreas > Dear all, > > > > during failover tests I found out that I can take down the heartbeat > interfaces and the cluster ignores that. But if I take down bond0, > fencing runs. > > Can somebody please help me? > > > > bond0 Lin

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread Dvorak Andreas
Hi Here it is cat /proc/net/bonding/bond0 Ethernet Channel Bonding Driver: v3.6.0 (September 26, 2009) Bonding Mode: fault-tolerance (active-backup) Primary Slave: None Currently Active Slave: em3 MII Status: up MII Polling Interval (ms): 100 Up Delay (ms): 0 Down Delay (ms): 0 Slave Interface:

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread Masopust, Christian
> > pcs stonith create ipmi-fencing-sv2837 fence_ipmilan > pcmk_host_list="sv2837" > > ipaddr=10.110.28.37 action="off" login=ipmi passwd=abc > delay=15 op monitor > > interval=60s pcs stonith create ipmi-fencing-sv2836 fence_ipmilan > > pcmk_host_list="sv2836" ipaddr=10.110.28.36 action="off" lo

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread emmanuel segura
Why are you editing corosync.conf when you are using cman as the cluster stack? 2013/12/9 emmanuel segura > because your corosync-cfgtool -s output shows you are using the bonding address > > > > > 2013/12/9 Dvorak Andreas > >> Hi >> >> >> >> Here it is >> >> cat /proc/net/bonding/bond0 >> >> Ethernet Channel

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 14:58:13, you wrote: > > > pcs stonith create ipmi-fencing-sv2837 fence_ipmilan > > > > pcmk_host_list="sv2837" > > > > > ipaddr=10.110.28.37 action="off" login=ipmi passwd=abc > > > > delay=15 op monitor > > > > > interval=60s pcs stonith create ipmi-fencing-sv2

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread emmanuel segura
Because your corosync-cfgtool -s output shows you are using the bonding address. 2013/12/9 Dvorak Andreas > Hi > > > > Here it is > > cat /proc/net/bonding/bond0 > > Ethernet Channel Bonding Driver: v3.6.0 (September 26, 2009) > > > > Bonding Mode: fault-tolerance (active-backup) > > Primary Slave: None > >

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 15:53:59, Dvorak Andreas wrote: > Hi > > Here it is > cat /proc/net/bonding/bond0 > Ethernet Channel Bonding Driver: v3.6.0 (September 26, 2009) > > Bonding Mode: fault-tolerance (active-backup) > Primary Slave: None > Currently Active Slave: em3 > MII Status: up > M

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread emmanuel segura
I think they should be pcs constraint location ipmi-fencing-sv2837 prefers sv2837=-INFINITY pcs constraint location ipmi-fencing-sv2836 prefers sv2836=-INFINITY 2013/12/9 Michael Schwartzkopff > On Monday, 9 December 2013, 14:58:13, you wrote: > > > > pcs stonith create ipmi-fencing-sv2837

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread Masopust, Christian
> > > > > > This is not optimal. Nothing prevents the resource, that can > > > fence node sv2837 > > > to run on host sv2837. You just say, that it should run on > > > node sv2836. > > > > > > Better would be something like > > > > > > crm configure location place-fencing-sv2837 -inf: sv2837 > >

Re: [Pacemaker] WG: configuration of stonith

2013-12-09 Thread Masopust, Christian
Hi Emmanuel, thanks for the hint, reading (again) the chapter about resource stickiness, I see and understand the difference :) br, christian From: emmanuel segura [mailto:emi2f...@gmail.com] Sent: Monday, 9 December 2013 16:12 To: m...@sys4.de; The Pacemak

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread Dvorak Andreas
Hi, thank you for the quick answers. I thought I would need to edit the corosync.conf file. So I do not need to? Where should I configure the heartbeat interconnect interfaces? With corosync-cfgtool -s it shows the wrong IP? But where does that come from? Andreas From: emmanuel segura [mailto:em

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread emmanuel segura
The IP is OK, but if you are using cman, you need to edit /etc/cluster/cluster.conf; that's what I think. 2013/12/9 Dvorak Andreas > Hi, > > > > thank you for the quick answers. > > I thought I would need to edit the corosync.conf file. So I do not need to? > > Where should I configure the heart

Re: [Pacemaker] cluster heartbeat is not used

2013-12-09 Thread Michael Schwartzkopff
On Monday, 9 December 2013, 16:27:25, Dvorak Andreas wrote: > Hi, > > thank you for the quick answers. > I thought I would need to edit the corosync.conf file. So I do not need to? > Where should I configure the heartbeat interconnect interfaces? > > With corosync-cfgtool -s it shows the wrong

Re: [Pacemaker] Ressources not moving to node with better connectivity - pingd

2013-12-09 Thread Bauer, Stefan (IZLBW Extern)
Thank you for your time! It works now :) Stefan -Original Message- From: Michael Schwartzkopff [mailto:m...@sys4.de] Sent: Monday, 9 December 2013 14:26 To: The Pacemaker cluster resource manager Subject: Re: [Pacemaker] Ressources not moving to node with better connectivity

Re: [Pacemaker] error: send_cpg_message: Sending message via cpg FAILED: (rc=6) Try again

2013-12-09 Thread Brian J. Murrell
On Mon, 2013-12-09 at 09:28 +0100, Jan Friesse wrote: > > Error 6 means "try again". This happens either if corosync is > overloaded or creating new membership. Please take a look at > /var/log/cluster/corosync.log if you see something strange there (+ make > sure you have newest corosyn

Re: [Pacemaker] pcs ping connectivity rule

2013-12-09 Thread Martin Ševčík
relevant parts of the config: primitive pingd ocf:pacemaker:ping \ params host_list="10.242.50.251 10.242.50.252" multiplier="1" location l_best_connectivity g_ris \ rule $id="l_best_connectivity-rule" pingd: defined pingd best regards, m. On 12/09/2013 09:43 AM, Bauer, Stefan

[Pacemaker] is ccs as racy as it feels?

2013-12-09 Thread Brian J. Murrell
So, I'm trying to wrap my head around this need to migrate to pacemaker +CMAN. I've been looking at http://clusterlabs.org/quickstart-redhat.html and https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Linux/6/html/Cluster_Administration/ It seems "ccs" is the tool to configure

Re: [Pacemaker] Reg. trigger when node failure occurs

2013-12-09 Thread ESWAR RAO
Hi All, Can someone please let me know if there is a clean way for pacemaker to trigger a script when HB on a node has stopped or a node failure has occurred, given that I run HB+pacemaker on a 3 node setup? Thanks Eswar On Mon, Dec 9, 2013 at 5:16 PM, ESWAR RAO wrote: > Hi All, > > I have a 3 node (node1, node2,

Re: [Pacemaker] Reg. trigger when node failure occurs

2013-12-09 Thread Michael Schwartzkopff
On Tuesday, 10 December 2013, 12:19:25, ESWAR RAO wrote: > Hi All, > > Can someone please let me know if there is a clean way for pacemaker to trigger a script > when HB on a node has stopped or a node failure has occurred, given that I run > HB+pacemaker on a 3 node setup? > > Thanks > Eswar > > On Mon, Dec 9, 201