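The resource agent (here, the haproxy init script) is broken. Specifically, it does not correctly handle "stopping a stopped service" as required by the LSB spec: a stop request for a service that is already stopped must succeed, but your script fails with rc=4 ("kill: No such process"). See http://clusterlabs.org/doc/en-US/Pacemaker/1.1-crmsh/html/Pacemaker_Explained/ap-lsb.html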
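
For illustration, here is a minimal sketch of a stop routine that treats "already stopped" as success. It is not taken from the actual haproxy init script; the function name lsb_stop and the pidfile-based approach are assumptions made for the example.

```shell
#!/bin/sh
# Hypothetical helper (not from the real haproxy init script): stop the
# daemon whose PID is recorded in the pidfile given as "$1".
# Returns 0 whenever the service ends up stopped, including the case
# where it was already stopped, as LSB requires for the "stop" action.
lsb_stop() {
    pidfile=$1

    # No pidfile: treat the service as already stopped, which is a success.
    [ -f "$pidfile" ] || return 0

    pid=$(cat "$pidfile" 2>/dev/null)

    # Empty pidfile or stale PID: the process is gone, still a success.
    # (kill -0 also fails on a permission error; a real script may want
    # to tell those two cases apart.)
    if [ -z "$pid" ] || ! kill -0 "$pid" 2>/dev/null; then
        rm -f "$pidfile"
        return 0
    fi

    # The process is alive: ask it to terminate. Report a generic
    # error (1) only if the signal could not be delivered.
    kill "$pid" 2>/dev/null || return 1
    rm -f "$pidfile"
    return 0
}
```

The "stop)" branch of the init script would then call something like lsb_stop /var/run/haproxy.pid (the path is an assumption) and exit with its return value, so that a stop after the process has been killed by hand reports 0 instead of 4.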

On 05/04/2013, at 2:37 AM, Yann Fouillat <yann.fouillat....@makina-corpus.com> wrote:

> Hello,
>
> I am testing pacemaker and am trying to set up a cluster with two
> nodes. Each node have a HAProxy on it. My configuration is:
>
> ""
> node drupal.mysql.hits.com
> node drupal2.mysql.hits.com
> primitive failover-ip ocf:heartbeat:IPaddr2 \
>     params ip="192.168.0.108" cidr_netmask="32" \
>     op monitor interval="1s"
> primitive haproxy lsb:haproxy \
>     op monitor interval="1s"
> colocation haproxy-with-failover-ip inf: haproxy failover-ip
> order haproxy-after-failover-ip inf: failover-ip haproxy
> property $id="cib-bootstrap-options" \
>     dc-version="1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b" \
>     cluster-infrastructure="openais" \
>     expected-quorum-votes="2" \
>     stonith-enabled="false" \
>     no-quorum-policy="ignore"
> ""
>
> My problem is that, when I kill the haproxy process on the active
> node, pacemaker can't stop it:
>
> ""
> ============
> Last updated: Thu Apr 4 17:26:41 2013
> Stack: openais
> Current DC: drupal.mysql.hits.com - partition with quorum
> Version: 1.0.9-74392a28b7f31d7ddc86689598bd23114f58978b
> 2 Nodes configured, 2 expected votes
> 2 Resources configured.
> ============
>
> Online: [ drupal2.mysql.hits.com drupal.mysql.hits.com ]
>
> failover-ip (ocf::heartbeat:IPaddr2): Started drupal.mysql.hits.com
> haproxy (lsb:haproxy): Started drupal2.mysql.hits.com (unmanaged) FAILED
>
> Migration summary:
> * Node drupal2.mysql.hits.com:
>    haproxy: migration-threshold=1000000 fail-count=1000000
> * Node drupal.mysql.hits.com:
>
> Failed actions:
>     haproxy_monitor_1000 (node=drupal2.mysql.hits.com, call=8, rc=7, status=complete): not running
>     haproxy_stop_0 (node=drupal2.mysql.hits.com, call=9, rc=4, status=complete): insufficient privileges
> ""
>
> I have the following in the log:
>
> ""
> Apr 4 17:18:21 drupal2 lrmd: [2390]: WARN: For LSB init script, no additional parameters are needed.
> Apr 4 17:18:21 drupal2 lrmd: [30014]: info: RA output: (haproxy:stop:stdout) Stopping haproxy: haproxy
> Apr 4 17:18:21 drupal2 lrmd: [30014]: info: RA output: (haproxy:stop:stderr) kill: No such process
> Apr 4 17:18:21 drupal2 crmd: [30017]: info: process_lrm_event: LRM operation haproxy_stop_0 (call=9, rc=4, cib-update=15, confirmed=true) insufficient privileges
> ""
>
> But if I restart corosync, haproxy is restarted without any problem.
> Do you have any idea what is going wrong ?
>
> Thanks,
>
> Yann
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org