Re: [ClusterLabs] Access denied when using Floating IP

2017-01-06 Thread Ken Gaillot
On 12/26/2016 12:03 AM, Kaushal Shriyan wrote: > Hi, > > I have set up Highly Available HAProxy Servers with Keepalived and > Floating IP. I have the below details > > *Master Node keepalived.conf* > > global_defs { > # Keepalived process identifier > #lvs_id haproxy_DH > } > # Script used to

Re: [ClusterLabs] centos 7 drbd fubar

2017-01-06 Thread Ken Gaillot
On 12/27/2016 03:08 PM, Dimitri Maziuk wrote: > I ran centos 7.3.1611 update over the holidays and my drbd + nfs + imap > active-passive pair locked up again. This has now been consistent for at > least 3 kernel updates. This time I had enough consoles open to run > fuser & lsof though. > > The

Re: [ClusterLabs] Status and help with pgsql RA

2017-01-06 Thread Ken Gaillot
On 12/28/2016 02:24 PM, Nils Carlson wrote: > Hi, > > I am looking to set up postgresql in high-availability and have been > comparing the guide at > http://wiki.clusterlabs.org/wiki/PgSQL_Replicated_Cluster with the > contents of the pgsql resource agent on github. It seems that there have >

Re: [ClusterLabs] Can packmaker launch haproxy from new network namespace automatically?

2016-12-21 Thread Ken Gaillot
On 12/17/2016 07:26 PM, Hao QingFeng wrote: > Hi Folks, > > I am installing packmaker to manage the cluster of haproxy within > openstack on ubuntu 16.04. > > I met the problem that haproxy can't start listening for some services > in vip because the related ports > > were occupied by those

[ClusterLabs] New ClusterLabs logo unveiled :-)

2016-12-22 Thread Ken Gaillot
banner, though the banner will need some tweaking to make the best use of it. You might not see it there immediately due to browser caching and DNS resolver caching (the wiki IP changed recently as part of an OS upgrade), but it's there. :-) Wishing everyone a happy holiday season, -- Ken Gaillot

Re: [ClusterLabs] Cluster failure

2016-12-20 Thread Ken Gaillot
On 12/20/2016 12:21 AM, Rodrick Brown wrote: > I'm fairly new to Pacemaker and have a few questions about > > The following log event and why resources was removed from my cluster > Right before the resources being killed SIGTERM I notice the following > message. > Dec 18 19:18:18

Re: [ClusterLabs] Antw: Running two independent clusters

2017-03-22 Thread Ken Gaillot
On 03/22/2017 05:23 AM, Nikhil Utane wrote: > Hi Ulrich, > > It's not an option unfortunately. > Our product runs on a specialized hardware and provides both the > services (A & B) that I am referring to. Hence I cannot have service A > running on some nodes as cluster A and service B running on

Re: [ClusterLabs] error: The cib process (17858) exited: Key has expired (127)

2017-03-24 Thread Ken Gaillot
in > <http://www.linkedin.com/company/systemec-b.v.> Systemec Youtube > <http://www.youtube.com/user/systemec1> > > > > Van: Ken Gaillot <kgail...@redhat.com> > Verzonden: vrijdag 24 maart 2017 16:49 > Aan: users@cluste

Re: [ClusterLabs] stonith in dual HMC environment

2017-03-28 Thread Ken Gaillot
On 03/28/2017 08:20 AM, Alexander Markov wrote: > Hello, Dejan, > >> Why? I don't have a test system right now, but for instance this >> should work: >> >> $ stonith -t ibmhmc ipaddr=10.1.2.9 -lS >> $ stonith -t ibmhmc ipaddr=10.1.2.9 -T reset {nodename} > > Ah, I see. Everything (including

Re: [ClusterLabs] pending actions

2017-03-24 Thread Ken Gaillot
On 03/07/2017 04:13 PM, Jehan-Guillaume de Rorthais wrote: > Hi, > > Occasionally, I find my cluster with one pending action not being executed for > some minutes (I guess until the "PEngine Recheck Timer" elapse). > > Running "crm_simulate -SL" shows the pending actions. > > I'm still confused

Re: [ClusterLabs] Create ressource to monitor each IPSEC VPN

2017-03-24 Thread Ken Gaillot
On 03/09/2017 01:44 AM, Damien Bras wrote: > Hi, > > > > We have a 2 nodes cluster with ipsec (libreswan). > > Actually we have a resource to monitor the service ipsec (via system). > > > > But now I would like to monitor each VPN. Is there a way to do that ? > Which agent could I use for

Re: [ClusterLabs] Antw: Running two independent clusters

2017-03-30 Thread Ken Gaillot
khil Not yet, we've been tweaking the syntax a bit, so I wanted to have something more final first. But it's very close. > > On Thu, Mar 23, 2017 at 7:35 PM, Ken Gaillot <kgail...@redhat.com > <mailto:kgail...@redhat.com>> wrote: > > On 03/22/2017 11:08 PM, Nikhil Ut

Re: [ClusterLabs] Antw: Running two independent clusters

2017-03-23 Thread Ken Gaillot
est nodes if you want to monitor and manage multiple services within them. If your services require hardware access that's not easily passed to a VM, containerizing the services might be a better option. > On Wed, Mar 22, 2017 at 8:06 PM, Ken Gaillot <kgail...@redhat.com > <mailto:kgail..

Re: [ClusterLabs] Three node cluster becomes completely fenced if one node leaves

2017-03-27 Thread Ken Gaillot
On 03/27/2017 03:54 PM, Seth Reid wrote: > > > > On Fri, Mar 24, 2017 at 2:10 PM, Ken Gaillot <kgail...@redhat.com > <mailto:kgail...@redhat.com>> wrote: > > On 03/24/2017 03:52 PM, Digimer wrote: > > On 24/03/17 04:44 PM, Seth Reid wrote: >

Re: [ClusterLabs] Syncing data and reducing CPU utilization of cib process

2017-03-31 Thread Ken Gaillot
On 03/31/2017 06:44 AM, Nikhil Utane wrote: > We are seeing this log in pacemaker.log continuously. > > Mar 31 17:13:01 [6372] 0005B932ED72cib: info: > crm_compress_string: Compressed 436756 bytes into 14635 (ratio 29:1) in > 284ms > > This looks to be the reason for high CPU. What

Re: [ClusterLabs] error: The cib process (17858) exited: Key has expired (127)

2017-03-24 Thread Ken Gaillot
On 03/24/2017 08:06 AM, Rens Houben wrote: > I recently upgraded a two-node cluster (named 'castor' and 'pollux' > because I should not be allowed to think up computer names before I've > had my morning caffeine) from Debian wheezy to Jessie after the > backports for corosync and pacemaker finally

Re: [ClusterLabs] stonith in dual HMC environment

2017-03-24 Thread Ken Gaillot
On 03/22/2017 09:42 AM, Alexander Markov wrote: > >> Please share your config along with the logs from the nodes that were >> effected. > > I'm starting to think it's not about how to define stonith resources. If > the whole box is down with all the logical partitions defined, then HMC > cannot

Re: [ClusterLabs] Failover question

2017-03-16 Thread Ken Gaillot
onfigured apache as a clone, so it will run on all nodes, regardless of where the IP is -- but the IP would only be placed where apache is successfully running. >> Am 15.03.2017 um 15:15 schrieb Ken Gaillot <kgail...@redhat.com>: >> >> Sure, just add a colocation con

Re: [ClusterLabs] Failover question

2017-03-15 Thread Ken Gaillot
Sure, just add a colocation constraint for virtual_ip with proxy. On 03/15/2017 05:06 AM, Frank Fiene wrote: > Hi, > > Another beginner question: > > I have configured a virtual IP resource on two hosts and an apache resource > cloned on both machines like this > > pcs resource create

Re: [ClusterLabs] CIB configuration: role with many expressions - error 203

2017-03-21 Thread Ken Gaillot
On 03/21/2017 11:20 AM, Radoslaw Garbacz wrote: > Hi, > > I have a problem when creating rules with many expressions: > > > boolean-op="and"> >id="on_nodes_dbx_first_head-expr" value="Active"/> >id="on_nodes_dbx_first_head-expr" value="AH"/> > >

Re: [ClusterLabs] Pacemaker for Embedded Systems

2017-04-11 Thread Ken Gaillot
On 04/10/2017 03:58 PM, Chad Cravens wrote: > Hello all: > > we have implemented large cluster solutions for complex server > environments that had databases, application servers, apache web servers > and implemented fencing with the IPMI fencing agent. > > However, we are considering if

[ClusterLabs] Coming in Pacemaker 1.1.17: Per-operation fail counts

2017-04-03 Thread Ken Gaillot
er 1.1.17. [1] http://lists.clusterlabs.org/pipermail/users/2016-September/004096.html -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org http://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://w

Re: [ClusterLabs] Antw: Coming in Pacemaker 1.1.17: Per-operation fail counts

2017-04-04 Thread Ken Gaillot
On 04/04/2017 01:18 AM, Ulrich Windl wrote: >>>> Ken Gaillot <kgail...@redhat.com> schrieb am 03.04.2017 um 17:00 in >>>> Nachricht > <ae3a7cf4-2ef7-4c4f-ae3f-39f473ed6...@redhat.com>: >> Hi all, >> >> Pacemaker 1.1.17 will have a signifi

Re: [ClusterLabs] STONITH not communicated back to initiator until token expires

2017-04-04 Thread Ken Gaillot
On 03/13/2017 10:43 PM, Chris Walker wrote: > Thanks for your reply Digimer. > > On Mon, Mar 13, 2017 at 1:35 PM, Digimer > wrote: > > On 13/03/17 12:07 PM, Chris Walker wrote: > > Hello, > > > > On our two-node EL7 cluster (pacemaker:

[ClusterLabs] Coming in Pacemaker 1.1.17: container bundles

2017-03-31 Thread Ken Gaillot
inside the container. The feature is currently experimental and will likely get significant bugfixes throughout the coming release cycle, but the syntax is stable and likely what will be released. I intend to add a more detailed walk-through example to the ClusterLabs wiki. -- Ken Gaillot

Re: [ClusterLabs] how start resources on the last running node

2017-04-05 Thread Ken Gaillot
On 04/04/2017 10:01 AM, Ján Poctavek wrote: > Hi, > > I came here to ask for some inspiration about my cluster setup. > > I have 3-node pcs+corosync+pacemaker cluster. When majority of nodes > exist in the cluster, everything is working fine. But what recovery > options do I have when I lose 2

Re: [ClusterLabs] Antw: Re: Rename option group resource id with pcs

2017-04-11 Thread Ken Gaillot
On 04/11/2017 05:48 AM, Ulrich Windl wrote: Dejan Muhamedagic schrieb am 11.04.2017 um 11:43 in > Nachricht <20170411094352.GD8414@tuttle.homenet>: >> Hi, >> >> On Tue, Apr 11, 2017 at 10:50:56AM +0200, Tomas Jelinek wrote: >>> Dne 11.4.2017 v 08:53 SAYED, MAJID ALI SYED

Re: [ClusterLabs] Surprising semantics of location constraints with INFINITY score

2017-04-11 Thread Ken Gaillot
On 04/11/2017 08:30 AM, Kristoffer Grönlund wrote: > Hi all, > > I discovered today that a location constraint with score=INFINITY > doesn't actually restrict resources to running only on particular > nodes. From what I can tell, the constraint assigns the score to that > node, but doesn't change

Re: [ClusterLabs] nodes ID assignment issue

2017-04-17 Thread Ken Gaillot
On 04/13/2017 10:40 AM, Radoslaw Garbacz wrote: > Hi, > > I have a question regarding building CIB nodes scope and specifically > assignment to node IDs. > It seems like the preexisting scope is not honored and nodes can get > replaced based on check-in order. > > I pre-create the nodes scope

Re: [ClusterLabs] KVM virtualdomain - stopped

2017-04-17 Thread Ken Gaillot
On 04/13/2017 03:01 AM, Jaco van Niekerk wrote: > > Hi > > I am having endless problems with ocf::heartbeat:VirtualDomain when > failing over to second node. The virtualdomain goes into a stopped state > > virtdom_compact (ocf::heartbeat:VirtualDomain): Stopped > > * virtdom_compact_start_0 on

Re: [ClusterLabs] How to force remove a cluster node?

2017-04-17 Thread Ken Gaillot
On 04/13/2017 01:11 PM, Scott Greenlese wrote: > Hi, > > I need to remove some nodes from my existing pacemaker cluster which are > currently unbootable / unreachable. > > Referenced >

Re: [ClusterLabs] Why shouldn't one store resource configuration in the CIB?

2017-04-17 Thread Ken Gaillot
On 04/13/2017 11:11 AM, Ferenc Wágner wrote: > Hi, > > I encountered several (old) statements on various forums along the lines > of: "the CIB is not a transactional database and shouldn't be used as > one" or "resource parameters should only uniquely identify a resource, > not configure it" and

Re: [ClusterLabs] starting primitive resources of a group without starting the complete group - unclear behaviour

2017-04-21 Thread Ken Gaillot
On 04/21/2017 04:38 AM, Lentes, Bernd wrote: > > > - On Apr 21, 2017, at 1:24 AM, Ken Gaillot kgail...@redhat.com wrote: > >> On 04/20/2017 02:53 PM, Lentes, Bernd wrote: > >> >> target-role=Stopped prevents a resource from being started. >> >>

Re: [ClusterLabs] starting primitive resources of a group without starting the complete group - unclear behaviour

2017-04-21 Thread Ken Gaillot
On 04/21/2017 07:52 AM, Lentes, Bernd wrote: > > > - On Apr 21, 2017, at 11:38 AM, Bernd Lentes > bernd.len...@helmholtz-muenchen.de wrote: > >> - On Apr 21, 2017, at 1:24 AM, Ken Gaillot kgail...@redhat.com wrote: >> >>> On 04/20/2017 02:53 PM, Le

Re: [ClusterLabs] Colocation of a primitive resource with a clone with limited copies

2017-04-21 Thread Ken Gaillot
On 04/21/2017 07:14 AM, Vladislav Bogdanov wrote: > 20.04.2017 23:16, Jan Wrona wrote: >> On 20.4.2017 19:33, Ken Gaillot wrote: >>> On 04/20/2017 10:52 AM, Jan Wrona wrote: >>>> Hello, >>>> >>>> my problem is closely related to the threa

Re: [ClusterLabs] Wtrlt: Antw: Re: Antw: Re: how important would you consider to have two independent fencing device for each node ?

2017-04-20 Thread Ken Gaillot
On 04/20/2017 01:43 AM, Ulrich Windl wrote: > Should have gone to the list... > > Digimer schrieb am 19.04.2017 um 17:20 in Nachricht >> <600637f1-fef8-0a3d-821c-7aecfa398...@alteeve.ca>: >>> On 19/04/17 02:38 AM, Ulrich Windl wrote: >>> Digimer

Re: [ClusterLabs] Colocation of a primitive resource with a clone with limited copies

2017-04-20 Thread Ken Gaillot
On 04/20/2017 10:52 AM, Jan Wrona wrote: > Hello, > > my problem is closely related to the thread [1], but I didn't find a > solution there. I have a resource that is set up as a clone C restricted > to two copies (using the clone-max=2 meta attribute||), because the > resource takes long time to

Re: [ClusterLabs] Antw: Re: 2-Node Cluster Pointless?

2017-04-18 Thread Ken Gaillot
On 04/18/2017 02:47 AM, Ulrich Windl wrote: Digimer schrieb am 16.04.2017 um 20:17 in Nachricht > <12cde13f-8bad-a2f1-6834-960ff3afc...@alteeve.ca>: >> On 16/04/17 01:53 PM, Eric Robinson wrote: >>> I was reading in "Clusters from Scratch" where Beekhof states, "Some would

Re: [ClusterLabs] Antw: Re: Never join a list without a problem...

2017-03-08 Thread Ken Gaillot
Issue 9 > > Send Users mailing list submissions to > users@clusterlabs.org > > To subscribe or unsubscribe via the World Wide Web, visit > http://lists.clusterlabs.org/mailman/listinfo/users > or, via email, send a message with subject or body 'help' to >

Re: [ClusterLabs] resource was disabled automatically

2017-03-07 Thread Ken Gaillot
On 03/06/2017 08:29 PM, cys wrote: > At 2017-03-07 05:47:19, "Ken Gaillot" <kgail...@redhat.com> wrote: >> To figure out why a resource was stopped, you want to check the logs on >> the DC (which will be the node with the most "pengine:" messages

Re: [ClusterLabs] PCMK_OCF_DEGRADED (_MASTER): exit codes are mapped to PCMK_OCF_UNKNOWN_ERROR

2017-03-06 Thread Ken Gaillot
On 03/06/2017 10:55 AM, Lars Ellenberg wrote: > On Thu, Mar 02, 2017 at 05:31:33PM -0600, Ken Gaillot wrote: >> On 03/01/2017 05:28 PM, Andrew Beekhof wrote: >>> On Tue, Feb 28, 2017 at 12:06 AM, Lars Ellenberg >>> <lars.ellenb...@linbit.com> wrote: >&

Re: [ClusterLabs] resource was disabled automatically

2017-03-06 Thread Ken Gaillot
On 03/06/2017 03:49 AM, cys wrote: > Hi, > > Today I found one resource was disabled. I checked that nobody did it. > The logs showed crmd(or pengine?) stopped it. I don't known why. > So I want to know will pacemaker disable resource automatically? > If so, when and why? > > Thanks. Pacemaker

Re: [ClusterLabs] Antw: Re: Ordering Sets of Resources

2017-03-01 Thread Ken Gaillot
On 03/01/2017 01:36 AM, Ulrich Windl wrote: >>>> Ken Gaillot <kgail...@redhat.com> schrieb am 26.02.2017 um 20:04 in >>>> Nachricht > <dbf562ff-a830-fc3c-84dc-487b892fc...@redhat.com>: >> On 02/25/2017 03:35 PM, iva...@libero.it wrote: >>

Re: [ClusterLabs] PCMK_OCF_DEGRADED (_MASTER): exit codes are mapped to PCMK_OCF_UNKNOWN_ERROR

2017-03-02 Thread Ken Gaillot
On 03/01/2017 05:28 PM, Andrew Beekhof wrote: > On Tue, Feb 28, 2017 at 12:06 AM, Lars Ellenberg > wrote: >> When I recently tried to make use of the DEGRADED monitoring results, >> I found out that it does still not work. >> >> Because LRMD choses to filter them in

Re: [ClusterLabs] Cannot clone clvmd resource

2017-03-01 Thread Ken Gaillot
On 03/01/2017 03:49 PM, Anne Nicolas wrote: > Hi there > > > I'm testing quite an easy configuration to work on clvm. I'm just > getting crazy as it seems clmd cannot be cloned on other nodes. > > clvmd start well on node1 but fails on both node2 and node3. Your config looks fine, so I'm going

Re: [ClusterLabs] cluster does not detect kill on pacemaker process ?

2017-04-07 Thread Ken Gaillot
tioning node. The rest of the cluster would then use stonith to disable that node, so it could safely recover its services elsewhere. > On Fri, Apr 7, 2017 at 7:58 AM, Ken Gaillot <kgail...@redhat.com > <mailto:kgail...@redhat.com>> wrote: > > On 04/05/201

Re: [ClusterLabs] cloned resources ordering and remote nodes problem

2017-04-06 Thread Ken Gaillot
On 04/06/2017 09:32 AM, Radoslaw Garbacz wrote: > Hi, > > > I have a question regarding resources order settings. > > Having cloned resources: "res_1-clone", "res_2-clone", > and defined order: first "res_1-clone" then "res_2-clone" > > When I have a monitoring failure on a remote node with

Re: [ClusterLabs] Can't See Why This Cluster Failed Over

2017-04-07 Thread Ken Gaillot
On 04/07/2017 12:58 PM, Eric Robinson wrote: > Somebody want to look at this log and tell me why the cluster failed over? > All we did was add a new resource. We've done it many times before without > any problems. > > -- > > Apr 03 08:50:30 [22762] ha14acib: info:

Re: [ClusterLabs] [Problem] The crmd causes an error of xml.

2017-04-07 Thread Ken Gaillot
On 04/06/2017 08:49 AM, renayama19661...@ybb.ne.jp wrote: > Hi All, > > I confirmed a development edition of Pacemaker. > - > https://github.com/ClusterLabs/pacemaker/tree/71dbd128c7b0a923c472c8e564d33a0ba1816cb5 > > > property no-quorum-policy="ignore" \ > stonith-enabled="true"

Re: [ClusterLabs] [ClusterLabs Developers] checking all procs on system enough during stop action?

2017-04-24 Thread Ken Gaillot
On 04/24/2017 10:32 AM, Jehan-Guillaume de Rorthais wrote: > On Mon, 24 Apr 2017 17:08:15 +0200 > Lars Ellenberg wrote: > >> On Mon, Apr 24, 2017 at 04:34:07PM +0200, Jehan-Guillaume de Rorthais wrote: >>> Hi all, >>> >>> In the PostgreSQL Automatic Failover (PAF)

Re: [ClusterLabs] Two nodes cluster issue

2017-07-31 Thread Ken Gaillot
h multiple devices -- level 1 would be the IPMI, and level 2 would be the fallback device. qdevice helps with quorum, which would let one side attempt to fence the other, but it doesn't affect whether the fencing succeeds. With a two-node cluster, you can use qdevice to get quorum, or you ca

Re: [ClusterLabs] Two nodes cluster issue

2017-07-31 Thread Ken Gaillot
Please ignore my re-reply to the original message, I'm in the middle of a move and am getting by on little sleep at the moment :-) On Mon, 2017-07-31 at 09:26 -0500, Ken Gaillot wrote: > On Mon, 2017-07-24 at 11:51 +, Tomer Azran wrote: > > Hello, > > > > > >

Re: [ClusterLabs] Antw: Re: Antw: Re: from where does the default value for start/stop op of a resource come ?

2017-08-02 Thread Ken Gaillot
On Wed, 2017-08-02 at 18:32 +0200, Lentes, Bernd wrote: > > - On Aug 2, 2017, at 10:42 AM, Ulrich Windl > ulrich.wi...@rz.uni-regensburg.de wrote: > > > > > > I thought the cluster does not perform actions that are not defined in the > > configuration (e.g. "monitor"). > > I think the

Re: [ClusterLabs] Updated attribute is not displayed in crm_mon

2017-08-15 Thread Ken Gaillot
On Tue, 2017-08-15 at 08:42 +0200, Jan Friesse wrote: > Ken Gaillot napsal(a): > > On Mon, 2017-08-14 at 12:33 -0500, Ken Gaillot wrote: > >> On Wed, 2017-08-02 at 09:59 +, 井上 和徳 wrote: > >>> Hi, > >>> > >>> In Pacemaker-

Re: [ClusterLabs] Updated attribute is not displayed in crm_mon

2017-08-16 Thread Ken Gaillot
+ ringnumber_1 : 192.168.102.132 is UP > > Regards, > Kazunori INOUE > > > -Original Message- > > From: Ken Gaillot [mailto:kgail...@redhat.com] > > Sent: Tuesday, August 15, 2017 2:42 AM > > To: Cluster Labs - All topics related to o

Re: [ClusterLabs] IPaddr2 RA and bonding

2017-08-10 Thread Ken Gaillot
actual > resulting bond speed based on a bond type. For load-balancing bonds like LACP > (mode 4) one it uses coefficient of 0.8 (iirc) to reflect actual possible > load via multiple links. > > > > > > > -Original Message- > > From: Ken Gaillot

Re: [ClusterLabs] notify action is not called for the docker bundle resources

2017-08-09 Thread Ken Gaillot
"owner": > "redis:redis", "path": "/var/lib/redis", "recurse": true}, {"owner": > "redis:redis", "path": "/var/log/redis", "recurse": true}]} > > > > Please note the docker

Re: [ClusterLabs] Updated attribute is not displayed in crm_mon

2017-08-14 Thread Ken Gaillot
ode3: >+ KEY : V-3 > > > Best Regards > > _______ > Users mailing list: Users@clusterlabs.org > http://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting s

Re: [ClusterLabs] Updated attribute is not displayed in crm_mon

2017-08-14 Thread Ken Gaillot
On Mon, 2017-08-14 at 12:33 -0500, Ken Gaillot wrote: > On Wed, 2017-08-02 at 09:59 +, 井上 和徳 wrote: > > Hi, > > > > In Pacemaker-1.1.17, the attribute updated while starting pacemaker is not > > displayed in crm_mon. > > In Pacemaker-1.1.16, it is di

Re: [ClusterLabs] DRBD or SAN ?

2017-07-17 Thread Ken Gaillot
On 07/17/2017 04:51 AM, Lentes, Bernd wrote: > Hi, > > i established a two node cluster with two HP servers and SLES 11 SP4. I'd > like to start now with a test period. Resources are virtual machines. The > vm's reside on a FC SAN. The SAN has two power supplies, two storage > controller, two

Re: [ClusterLabs] Antwort: Antw: Antwort: Re: reboot node / cluster standby

2017-07-11 Thread Ken Gaillot
06.07.2017 09:28 >> Betreff: [ClusterLabs] Antw: Antwort: Re: reboot node / cluster standby >> >> >>> <philipp.achmuel...@arz.at> schrieb am 03.07.2017 um 15:30 in > Nachricht >> <of2758213a.f6dc56ee-onc1258152.0046de1e-c1258152.004a3...@arz.at>: >

[ClusterLabs] Pacemaker 1.1.17 released

2017-07-06 Thread Ken Gaillot
d list of bug fixes and other changes, see the change log: https://github.com/ClusterLabs/pacemaker/blob/1.1/ChangeLog Many thanks to all contributors of source code to this release, including Alexandra Zhuravleva, Andrew Beekhof, Aravind Kumar, Eric Marques, Ferenc Wágner, Yan Gao, Hayley Swimelar, H

Re: [ClusterLabs] DRBD or SAN ?

2017-07-18 Thread Ken Gaillot
On 07/18/2017 09:34 AM, Lentes, Bernd wrote: > > > - On Jul 17, 2017, at 11:51 AM, Bernd Lentes > bernd.len...@helmholtz-muenchen.de wrote: > >> Hi, >> >> i established a two node cluster with two HP servers and SLES 11 SP4. I'd >> like >> to start now with a test period. Resources are

Re: [ClusterLabs] stonith disabled, but pacemaker tries to reboot

2017-07-20 Thread Ken Gaillot
On 07/20/2017 03:46 AM, Daniel.L wrote: > Hi Pacemaker Users, > > > We have a 2 node pacemaker cluster (v1.1.14). > Stonith at this moment is disabled: > > $ pcs property --all | grep stonith > stonith-action: reboot > stonith-enabled: false > stonith-timeout: 60s > stonith-watchdog-timeout:

Re: [ClusterLabs] (no subject)

2017-07-20 Thread Ken Gaillot
On 07/20/2017 12:21 AM, ArekW wrote: > Hi, How to properly unset a value with pcs? Set to false or null gives error: > > # pcs stonith update vbox-fencing verbose=false --force > or > # pcs stonith update vbox-fencing verbose= --force > > Jul 20 07:14:11 nfsnode1 stonith-ng[11097]: warning:

Re: [ClusterLabs] why resources are restarted when a node rejoins a cluster?

2017-07-25 Thread Ken Gaillot
> Users mailing list: Users@clusterlabs.org > http://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf > Bugs: http://bugs.clusterlabs.org -- Ken Gaillot &l

Re: [ClusterLabs] epic fail

2017-07-24 Thread Ken Gaillot
e NFS server also managed by pacemaker? Is it ordered after DRBD? Did pacemaker try to stop it before stopping DRBD? -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org http://lists.clusterlabs.org/mailman/lis

Re: [ClusterLabs] epic fail

2017-07-24 Thread Ken Gaillot
On Mon, 2017-07-24 at 17:13 +0200, Kristián Feldsam wrote: > Hmm, so when you know, that it happens also when putting node standy, > them why you run yum update on live cluster, it must be clear that > node will be fenced. Standby is not necessary, it's just a cautious step that allows the admin

Re: [ClusterLabs] Two nodes cluster issue

2017-07-24 Thread Ken Gaillot
> started: > http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf > Bugs: http://bugs.clusterlabs.org > > > > > >

Re: [ClusterLabs] resources do not migrate although node is going to standby

2017-07-24 Thread Ken Gaillot
and i don't have a standalone vnc server > configured. Why is the port occupied ? Can't help there > Btw: the network sockets are live migrated too during a live migration of a > VirtualDomain resource ? > It should be like that. > > Thanks. > > > Bernd My memo

Re: [ClusterLabs] timeout for stop VirtualDomain running Windows 7

2017-07-24 Thread Ken Gaillot
you can restrict updates to a certain time window, you can set up a rule that uses a longer timeout during that window. If you can't restrict the time window, but you can run a script when updates are done, you could set a node attribute at that time (and clear it on reboot), and use a similar rule b

Re: [ClusterLabs] reboot node / cluster standby

2017-06-29 Thread Ken Gaillot
On 06/29/2017 04:42 AM, philipp.achmuel...@arz.at wrote: > Hi, > > In order to reboot a Clusternode i would like to set the node to standby > first, so a clean takeover for running resources can take in place. > Is there a default way i can set in pacemaker, or do i have to setup my > own systemd

Re: [ClusterLabs] reboot node / cluster standby

2017-06-29 Thread Ken Gaillot
On 06/29/2017 01:38 PM, Ludovic Vaugeois-Pepin wrote: > On Thu, Jun 29, 2017 at 7:27 PM, Ken Gaillot <kgail...@redhat.com> wrote: >> On 06/29/2017 04:42 AM, philipp.achmuel...@arz.at wrote: >>> Hi, >>> >>> In order to reboot a Clusternode i would lik

Re: [ClusterLabs] Coming in Pacemaker 1.1.17: container bundles

2017-06-30 Thread Ken Gaillot
On 06/30/2017 12:10 PM, Valentin Vidic wrote: > On Fri, Mar 31, 2017 at 05:43:02PM -0500, Ken Gaillot wrote: >> Here's an example of the CIB XML syntax (higher-level tools will likely >> provide a more convenient interface): >> >> >> >> > >

Re: [ClusterLabs] Question about STONITH for VM HA cluster in shared hosts environment

2017-06-29 Thread Ken Gaillot
On 06/29/2017 12:08 PM, Digimer wrote: > On 29/06/17 12:39 PM, Andrés Pozo Muñoz wrote: >> Hi all, >> >> I am a newbie to Pacemaker and I can't find the perfect solution for my >> problem (probably I'm missing something), maybe someone can give me some >> hint :) >> >> My scenario is the

Re: [ClusterLabs] Coming in Pacemaker 1.1.17: container bundles

2017-07-03 Thread Ken Gaillot
On 07/01/2017 06:47 AM, Valentin Vidic wrote: > On Fri, Jun 30, 2017 at 12:46:29PM -0500, Ken Gaillot wrote: >> The challenge is that some properties are docker-specific and other >> container engines will have their own specific properties. >> >> We decided to go wi

Re: [ClusterLabs] Problem with stonith and starting services

2017-07-03 Thread Ken Gaillot
On 07/03/2017 02:34 AM, Cesar Hernandez wrote: > Hi > > I have installed a pacemaker cluster with two nodes. The same type of > installation has done before many times and the following error never > appeared before. The situation is the following: > > both nodes running cluster services >

Re: [ClusterLabs] Problem with stonith and starting services

2017-07-06 Thread Ken Gaillot
On 07/06/2017 08:54 AM, Cesar Hernandez wrote: > >> >> So, the above log means that node1 decided that node2 needed to be >> fenced, requested fencing of node2, and received a successful result for >> the fencing, and yet node2 was not killed. >> >> Your fence agent should not return success

Re: [ClusterLabs] About Corosync up to 16 nodes limit

2017-07-06 Thread Ken Gaillot
On 07/06/2017 03:51 AM, mlb_1 wrote: > thanks for your solution. > > Is anybody can officially reply this topic ? Digimer is correct, the Red Hat and SuSE limits are their own chosen limits for technical support, not enforced by the code. There are no hard limits in the code, but practically

Re: [ClusterLabs] Fwd: Cluster - NFS Share Configuration

2017-07-06 Thread Ken Gaillot
On 07/06/2017 07:24 AM, pradeep s wrote: > Team, > > I am working on configuring cluster environment for NFS share using > pacemaker. Below are the resources I have configured. > > Quote: > Group: nfsgroup > Resource: my_lvm (class=ocf provider=heartbeat type=LVM) > Attributes: volgrpname=my_vg

Re: [ClusterLabs] Antw: Re: reboot node / cluster standby

2017-07-06 Thread Ken Gaillot
On 07/06/2017 02:21 AM, Ulrich Windl wrote: >>>> Ken Gaillot <kgail...@redhat.com> schrieb am 29.06.2017 um 21:15 in >>>> Nachricht > <44ee8b24-fe14-a204-f791-248546c2f...@redhat.com>: >> On 06/29/2017 01:38 PM, Ludovic Vaugeois-Pepin wrote: >>&g

Re: [ClusterLabs] Problem with stonith and starting services

2017-07-06 Thread Ken Gaillot
On 07/04/2017 08:28 AM, Cesar Hernandez wrote: > >> >> Agreed, I don't think it's multicast vs unicast. >> >> I can't see from this what's going wrong. Possibly node1 is trying to >> re-fence node2 when it comes back. Check that the fencing resources are >> configured correctly, and check whether

Re: [ClusterLabs] fence_vbox Unable to connect/login to fencing device

2017-07-06 Thread Ken Gaillot
On 07/06/2017 10:13 AM, ArekW wrote: > Hi, > > It seems that my the fence_vbox is running but there are errors in > logs every few minutes like: > > Jul 6 12:51:12 nfsnode1 fence_vbox: Unable to connect/login to fencing device > Jul 6 12:51:13 nfsnode1 stonith-ng[7899]: warning:

Re: [ClusterLabs] fence_vbox Unable to connect/login to fencing device

2017-07-06 Thread Ken Gaillot
On 07/06/2017 10:29 AM, Ken Gaillot wrote: > On 07/06/2017 10:13 AM, ArekW wrote: >> Hi, >> >> It seems that my the fence_vbox is running but there are errors in >> logs every few minutes like: >> >> Jul 6 12:51:12 nfsnode1 fence_vbox: Unable to connect/logi

Re: [ClusterLabs] Problem with stonith and starting services

2017-07-06 Thread Ken Gaillot
On 07/06/2017 09:26 AM, Klaus Wenninger wrote: > On 07/06/2017 04:20 PM, Cesar Hernandez wrote: >>> If node2 is getting the notification of its own fencing, it wasn't >>> successfully fenced. Successful fencing would render it incapacitated >>> (powered down, or at least cut off from the network

Re: [ClusterLabs] Introducing the Anvil! Intelligent Availability platform

2017-07-05 Thread Ken Gaillot
Wow! I'm looking forward to the September summit talk. On 07/05/2017 01:52 AM, Digimer wrote: > Hi all, > > I suspect by now, many of you here have heard me talk about the Anvil! > intelligent availability platform. Today, I am proud to announce that it > is ready for general use! > >

[ClusterLabs] IPaddr2 cloning inside containers

2017-04-26 Thread Ken Gaillot
thinking about it. Pacemaker's new bundle feature doesn't support cloning the IPs it creates, but that might be an interesting future feature if this issue is resolved. -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.or

Re: [ClusterLabs] Problem with clone ClusterIP

2017-04-26 Thread Ken Gaillot
On 04/26/2017 02:45 AM, Bratislav Petkovic wrote: > Tahank you, > > > > We use the Cisco Nexus 7000 switches, they support Multicast MAC. > > It is possible that something is not configured correctly. > > In this environment working IBM PowerHA SystemMirror 7.1 (use Multicast) > without

Re: [ClusterLabs] in standby but still running resources..

2017-04-27 Thread Ken Gaillot
On 04/27/2017 08:29 AM, lejeczek wrote: > .. is this ok? > > hi guys, > > pcs shows no errors after I did standby node, but pcs shows resources > still are being ran on the node I just stoodby. > Is this normal? > > 0.9.152 @C7.3 > thanks > P. That should happen only for as long as it takes to

[ClusterLabs] Coming in Pacemaker 1.1.17: start a node in standby

2017-04-24 Thread Ken Gaillot
uot;online", and any manual setting of standby mode would be overwritten at the next boot. Many thanks to developers Alexandra Zhuravleva and Sergey Mishin, who contributed this feature as part of a project with EMC. -- Ken Gaillot <kgail...@redhat.com> ___

Re: [ClusterLabs] can't live migrate VirtualDomain which is part of a group

2017-04-24 Thread Ken Gaillot
On 04/24/2017 02:33 PM, Lentes, Bernd wrote: > > - On Apr 24, 2017, at 9:11 PM, Ken Gaillot kgail...@redhat.com wrote: > >>>> primitive prim_vnc_ip_mausdb IPaddr \ >>>>params ip=146.107.235.161 nic=br0 cidr_netmask=24 \ >>>>meta is

Re: [ClusterLabs] Question about fence_mpath

2017-04-28 Thread Ken Gaillot
On 04/28/2017 03:37 PM, Chris Adams wrote: > Once upon a time, Seth Reid said: >> This confused me too when I set up my cluster. I found that everything >> worked better if I didn't specify a device path. I think there was >> documentation on Redhat that led me to try removing

Re: [ClusterLabs] can't live migrate VirtualDomain which is part of a group

2017-04-25 Thread Ken Gaillot
On 04/25/2017 09:14 AM, Lentes, Bernd wrote: > > > - On Apr 24, 2017, at 11:11 PM, Ken Gaillot kgail...@redhat.com wrote: > >> On 04/24/2017 02:33 PM, Lentes, Bernd wrote: >>> >>> ----- On Apr 24, 2017, at 9:11 PM, Ken Gaillot kgail...@re

Re: [ClusterLabs] resource group vs colocation

2017-04-27 Thread Ken Gaillot
On 04/27/2017 02:02 PM, lejeczek wrote: > hi everyone > > I have a group and I'm trying to colocate - sounds strange - order with > the group is not how I want it. > I was hoping that with colocation set I can reorder the resources - can > I? Because .. something, or my is not getting there. > I

Re: [ClusterLabs] should such a resource set work?

2017-04-28 Thread Ken Gaillot
On 04/28/2017 08:17 AM, lejeczek wrote: > hi everybody > > I have a set: > > set IP2 IP2 IP2 LVM(exclusive) mountpoint smb smartd sequential=true ^^^ Is this a typo? > setoptions score=INFINITY > > it should work, right? > > yet when I standby a node and I see cluster jumps

Re: [ClusterLabs] Problem with clone ClusterIP

2017-04-25 Thread Ken Gaillot
On 04/25/2017 09:32 AM, Bratislav Petkovic wrote: > I want to make active/active cluster with two physical servers. > > On the servers are installed: oraclelinux-release-7.2-1.0.5.el7.x86_64, > > Pacemaker 1.1.13-10.el7, Corosync Cluster Engine, version '2.3.4', > > pcs 0.9.143. Cluster starts

Re: [ClusterLabs] big trouble with a DRBD resource

2017-08-04 Thread Ken Gaillot
b resource. Why complaining so often ? And > why stopping after ~20.000 traps ? > And complaining about not configured monitor operation just 8 times. I'm not really sure; I haven't used ClusterMon enough to say. If you have Pacemaker 1.1.15 or later, the alerts feature is preferred to ClusterMon

Re: [ClusterLabs] Fwd: Multi cluster

2017-08-04 Thread Ken Gaillot
roach, you should certainly be able to > model your fallback scenario, something like: > > - define a group A (VIP, apache, app), infinity-located with DC > - define a different group B with the same content, set up as clone > B_clone being (-infinity)-located with DC > - set

Re: [ClusterLabs] Notification agent and Notification recipients

2017-08-04 Thread Ken Gaillot
hanisms to achieve this ? > > > Regards, > Sriram. > > > ___ > Users mailing list: Users@clusterlabs.org > http://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting started: http://

Re: [ClusterLabs] big trouble with a DRBD resource

2017-08-07 Thread Ken Gaillot
On Mon, 2017-08-07 at 12:54 +0200, Lentes, Bernd wrote: > - On Aug 4, 2017, at 10:19 PM, Ken Gaillot kgail...@redhat.com wrote: > > > Unfortunately no -- logging, and troubleshooting in general, is an area > > we are continually striving to improve, but there are more to-

<    1   2   3   4   5   6   7   8   9   10   >