Re: [ClusterLabs] Antw: Re: Q: ordering for a monitoring op only?

2018-08-21 Thread Ken Gaillot
On Tue, 2018-08-21 at 07:49 +0200, Ulrich Windl wrote: > > > > Ken Gaillot wrote on 20.08.2018 at > > > > 16:49 in > > message > <1534776566.6465.5.ca...@redhat.com>: > > On Mon, 2018‑08‑20 at 10:51 +0200, Ulrich Windl wrote: > > >

Re: [ClusterLabs] Q: Forcing a role change of master/slave resource

2018-08-20 Thread Ken Gaillot
ing "-G" to see the current scores or "-v " to change them. The node with the highest score will be promoted. -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users

Re: [ClusterLabs] Q: ordering for a monitoring op only?

2018-08-20 Thread Ken Gaillot
had > written a monitor for HP-UX' cluster that did not have this problem, > even though the configuration files were read from NFS (It's not > magic: Just periodically copy them to shared memory, and read the > config from shared memory). > > Regards, > Ulrich -- Ken

Re: [ClusterLabs] Q: automaticlly remove expired location constraints

2018-08-23 Thread Ken Gaillot
7:26Z" > > One problem is that the date value is not a constant, and it had to > be compared against the current date > > Regards, > Ulrich crm_resource --clear -r RSC will clear all cli-* constraints -- Ken Gaillot ___ U

Re: [ClusterLabs] Q: (SLES11 SP4) lrm_rsc_op without last-run?

2018-08-23 Thread Ken Gaillot
> The node is not completely up-to-date, and it's using pacemaker- > 1.1.12-18.1... > > Regards, > Ulrich > > > _______ > Users mailing list: Users@clusterlabs.org > https://lists.clusterlabs.org/mailman/listinfo/users > > Proj

Re: [ClusterLabs] Redundant ring not recovering after node is back

2018-08-24 Thread Ken Gaillot
> Here you have some of my configuration settings on node 1 > > > > > > > (I probed > > > > > > > already > > > > > > > to change rrp_mode): > > > > > > > > > > > > > > *- corosync.conf* > > > > > > > > > > > > > > > > >

Re: [ClusterLabs] Spurious node loss in corosync cluster

2018-08-20 Thread Ken Gaillot
    } > } > service { >     name: pacemaker >     ver: 1 > } > amf { >     mode: disabled > } > > Thanks in advance for the help. > Prasad > > ___ > Users mailing list: Users@clusterlabs.org > https://lists.clusterlabs.

Re: [ClusterLabs] Different Times in the Corosync Log?

2018-08-21 Thread Ken Gaillot
g a > very > short interval sequentially (i.e. no intermittent failure recovered > with > a restart of lrmd, AFAICT).  In case it can have any bearing, how do > you start pacemaker -- systemd, initscript, as a corosync plugin, > something else? -- Ken Gaillot

Re: [ClusterLabs] Antw: Re: Spurious node loss in corosync cluster

2018-08-21 Thread Ken Gaillot
device utilization, then look for > network bottlenecks... > > A new corosync release cannot fix those, most likely. > > Regards, > Ulrich > > > > > In any case, for the current scenario, we did not see any > > scheduling > > related messages. > >

Re: [ClusterLabs] FYI: regression using 2.0.0 / 1.1.19 Pacemaker Remote node with older cluster nodes

2018-07-17 Thread Ken Gaillot
to leave this as a known issue, and rely on the workarounds. On Mon, 2018-07-16 at 09:21 -0500, Ken Gaillot wrote: > Hi all, > > The just-released Pacemaker 2.0.0 and 1.1.19 releases have an issue > when a Pacemaker Remote node is upgraded before the cluster nodes. > > Pacemaker 2.0.0 contain

Re: [ClusterLabs] ping Resource Agent doesnt work

2018-07-24 Thread Ken Gaillot
mailing list: Users@clusterlabs.org > https://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch. > pdf > Bugs: http://bugs.clusterlabs.org -- Ken Gaillot

[ClusterLabs] FYI: regression using 2.0.0 / 1.1.19 Pacemaker Remote node with older cluster nodes

2018-07-16 Thread Ken Gaillot
rading any Pacemaker Remote nodes (which is the recommended practice anyway). -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://www.clusterlabs.org Getting sta

Re: [ClusterLabs] ping Resource Agent doesnt work

2018-07-25 Thread Ken Gaillot
mailing list: Users@clusterlabs.org > https://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch. > pdf > Bugs: http://bugs.clusterlabs.org -- Ken Gaillot

Re: [ClusterLabs] Weird Fencing Behavior

2018-07-17 Thread Ken Gaillot
rue > > > > Quorum: > >   Options: > > > > > > > > ___ > > Users mailing list: Users@clusterlabs.org > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > Project Home: http://www.clusterlabs.o

Re: [ClusterLabs] Antw: Re: Q: ordering clones with interleave=false

2018-08-30 Thread Ken Gaillot
On Thu, 2018-08-30 at 08:28 +0200, Ulrich Windl wrote: > > > > Ken Gaillot wrote on 29.08.2018 at > > > > 20:30 in > > message > <1535567455.5594.5.ca...@redhat.com>: > > On Wed, 2018‑08‑29 at 13:30 +0200, Ulrich Windl wrote: > > > Hi!

Re: [ClusterLabs] Q: native_color scores for clones

2018-08-30 Thread Ken Gaillot
" resource has scores 0, 1, and -INFINITY, and the > ":1" resource has score 1 once and -INFINITY twice. > > When I look at the "clone_color" scores, the prm_DLM:* primitives > look as expected (no -INFINITY). However the cln_DLM clones have > score like 1,

Re: [ClusterLabs] Pacemaker startup retries

2018-08-30 Thread Ken Gaillot
\ > default-resource-stickiness=200 \ > stonith-timeout=180s \ > last-lrm-refresh=1534489943 > > > Thanks > > César Hernández Bañó -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org htt

Re: [ClusterLabs] Pacemaker startup retries

2018-08-31 Thread Ken Gaillot
the cluster itself) or some external program. If it's the cluster, I'd look at the "pengine:" logs on the DC before that, to see if there are any hints (node unclean, etc.). Then keep going backward until the ultimate cause is found. -- Ken Gaillot

Re: [ClusterLabs] Pacemaker startup retries

2018-09-05 Thread Ken Gaillot
> Filesystem(p_fs_datosweb)[962]: 2018/08/31_11:00:05 INFO: Running > start for /dev/drbd/by-res/datoswebstorage on /mnt/datosweb > Filesystem(p_fs_database)[961]: 2018/08/31_11:00:05 INFO: Running > start for /dev/drbd/by-res/databasestorage on /mnt/database > > > .. > > > Can

Re: [ClusterLabs] Antw: Rebooting a standby node triggers lots of transitions

2018-09-05 Thread Ken Gaillot
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratc > > h.pdf > > Bugs: http://bugs.clusterlabs.org > > > > ___________ > Users mailing list: Users@clusterlabs.org > https://lists.clusterlabs.org/mailman/listinfo/

Re: [ClusterLabs] Antw: Re: Antw: Q: native_color scores for clones

2018-09-05 Thread Ken Gaillot
On Wed, 2018-09-05 at 09:32 +0200, Ulrich Windl wrote: > > > > Ken Gaillot wrote on 04.09.2018 at > > > > 19:21 in message > > <1536081690.4387.6.ca...@redhat.com>: > > On Tue, 2018-09-04 at 11:22 +0200, Ulrich Windl wrote: > > >

Re: [ClusterLabs] Pacemaker startup retries

2018-09-05 Thread Ken Gaillot
> > > Oh :( I'm using Pacemaker-1.1.14. > Do you know if this reboot retries are just run 3 times? All the > tests I've done the rebooting is finished after 3 times. > > Thanks > Cesar No, if I remember correctly, it would just keep going until

Re: [ClusterLabs] Pacemaker startup retries

2018-09-05 Thread Ken Gaillot
On Wed, 2018-09-05 at 09:51 -0500, Ken Gaillot wrote: > On Wed, 2018-09-05 at 16:38 +0200, Cesar Hernandez wrote: > > Hi > > > > > > > > Ah, this rings a bell. Despite having fenced the node, the > > > cluster > > > still conside

Re: [ClusterLabs] Pacemaker startup retries

2018-09-05 Thread Ken Gaillot
rom source, you can apply the patch that fixes the issue to the 1.1.14 code base: https://github.com/ClusterLabs/pacemaker/commit/98457d1635db1222f93599b6021e662e766ce62d -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://list

Re: [ClusterLabs] About fencing stonith

2018-09-06 Thread Ken Gaillot
onfigure primitive RADIUS-IP ocf:heartbeat:IPaddr2 \ > params ip="192.168.0.9" nic="eth0" cidr_netmask="24" \ > op monitor interval=10s timeout=20s > crm configure primitive RADIUS lsb:freeradius op monitor interval=10s > timeout=20s > crm configure clone RADI

Re: [ClusterLabs] Q: ordering clones with interleave=false

2018-08-29 Thread Ken Gaillot
ifference, whether the resource cannot run on an online > node, or is unable due to a standby or offline node? > > Regards, > Ulrich Interleave=false only applies to instances that will be started in the current transition, so offline nodes don't prevent dependent resources from sta

Re: [ClusterLabs] Q: Resource Groups vs Resources for stickiness and colocation?

2018-08-29 Thread Ken Gaillot
6-12.el7.x86_64) > > /Ian This sounds like a bug. Feel free to submit a report at bugs.clusterlabs.org and attach the policy engine input file with the unexpected behavior. FYI a group's stickiness is the sum of the stickiness of each active member, though no score can be bigger than I

Re: [ClusterLabs] Problem with pacemaker resources when NTP sync is done

2018-07-04 Thread Ken Gaillot
to handle large time jumps. Jumps forward aren't too bad, but jumps backward can cause significant trouble. > #  pacemakerd --version > Pacemaker 1.1.16 > Written by Andrew Beekhof > # corosync -v > Corosync Cluster Engine, version '2.4.2' > Copyright (c) 2006-2009 R

Re: [ClusterLabs] Cluster from scratch - 7.6. Configure the Cluster for the DRBD device

2018-07-05 Thread Ken Gaillot
ieve the note about the version shipped with CentOS 7.1 is no longer an issue with recent versions. -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://www.clusterl

Re: [ClusterLabs] Antw: OCF Return codes OCF_NOT_RUNNING

2018-07-11 Thread Ken Gaillot
> if I have a resource threshold set >1,  i get start->monitor->stop > > cycle > > until the threshold is consumed > > Then either your start is broken, or your monitor is broken. Try to > validate your RA using ocf-tester before using it. > > Regards,

Re: [ClusterLabs] What triggers fencing?

2018-07-11 Thread Ken Gaillot
ason not to do this is that if you use 0, > > > > > > > then don't use > > > > > > > anything at all (0 is default), and any other value > > > > > > > causes avoidable > > > > > > > fence delays. > > >

Re: [ClusterLabs] Problem with pacemaker init.d script

2018-07-11 Thread Ken Gaillot
> > > > > > Project Home: http://www.clusterlabs.org > > > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scra > > > tch.pdf > > > Bugs: http://bugs.clusterlabs.org > > > > > > > ___ > > Users m

[ClusterLabs] Pacemaker 1.1.19 released

2018-07-11 Thread Ken Gaillot
nks to all contributors of source code to this release, including Andrew Beekhof, Gao,Yan, Hideo Yamauchi, Jan Pokorný, Ken Gaillot, and Klaus Wenninger. -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailma

Re: [ClusterLabs] Problem with pacemaker init.d script

2018-07-11 Thread Ken Gaillot
ation must be done first. So maybe the idea was to always require someone to specify run levels. But it does make more sense that they would be listed in the LSB header. One reason it wouldn't have been an issue before is some older distros use the init script's chkconfig header ins

Re: [ClusterLabs] Pacemaker alert framework

2018-07-06 Thread Ken Gaillot
could even combine everything into a single custom resource agent for use as a master/slave resource, where the master is the only instance that actually runs the resource, and the slaves just act on the notifications. > > Regards, > Klaus > > > Thanks > > > > /Ian.

[ClusterLabs] Pacemaker 2.0.0 has been released

2018-07-06 Thread Ken Gaillot
er Explained" document has grown large enough that topics related to cluster administration have been moved to their own new document, "Pacemaker Administration": http://clusterlabs.org/pacemaker/doc/ Many thanks to all contributors of source code to this release, including Andrew Beekho

Re: [ClusterLabs] Clearing failed actions

2018-07-09 Thread Ken Gaillot
> > Also, is there a way to clear one specific item from the list, or > > is clearing > > all the only option? > > pcs failcount reset [node] With the low level tools, you can use -r / --resource and/or -N / --node with crm_resource to limit the clean-up. --

Re: [ClusterLabs] Antw: Clone resource active only if all nodes are active

2018-01-22 Thread Ken Gaillot
_ > Users mailing list: Users@clusterlabs.org > http://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch. > pdf > Bugs: http://bugs.clusterlabs.org --

Re: [ClusterLabs] Resources stopped due to unmanage

2018-03-12 Thread Ken Gaillot
report. > The question is: is there a sane way to run VMs under pacemaker's  > control? If yes, is it described somewhere? > > > -- > Pavel -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.

Re: [ClusterLabs] copy file

2018-03-08 Thread Ken Gaillot
nt for data integrity. Native replication would avoid all that. > 2018-03-07 10:20 GMT+01:00 Klaus Wenninger <kwenn...@redhat.com>: > > On 03/07/2018 10:03 AM, Mevo Govo wrote: > > > Thanks for advices, I will try! > > > lados. > > > > >

Re: [ClusterLabs] [Problem]The pengine core dumps when changing attributes of bundle.

2018-03-09 Thread Ken Gaillot
r  9 10:10:21 rh74-test pacemakerd[17719]:  notice: Respawning > failed child process: pengine > Mar  9 10:10:21 rh74-test pacemakerd[17719]:    info: Using uid=990 > and group=984 for process pengine > Mar  9 10:10:21 rh74-test pacemakerd[17719]:    info: Forked child > 1

Re: [ClusterLabs] corosync 2.4 CPG config change callback

2018-03-14 Thread Ken Gaillot
into corosync :) > > > > > > > > > Regards, > > > > >    Honza > > > > > > > > > > > > > > > > > help would be appreciated, much thanks! > > > > > > > > > > > > cheers, > > > > > > Thomas >

[ClusterLabs] Pacemaker 2.0.0-rc2 now available

2018-04-06 Thread Ken Gaillot
We do many regression tests and simulations, but we can't cover all possible use cases, so your feedback is important and appreciated. Many thanks to all contributors of source code to this release, including Andrew Beekhof, Gao,Yan, Hideo Yamauchi, Jan Pokorný, and Ken Gaillot. Thanks also to Fabio DiNitt

Re: [ClusterLabs] Possible idea for 2.0.0: renaming the Pacemaker daemons

2018-04-09 Thread Ken Gaillot
er the convenient 15-character limit anyway. On Wed, 2018-03-28 at 12:40 -0500, Ken Gaillot wrote: > Hi all, > > Andrew Beekhof brought up a potential change to help with reading > Pacemaker logs. > > Currently, pacemaker daemon names are not intuitive, making it > difficult to

Re: [ClusterLabs] Possible idea for 2.0.0: renaming the Pacemaker daemons

2018-04-10 Thread Ken Gaillot
On Tue, 2018-04-10 at 08:50 +0200, Jehan-Guillaume de Rorthais wrote: > On Tue, 10 Apr 2018 00:54:01 +0200 > Jan Pokorný <jpoko...@redhat.com> wrote: > > > On 09/04/18 12:10 -0500, Ken Gaillot wrote: > > > Based on the list discussion and feedback I cou

Re: [ClusterLabs] STONITH forever?

2018-04-10 Thread Ken Gaillot
e_state[@uname='xxx-a']/transient_attributes: OK (rc=0, > origin=xxx-b/crmd/88, version=0.164.37) > > This then repeats forevermore ... > > Thanks for any hints, > > cheers, > > Stefan -- Ken Gaillot <kgail...@redhat.com> _

Re: [ClusterLabs] Antw: Re: Possible idea for 2.0.0: renaming the Pacemaker daemons

2018-04-11 Thread Ken Gaillot
On Wed, 2018-04-11 at 08:49 +0200, Ulrich Windl wrote: > > > > Ken Gaillot <kgail...@redhat.com> wrote on 09.04.2018 at > > > > 19:10 in message > > <1523293841.5734.7.ca...@redhat.com>: > > Based on the list discussion and feedback I

Re: [ClusterLabs] 答复: No slave is promoted to be master

2018-04-12 Thread Ken Gaillot
l-ha pgsqld notify=true > interleave=true; >   >   > Sometimes it reports the following error, how to configure to avoid > it? -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.o

Re: [ClusterLabs] General Capabilities Question

2018-04-13 Thread Ken Gaillot
ce/VIP stuff does, but you probably want to write your own OCF resource agent (see IPaddr2 as an example) to manage the IP, and let Pacemaker call it as needed. -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs

Re: [ClusterLabs] How to cancel a fencing request?

2018-04-09 Thread Ken Gaillot
On Tue, 2018-04-10 at 00:02 +0200, Jehan-Guillaume de Rorthais wrote: > On Tue, 03 Apr 2018 17:35:43 -0500 > Ken Gaillot <kgail...@redhat.com> wrote: > > > On Tue, 2018-04-03 at 21:46 +0200, Klaus Wenninger wrote: > > > On 04/03/2018 05:43 PM, Ken Gaillot wrote:

Re: [ClusterLabs] Failing operations immediately when node is known to be down

2018-04-13 Thread Ken Gaillot
it. What log messages do you see from corosync and pacemaker indicating that the node is down? Do you have fencing configured and tested? -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org https://lists.clus

Re: [ClusterLabs] How can I prevent multiple start of IPaddr 2 in an environment using fence_mpath?

2018-04-06 Thread Ken Gaillot
onf doesn't exist', >   last-rc-change='Fri Apr  6 13:16:39 2018', queued=0ms, exec=0ms > == > > We regard this behavior as a problem. > Is there a way to avoid this behavior? > > Regards, Yusuke Hi Yusuke, One possibility would be to implement network fabric fencing a

Re: [ClusterLabs] 答复: No slave is promoted to be master

2018-04-18 Thread Ken Gaillot
> > minutes after the cluster starts. > > > > Why is there about 15 minutes delay every time? > > This was a bug in Pacemaker up to 1.1.17. I did a report about this > last August and Ken Gaillot fixed it few days later in 1.1.18. See: > > https://lists.clusterlabs.or

Re: [ClusterLabs] Pacemaker resources are not scheduled

2018-04-16 Thread Ken Gaillot
> useful > > to check a basic sanity of the custom agents: > > https://github.com/ClusterLabs/resource-agents/tree/master/tools/oc > ft > > > I did run ocf-tester and the result was passed. Here I carefully read > the log. When this error was printed in the log, the late

Re: [ClusterLabs] Regarding patch submission for PCS

2018-04-23 Thread Ken Gaillot
best way is via a github pull request at: https://github.com/ClusterLabs/pcs If you are not familiar with github, or can't go that route for whatever reason, let us know. -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterla

Re: [ClusterLabs] Displaying "original" resources location scores?

2018-03-27 Thread Ken Gaillot
y to automatically calculate just a portion of the score. -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://www.clusterlabs.org Getting started

[ClusterLabs] Announcing the first ClusterLabs video karaoke contest!

2018-04-01 Thread Ken Gaillot
, And show the world your uptime. Keep serving all the things you can. Standby your node. Users list members will vote on all submissions, and the winner will receive a COMPLETE SET of all available ClusterLabs swag!* -- Ken Gaillot <kgail...@redhat.com> * DISCLAIMERS: ClusterLabs current

[ClusterLabs] Possible idea for 2.0.0: renaming the Pacemaker daemons

2018-03-28 Thread Ken Gaillot
d, PREFIX-state  crmd: PREFIX-controld, PREFIX-clusterd, PREFIX-controller  lrmd: PREFIX-locald, PREFIX-resourced, PREFIX-runner  pengine: PREFIX-policyd, PREFIX-scheduler  stonithd: PREFIX-fenced, PREFIX-stonithd, PREFIX-executioner pacemaker_remoted: PREFIX-remoted, PREFIX-remote --

Re: [ClusterLabs] symmetric-cluster=false doesn't work

2018-03-26 Thread Ken Gaillot
will be *probed* on every node (a one-time monitor action to check whether they are already running there), but they should only be *started* on allowed nodes. -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org http

Re: [ClusterLabs] Colocation constraint for grouping all master-mode stateful resources with important stateless resources

2018-03-26 Thread Ken Gaillot
> be active on the same node. It means that in your case of > > > >   > id="pcs_rsc_colocation_set_drbdfs_set_drbd.master_inside-interface- > > sameip.master_outside-interface-sameip.master" > > score="INFINITY"> > > > >

Re: [ClusterLabs] copy file

2018-03-26 Thread Ken Gaillot
(c1+c6). Before starting the db, you want a resource that checks whether the original config needs repair, and if so, copy it from the backup outside DRBD. It sounds like you should make a copy of the oracle agent, and modify its start action to do what you want. > 2018-03-08 20:12 GMT+01:00 Ken Gaill

Re: [ClusterLabs] Dependency loop

2018-03-26 Thread Ken Gaillot
raid that this could be the cause of my resources falling back > to a node that has recovered from a fail over although I have a > stickiness score of INFINITY. > > Thanks, > George > ___ > Users mailing list: Users@clusterlabs.org >

Re: [ClusterLabs] How to cancel a fencing request?

2018-04-02 Thread Ken Gaillot
> My questions are: > > > > > > 1. is it possible to cancel the fencing request  > > > 2. is it possible reset the node status to "online" ?  > > > > Not that I'm aware of. > > Argh! > > ++ You could fix the problem with the sto

Re: [ClusterLabs] Advisory order for cluster-managed resources

2018-04-03 Thread Ken Gaillot
e. > --  > Sam Gardner    > Trustwave | SMART SECURITY ON DEMAND -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://www.clu

Re: [ClusterLabs] How to cancel a fencing request?

2018-04-03 Thread Ken Gaillot
On Tue, 2018-04-03 at 07:36 +0200, Klaus Wenninger wrote: > On 04/02/2018 04:02 PM, Ken Gaillot wrote: > > On Mon, 2018-04-02 at 10:54 +0200, Jehan-Guillaume de Rorthais > > wrote: > > > On Sun, 1 Apr 2018 09:01:15 +0300 > > > Andrei Borzenkov <arvidj...@gmail.c

Re: [ClusterLabs] Possible idea for 2.0.0: renaming the Pacemaker daemons

2018-04-03 Thread Ken Gaillot
On Tue, 2018-04-03 at 08:33 +0200, Kristoffer Grönlund wrote: > Ken Gaillot <kgail...@redhat.com> writes: > > > > I > > > would vote against PREFIX-configd as compared to other cluster > > > software, > > > I would expect that daem

Re: [ClusterLabs] Advisory order for cluster-managed resources

2018-04-03 Thread Ken Gaillot
mplify things but > perhaps not. Groups are easier to follow if you have simple colocation+order sequences. Sets can help with more complicated set-ups, but they are tricky to get right and always difficult to read. -- Ken Gaillot <kgail...@redhat.com>

Re: [ClusterLabs] How to cancel a fencing request?

2018-04-03 Thread Ken Gaillot
On Tue, 2018-04-03 at 21:33 +0200, Jehan-Guillaume de Rorthais wrote: > On Mon, 02 Apr 2018 09:02:24 -0500 > Ken Gaillot <kgail...@redhat.com> wrote: > > On Mon, 2018-04-02 at 10:54 +0200, Jehan-Guillaume de Rorthais > > wrote: > > > On Sun, 1 Apr 2018 09:01:15 +03

Re: [ClusterLabs] How to cancel a fencing request?

2018-04-03 Thread Ken Gaillot
On Tue, 2018-04-03 at 21:46 +0200, Klaus Wenninger wrote: > On 04/03/2018 05:43 PM, Ken Gaillot wrote: > > On Tue, 2018-04-03 at 07:36 +0200, Klaus Wenninger wrote: > > > On 04/02/2018 04:02 PM, Ken Gaillot wrote: > > > > On Mon, 2018-04-02 at 10:54 +0200, Jehan-Guill

Re: [ClusterLabs] Possible idea for 2.0.0: renaming the Pacemaker daemons

2018-03-29 Thread Ken Gaillot
On Thu, 2018-03-29 at 10:35 +0200, Kristoffer Grönlund wrote: > Ken Gaillot <kgail...@redhat.com> writes: > > > Hi all, > > > > Andrew Beekhof brought up a potential change to help with reading > > Pacemaker logs. > > > > Currently,

Re: [ClusterLabs] Resource switchover taking more time upon shutting off one of the node in a 2 node cluster

2018-03-26 Thread Ken Gaillot
vinash Sharma > > On Fri, Feb 23, 2018 at 8:57 PM, Ken Gaillot <kgail...@redhat.com> > wrote: > > On Fri, 2018-02-23 at 16:15 +0530, avinash sharma wrote: > > > Subject: Switchover of resource(MS) 'RoutingManager' and resource > > > group 'floatingips',

Re: [ClusterLabs] Error observed while starting cluster

2018-03-21 Thread Ken Gaillot
list sent by peer for local node > Mar 20 10:55:45 [26932] pcmk3 pacemakerd: info: > mcp_cpg_deliver:    Ignoring process list sent by peer for local node > Mar 20 10:55:45 [26932] pcmk3 pacemakerd:    error: > pcmk_child_exit:    The crmd process (27037) exited: Key has expired >

Re: [ClusterLabs] state file not created for Stateful resource agent

2018-03-20 Thread Ken Gaillot
info/users > > Project Home: http://www.clusterlabs.org > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch. > pdf > Bugs: http://bugs.clusterlabs.org -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing li

Re: [ClusterLabs] Colocation constraint for grouping all master-mode stateful resources with important stateless resources

2018-03-23 Thread Ken Gaillot
        score="-INFINITY"> >           >         >       The above constraints keep inside-interface on a node where eth1 is good, and outside-interface on a node where eth2 is good. I'm guessing you want to keep these two constraints, and start over from scratch on

Re: [ClusterLabs] copy file

2018-03-05 Thread Ken Gaillot
direct access to the original file rather than a copy. -- Ken Gaillot <kgail...@redhat.com> ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://www.clusterlabs.org Getting

Re: [ClusterLabs] why some resources blocked

2018-03-02 Thread Ken Gaillot
"**" \ >   shutdown_method="immediate" \ >   op monitor interval=30s > > pcs -f clust_ora_cfg_tmp constraint colocation add ora_db_xe with > ora_listener INFINITY > pcs -f clust_ora_cfg_tmp constraint order promote ora_listener then > st

Re: [ClusterLabs] 答复: 答复: 答复: How to configure to make each slave resource has one VIP

2018-03-05 Thread Ken Gaillot
From: Users [mailto:users-boun...@clusterlabs.org] on behalf of Ken Gaillot > Sent: 2018-02-23 23:14 > To: Cluster Labs - All topics related to open-source clustering > welcomed <users@clusterlabs.org> > Subject: Re: [ClusterLabs] 答复: 答复: How to configure to make each slave > resource has one V

Re: [ClusterLabs] 答复: 答复: How to configure to make each slave resource has one VIP

2018-03-05 Thread Ken Gaillot
ttp://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf > > Bugs: http://bugs.clusterlabs.org > > ___ > > Users mailing list: Users@clusterlabs.org  > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > Project H

Re: [ClusterLabs] Antw: Re: Antw: Re: Resources not monitored in SLES11 SP4 (1.1.12-f47ea56)

2018-06-28 Thread Ken Gaillot
On Thu, 2018-06-28 at 09:09 +0200, Ulrich Windl wrote: > > > > Ken Gaillot wrote on 27.06.2018 at > > > > 16:18 in message > > <1530109097.6452.1.ca...@redhat.com>: > > On Wed, 2018-06-27 at 07:41 +0200, Ulrich Windl wrote: > > > > >

Re: [ClusterLabs] Install fresh pacemaker + corosync fails

2018-06-28 Thread Ken Gaillot
c features, I'd go with whatever stock packages are available for libqb and corosync. knet will be supported by corosync 3 and is bleeding-edge at the moment (though probably solid). If you do want to compile libqb and/or corosync, the guide on the wiki grabs the la

Re: [ClusterLabs] Pacemaker not restarting Resource on same node

2018-06-28 Thread Ken Gaillot
oing anything else. Certain OCF resource agent exit codes are considered "hard" errors that prevent retrying on the same node: missing dependencies, file permission errors, etc. -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org http

Re: [ClusterLabs] Antw: Re: Antw: Re: Resources not monitored in SLES11 SP4 (1.1.12-f47ea56)

2018-06-28 Thread Ken Gaillot
On Thu, 2018-06-28 at 09:13 +0200, Ulrich Windl wrote: > > > > Ken Gaillot wrote on 27.06.2018 at > > > > 16:32 in message > > <1530109926.6452.3.ca...@redhat.com>: > > On Wed, 2018-06-27 at 09:18 -0500, Ken Gaillot wrote: > > > On

Re: [ClusterLabs] Antw: Salvaging aborted resource migration

2018-09-27 Thread Ken Gaillot
Actually, I wouldn't mind getting rid > > of > > them altogether in any output.) > > ‑‑  > > Thanks, > > Feri > > ___________ > > Users mailing list: Users@clusterlabs.org  > > https://lists.clusterlabs.org/mailman/listin

Re: [ClusterLabs] Understanding the behavior of pacemaker crash

2018-09-27 Thread Ken Gaillot
> > Thanks in advance > Prasad > > _______ > Users mailing list: Users@clusterlabs.org > https://lists.clusterlabs.org/mailman/listinfo/users > > Project Home: http://www.clusterlabs.org > Getting started: http://www.c

Re: [ClusterLabs] Antw: pcmk 1.1.17: Which effective user is calling OCF agents for querying meta-data?

2018-09-27 Thread Ken Gaillot
s a big project and there are many more pressing issues to address. :-( There's no workaround within pacemaker, but the setfacl approach sounds useful. As a best practice, an agent's meta-data action should not do anything other than print meta-data. I.e. many agents have common initialization

Re: [ClusterLabs] Corosync 3 release plans?

2018-09-27 Thread Ken Gaillot
grotate is then configured to rotate by moving the log to a new name, sending the signal, then compressing the old log. -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project

Re: [ClusterLabs] Corosync 3 release plans?

2018-09-27 Thread Ken Gaillot
On Thu, 2018-09-27 at 09:58 -0500, Ken Gaillot wrote: > On Thu, 2018-09-27 at 15:32 +0200, Ferenc Wágner wrote: > > Christine Caulfield writes: > > > > > TBH I would be quite happy to leave this to logrotate but the > > > message I > > > was getting

Re: [ClusterLabs] Corosync 3 release plans?

2018-09-27 Thread Ken Gaillot
On Thu, 2018-09-27 at 16:09 +0100, Christine Caulfield wrote: > On 27/09/18 16:01, Ken Gaillot wrote: > > On Thu, 2018-09-27 at 09:58 -0500, Ken Gaillot wrote: > > > On Thu, 2018-09-27 at 15:32 +0200, Ferenc Wágner wrote: > > > > Christine Caulfield writes: > >

Re: [ClusterLabs] Antw: pcmk 1.1.17: Which effective user is calling OCF agents for querying meta-data?

2018-09-27 Thread Ken Gaillot
r/lib/ocf/lib/heartbeat/ocf-shellfuncs. These OCFs would fail > > > miserably. > > > So before I revisit all our OCFs to check if the well-behave if > > > called > > > as non-root, I wanted to check if there is another way. > > > > > > Thanks,

Re: [ClusterLabs] Antw: Salvaging aborted resource migration

2018-09-27 Thread Ken Gaillot
On Thu, 2018-09-27 at 18:00 +0200, Ferenc Wágner wrote: > Ken Gaillot writes: > > > On Thu, 2018-09-27 at 09:36 +0200, Ulrich Windl wrote: > > > > > Obviously you violated the most important cluster rule that is > > > "be > > > pa

Re: [ClusterLabs] Position of pacemaker in today's HA world

2018-10-05 Thread Ken Gaillot
e also highly relevant. Tighter integration with these would go a long way toward establishing longevity. That brings up another challenge, which is developer resources. It is difficult to keep up with triaging bug reports much less handling them. We occasionally have the opportunity to add signif

Re: [ClusterLabs] weird corosync - [TOTEM ] FAILED TO RECEIVE

2018-10-12 Thread Ken Gaillot
end turning on debug logging in corosync.conf, and posting the log here. Hopefully one of the corosync developers can chime in at that point. -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users P

Re: [ClusterLabs] How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-15 Thread Ken Gaillot
> > Users mailing list: Users@clusterlabs.org > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > Project Home: http://www.clusterlabs.org > > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratc >

Re: [ClusterLabs] Re: How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-15 Thread Ken Gaillot
  Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida >  16, A‑1 (Diversia) ,  Alcobendas ,  Madr > id,  28108 ,  Spain   On > 15/10/18 16:27, Ken Gaillot wrote: > > On Mon, 2018-10-15 at 14:39 +0200, Klaus Wenninger wrote: > > >

Re: [ClusterLabs] Re: How to generate RPMs for Pacemaker release 2.x on Centos

2018-10-17 Thread Ken Gaillot
8 249 >  |  M: +34 619 728 249  |  > franciscojavier.lo...@solera.com  |  Solera.com >   Audatex Datos, S.A.  |  Avda. de Bruselas, 36, Salida >  16, A‑1 (Diversia) ,  Alcobendas ,  Madr > id,  28108 ,  Spain   On

Re: [ClusterLabs] New cluster.target to control cluster services

2018-10-22 Thread Ken Gaillot
cemaker-cluster? clusterlabs-ha? high-availability?). The only drawback I see is that it's theoretically possible to deploy a different membership layer than corosync (in the past others were supported, and that may happen again in the future), and possible to run corosync without pacema

Re: [ClusterLabs] Fwd: Not getting Fencing monitor alerts

2018-10-17 Thread Ken Gaillot
> > > > > >     pcs property set stonith-enabled=true > > > > > > > > > Thanks, > > > Rohit > > > > > >

Re: [ClusterLabs] About the Pacemaker

2018-10-23 Thread Ken Gaillot
eration is one of the most commonly used Pacemaker features. You have the flexibility of failing over any combination of resources you want. Look into clone resources, master/slave clones, colocation constraints, and the on-fail property of operations. -- Ken Gaillot ___

Re: [ClusterLabs] Floating IP active in both nodes

2018-10-26 Thread Ken Gaillot
nfo/users > > >  > > > Project Home: http://www.clusterlabs.org > > > Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scra > > tch.pdf > > > Bugs: http://bugs.clusterlabs.org > > >  > > > > ___

[ClusterLabs] Coming in 2.0.1 / 1.1.20: sbd compatibility with guest nodes and bundles

2018-11-05 Thread Ken Gaillot
k properly (with no loss of safety). -- Ken Gaillot ___ Users mailing list: Users@clusterlabs.org https://lists.clusterlabs.org/mailman/listinfo/users Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_S
