Re: [ClusterLabs] PAF with postgresql 13?

2022-03-08 Thread Ken Gaillot
g Ken?). Higher-level tools like pcs or crm shell could probably do it when removing the resource (i.e. if the resource was a promotable clone, check for and remove any node attributes of the form master-$RSC_ID). That sounds like a good idea to me. Pacemaker would be a bad place to do it becau

Re: [ClusterLabs] Pacemaker API (REST, SOAP, Java library)?

2022-03-08 Thread Ken Gaillot
and crm_mon will be added with the next release. In the meantime, most people just execute the command-line tools directly from their code. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: Re: Antw: [EXT] Re: Noticed oddity when DC is going to be fenced

2022-03-04 Thread Ken Gaillot
On Fri, 2022-03-04 at 08:17 +0100, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 02.03.2022 um > > > > 16:10 in > Nachricht > : > > On Wed, 2022-03-02 at 08:41 +0100, Ulrich Windl wrote: > > > > > > Ken Gaillot schrieb

Re: [ClusterLabs] Antw: [EXT] Re: Noticed oddity when DC is going to be fenced

2022-03-02 Thread Ken Gaillot
On Wed, 2022-03-02 at 08:41 +0100, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 01.03.2022 um > > > > 16:04 in > Nachricht > <463458e414f7c411eb1107335be6ee9a6e2d13ee.ca...@redhat.com>: > > On Tue, 2022‑03‑01 at 10:05 +0100, Ulrich Windl wrote: >

Re: [ClusterLabs] Noticed oddity when DC is going to be fenced

2022-03-01 Thread Ken Gaillot
; > (pacemaker-2.0.5+20201202.ba59be712-150300.4.16.1.x86_64) > > Did I misunderstand something, or does it look like a bug? > > Regards, > Ulrich > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] crm resource stop VirtualDomain - but VirtualDomain shutdown start some minutes later

2022-02-18 Thread Ken Gaillot
tarting the stop of the other > resources. > Cluster tried to "abort" the shutdown, but shutdown can't be aborted. > And i had bad luck that the shutdown of this domain took so long. > > Correct ? > > Bernd > Yes, other than the cluster isn't trying to abort the shutdown, it's just discarding any actions that were planned after it in the same transition. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] crm resource stop VirtualDomain - but VirtualDomain shutdown start some minutes later

2022-02-17 Thread Ken Gaillot
ew actions from the transition -- but any actions currently in flight must complete before the new transition can be calculated. Changes that abort a transition include configuration changes, a node joining or leaving, an unexpected action result being received, a node attribute changing, the cluster-r

Re: [ClusterLabs] crm resource stop VirtualDomain - but VirtualDomain shutdown start some minutes later

2022-02-16 Thread Ken Gaillot
s "complete=false" > ? > What does that mean ? > > Bernd "Complete" is whether all actions originally planned in the transition were completed. For complete=true, the log is basically just a heads-up that the cluster needs to recheck things, since there's nothing to actually abort. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] crm resource stop VirtualDomain - but VirtualDomain shutdown start some minutes later

2022-02-16 Thread Ken Gaillot
le transitions. Often when some event is happening, lots of micro-conditions (action results, node attribute changes, etc.) change in a short time, and you'll see a new transition after each one. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Xen, SLES15, libvirt live-migration and a fencing loop

2022-02-16 Thread Ken Gaillot
kernel. > But still, when booting correctly, the cluster would still try to > "recover" from the false "is active on 2 nodes", so the true fix was > a manual "cleanup" of the migration entries. > > Pacemaker &qu

Re: [ClusterLabs] crm resource stop VirtualDomain - but VirtualDomain shutdown start some minutes later

2022-02-15 Thread Ken Gaillot
nstead. Plus maybe replace "aborted" with "interrupted", so then we'd have "Action plan interrupted" which is maybe a little more understandable. > > Transition 128 is finished: > Feb 15 21:04:26 [15370] ha-idg-2 crmd: notice: > run_graph: Transition 128 (Complete=1, Pending=0, Fired=0, > Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input- > 3548.bz2): Complete > > And one second later the shutdown starts. Is that normal that there > is such a big time gap ? > > Bernd No, there should be another transition calculated (with a "saving input" message) immediately after the original transition is aborted. What's the timestamp on that? -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Re: what is the "best" way to completely shutdown a two-node cluster ?

2022-02-11 Thread Ken Gaillot
utilization counting "static" resource consumption, one > could have a > dynamic resource consumption (counting semaphore-like) that is > consumed while > an operation on an instance naming that resource is being performed. > So when you name your resource "concurrent_vm_ops" and asign that to > every vm > configuration, eventually initalizing the resource to siome thing > like 2 or 3, > then you could limit the concurrent VM invocations. Likewise, for > less heave > instances you could use more relaxed settings or no restrictions at > all... > > Regards, > Ulrich > You can accomplish something similar with an ordering constraint with kind=Serialize. In the case of "start vm1 then start vm2" with kind=Serialize, it means that vm1 and vm2 will not be started simultaneously, but neither actually requires the other or has to be done in a specific order. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Is there a python package for pacemaker ?

2022-02-02 Thread Ken Gaillot
to execute the Pacemaker command-line tools to do what you want. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Removing a resource without stopping it

2022-01-31 Thread Ken Gaillot
but hopefully > it > at least clarified a bit what's going on. > > > Regards, > Tomas > > > Dne 29. 01. 22 v 6:12 Digimer napsal(a): > > On 2022-01-29 00:10, Digimer wrote: > > > On 2022-01-28 16:54, Ken Gaillot wrote: > > >

Re: [ClusterLabs] Removing a resource without stopping it

2022-01-28 Thread Ken Gaillot
, which includes the is-managed setting, Pacemaker no longer knows the resource is unmanaged. And even if you set it via resource defaults or something, eventually you have to set it back, at which point Pacemaker will still have the same response. -- Ken Gaillot _

Re: [ClusterLabs] Pacemaker managing Keycloak

2022-01-28 Thread Ken Gaillot
vance. I'd check for SELinux denials first. A command executed from the command line is unconstrained, while being executed by a daemon is subject to SELinux policies. Other than that, maybe turn on any debugging options and check the keycloak logs from the container (e.g. using network logging or an e

[ClusterLabs] Native Chinese speaker wanted to proof a few translations

2022-01-27 Thread Ken Gaillot
al English, and the "msgstr" entries are the translations. You can either review them in github or reply here. Thanks to anyone who can help! -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users Clu

Re: [ClusterLabs] Reduce failover time by concurent stop of 2 RG

2022-01-27 Thread Ken Gaillot
> both Resource Groups to improve the failover time? > > BR, > J. Gogu The only thing I can think of is on-fail=fence -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Re: Feedback wanted: Native language support for Pacemaker help output

2022-01-13 Thread Ken Gaillot
slated, when a more complex setup is > needed -> you will always have to search in the source/github > issues/documentation/mailing list history and rely on English. > > Best Regards, > Strahil Nikolov > > > On Tue, Jan 11, 2022 at 9:23, Ulrich Windl > >

Re: [ClusterLabs] Feedback wanted: Native language support for Pacemaker help output

2022-01-10 Thread Ken Gaillot
Re-raising this due to the recent holidays ... Is translation of Pacemaker option help and man pages something people would like to see? Would anyone be willing to contribute or proofread translations if the tools were easy? On Fri, 2021-12-03 at 15:02 -0600, Ken Gaillot wrote: > Hi

Re: [ClusterLabs] [IMPORTANT] CI update

2022-01-04 Thread Ken Gaillot
is back online 100%. > > > > Fabio > > ___ > > Manage your subscription: > > https://lists.clusterlabs.org/mailman/listinfo/developers > > > > ClusterLabs home: https://www.clusterlabs.org/ >

Re: [ClusterLabs] VirtualDomain - unable to migrate

2022-01-04 Thread Ken Gaillot
monitor interval=30s > (c8kubermaster1-monitor-interval-30s) >start interval=0s timeout=60s > (c8kubermaster1-start-interval-0s) >stop interval=0s timeout=60s > (c8kubermaster1-stop-interval-0s) > > Any and all suggestions & thoughts

Re: [ClusterLabs] VirtualDomain - started but... not really

2021-12-16 Thread Ken Gaillot
On Sat, 2021-12-11 at 13:49 +, lejeczek via Users wrote: > > On 10/12/2021 21:17, Ken Gaillot wrote: > > On Fri, 2021-12-10 at 16:33 +, lejeczek via Users wrote: > > > Hi guys. > > > > > > I quite often.. well, to frequently in my mind, see a VM &

[ClusterLabs] FYI: fence history display regression in Pacemaker 2.1.2

2021-12-15 Thread Ken Gaillot
the master branch as a patch. Since this only affects the display, there are no plans for a special release. The fix will land in the next normal release, expected around the middle of 2022. -- Ken Gaillot ___ Manage your subscription: https

Re: [ClusterLabs] VirtualDomain - started but... not really

2021-12-10 Thread Ken Gaillot
>migrate_to interval=0s timeout=180s > (c8kubermaster2-migrate_to-interval-0s) >monitor interval=30s > (c8kubermaster2-monitor-interval-30s) >start interval=0s timeout=90s > (c8kubermaster2-start-interval-0s) >

Re: [ClusterLabs] pcs update resource command not working

2021-12-09 Thread Ken Gaillot
-Ei 'pcs|pacemaker|corosync' > pacemaker-2.0.2-2.el7.x86_64 > corosync-2.4.4-2.el7.x86_64 > pcs-0.9.169-1.el7.x86_64 > [root@node01 testadmin]# > > Thanks and Regards, > S Sathish S -- Ken Gaillot ___ Manage your subscription: http

[ClusterLabs] Feedback wanted: Native language support for Pacemaker help output

2021-12-03 Thread Ken Gaillot
ace for translations, but with this initial proof-of-concept, it involves github pull requests and reviews. Thoughts? -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Pacemaker 2.1.2 final release now available

2021-11-24 Thread Ken Gaillot
due to internal cluster issues as opposed to agent issues. As usual, it also includes a number of bug fixes. Many thanks to all contributors of source code to this release, including Chris Lumens, Ferenc Wágner, Gao,Yan, Grace Chin, Hideo Yamauchi, Ken Gaillot, Klaus Wenninger, and Oyvind

Re: [ClusterLabs] Which verson of pacemaker/corosync provides crm_feature_set 3.0.10?

2021-11-23 Thread Ken Gaillot
upgrade past 1.1.15 would put you in the same situation -- if the 1.1.15 node leaves the cluster, it can't rejoin until it's upgraded to the newer version. > Thank you very much for your help! > _Vitaly > > > On November 23, 2021 5:12 PM Ken Gaillot > > wrote: > >

Re: [ClusterLabs] Which verson of pacemaker/corosync provides crm_feature_set 3.0.10?

2021-11-23 Thread Ken Gaillot
seen at: https://wiki.clusterlabs.org/wiki/ReleaseCalendar 1.1.13 through 1.1.15 had feature set 3.0.10 > 3. Where could I get source rpms to rebuild this rpm on CentOs 8? > Thanks a lot! > _Vitaly Zolotusky The stock packages in the repos should be fine. All new

Re: [ClusterLabs] resource start after network reconnected

2021-11-19 Thread Ken Gaillot
ain the intermittence I > > > saw > > > this > > > morning. If I set that to 1 minute would that cause any gross > > > negative > > > issues? > > > > It increases CPU usage and IPC traffic. For Pacemaker 2.0.3 or > > later,

Re: [ClusterLabs] resource start after network reconnected

2021-11-19 Thread Ken Gaillot
to 1 minute would that cause any gross > negative > issues? It increases CPU usage and IPC traffic. For Pacemaker 2.0.3 or later, I definitely wouldn't bother. For older versions, 1 minute feels a bit much, I would go with around 5. > > Is there another setting besides cluster-recheck-

[ClusterLabs] Pacemaker 2.1.2-rc2 now available

2021-11-16 Thread Ken Gaillot
contributors of source code to this release, including Chris Lumens, Ferenc Wágner, and Ken Gaillot. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Fence node when network interface goes down

2021-11-12 Thread Ken Gaillot
se. Any help is appreciated! > > Thanks. Failure handling is configurable via the on-fail meta-attribute. You can set on-fail=fence for the ethmonitor resource's monitor action to fence the node if the monitor fails. There's also on-fail=standby, but that will still try to stop any

Re: [ClusterLabs] Antw: [EXT] Inquiry - remote node fencing issue

2021-11-05 Thread Ken Gaillot
On Fri, 2021-11-05 at 11:22 +0300, Andrei Borzenkov wrote: > On 05.11.2021 01:20, Ken Gaillot wrote: > > > There are two issues discussed in this thread. > > > > > > 1. Remote node is fenced when connection with this node is lost. > > > For > > > al

Re: [ClusterLabs] LVM and Filesystem resources - ordering and starting/stopping as a unit

2021-11-05 Thread Ken Gaillot
two resource start and > stop in a particular order, but also define that if one has to be > started or stopped the other must as well (in my defined order). > > Thanks. > > Greg Neitzert | Lead Software Engineer | RTC Software Engineering 2B > - Middleware > Unisys Cor

Re: [ClusterLabs] Antw: [EXT] Inquiry - remote node fencing issue

2021-11-04 Thread Ken Gaillot
On Sat, 2021-10-30 at 21:17 +0300, Andrei Borzenkov wrote: > On 29.10.2021 18:37, Ken Gaillot wrote: > ... > > > > > To address the original question, this is the log sequence I > > > > > find > > > > > most > > > > > relevant: >

Re: [ClusterLabs] (no subject)

2021-11-04 Thread Ken Gaillot
nc somewhere. I'm not aware of what might cause that. I'd check the corosync logs for clues. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Pacemaker 2.1.2-rc1 now available

2021-11-04 Thread Ken Gaillot
Gaillot, Klaus Wenninger, and Oyvind Albrigtsen. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Inquiry - remote node fencing issue

2021-10-29 Thread Ken Gaillot
On Fri, 2021-10-29 at 18:18 +0300, Andrei Borzenkov wrote: > On 29.10.2021 18:16, Andrei Borzenkov wrote: > > On 29.10.2021 17:53, Ken Gaillot wrote: > > > On Fri, 2021-10-29 at 13:59 +, Gerry R Sommerville wrote: > > > > Hey Andrei, > > > >

Re: [ClusterLabs] Antw: [EXT] Coming in Pacemaker 2.1.2: friendlier failed action display

2021-10-29 Thread Ken Gaillot
On Fri, 2021-10-29 at 08:24 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 28.10.2021 um > > > > 17:28 in > Nachricht > : > > Hi all, > > > > I hope to release the first release candidate for Pacemaker 2.1.2 > > next

Re: [ClusterLabs] Antw: [EXT] Inquiry - remote node fencing issue

2021-10-29 Thread Ken Gaillot
e fenced. There was nowhere to recover the connection. I'd need to see the CIB from that time to know why; it's possible you had an old constraint banning the connection from the other node (e.g. from a ban or move command), or something like that. -- Ken Gaillot _

[ClusterLabs] Coming in Pacemaker 2.1.2: friendlier failed action display

2021-10-28 Thread Ken Gaillot
pointer in the right direction). -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Corosync 2 vs Corosync 3

2021-10-25 Thread Ken Gaillot
t), but I don't think they're worth staying on 2. > Thanks, > > Toby Haynes -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Coming in Pacemaker 2.1.2: better display of internal failures

2021-10-20 Thread Ken Gaillot
On Wed, 2021-10-20 at 09:35 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 19.10.2021 um > > > > 19:16 in > Nachricht > : > > Hi all, > > > > I hope to get the first release candidate for Pacemaker 2.1.2 out > > in a > >

Re: [ClusterLabs] Coming in Pacemaker 2.1.2: better display of internal failures

2021-10-19 Thread Ken Gaillot
ueued=0ms, > exec=0ms > > Now, the exit reason will plainly say what happened: > > * rsc1_monitor_6 on node1 'error' (1): call=188, status='Error', > exitreason='Process interrupted by signal', last-rc-change='Fri Sep > 24 > 14:45:02 2021', queued=0ms, exec=0ms >

[ClusterLabs] Coming in Pacemaker 2.1.2: better display of internal failures

2021-10-19 Thread Ken Gaillot
queued=0ms, exec=0ms -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Trying to understand dampening (ping)

2021-10-14 Thread Ken Gaillot
ny group or constraints. > The same is true with colocation statements - if a constraint is made > with a ping resource without using a rule that references pingd then > the dampen behaviour is ignored completely. > > Is the ping'er missing something that does this: > > score

Re: [ClusterLabs] No link to https://clusterlabs.org/pacemaker/man/ from main page

2021-10-14 Thread Ken Gaillot
ction. It's in the Developers section, which makes no sense really. I'll cross-link it from Documentation when I get the chance ... -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] inquiry - remote node fails over

2021-10-13 Thread Ken Gaillot
How can I configure the cluster so that remote node srv-4 fails over > instead of getting fenced? > > > Thank you > > Janghyuk Boo. Hi, That is how it works whenever possible. If it fences the remote, it is because it was not recoverable. Logs from srv-1, srv-2, and srv-4 around that time would be helpful to give more detail. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: Re: Antw: Re: Antw: [EXT] Coming in Pacemaker 2.1.2: new fencing configuration options

2021-10-12 Thread Ken Gaillot
ejoining), the CIB versions will be compared, and the newest one wins (actually more like the one with the most changes). Generally, the existing cluster had more activity after the node was fenced, and the fenced node has little to no activity before it rejoins the cluster, so it works out well. However

Re: [ClusterLabs] Antw: [EXT] Coming in Pacemaker 2.1.2: new fencing configuration options

2021-10-08 Thread Ken Gaillot
On Fri, 2021-10-08 at 08:18 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 07.10.2021 um > > > > 22:53 in > Nachricht > <8bec6dc04c52d4ac5c2a8055eb7bae455f5a449d.ca...@redhat.com>: > > Hi all, > > > > We're looking ahead to the

[ClusterLabs] Coming in Pacemaker 2.1.2: new fencing configuration options

2021-10-07 Thread Ken Gaillot
w supports backslash-escaped characters (such as spaces) in the mapped name. For example, you could set pcmk_host_map="node1:Plug\ 1;node2:Plug\ 2" if the device expects "Plug 1" and "Plug 2" as the names. -- Ken Gaillot _

Re: [ClusterLabs] Move a resource only where another has Started

2021-10-07 Thread Ken Gaillot
got those > right.) > > I suppose another way to go about this would be to run another > clone'd resource that mimics the ping and automatically runs a > "resource cleanup MyGw-clone" if it notices the clone is down on and > node and the ping would succeed. But is there a cleaner way? > > Thanks, > D. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Constraint question

2021-10-05 Thread Ken Gaillot
abases are stopped? > > Best Regards, > John Hi, Yes, make the colocation optional (i.e. use a finite integer score) -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] corosync/pacemaker resources start after reboot - incorrect node ID calculated

2021-09-28 Thread Ken Gaillot
corosync traffic should go over (cluster token passing and control messages). Often this is a dedicated network so cluster traffic can't be crowded out by a networking spike (which might otherwise lead to fencing). -- Ken Gaillot ___ Manage your

Re: [ClusterLabs] Problem with high load (IO)

2021-09-27 Thread Ken Gaillot
t for an operation that does already have an explicit timeout, you'll have to use "pcs cluster edit" or cibadmin with the raw XML. > > On Mon, Sep 27, 2021 at 19:56, Ken Gaillot > > wrote: > > On Mon, 2021-09-27 at 12:37 +0200, Lentes, Bernd wrote: > > > Hi, >

Re: [ClusterLabs] Problem with high load (IO)

2021-09-27 Thread Ken Gaillot
ce mode for the window, or disable the monitor for the window. Of course that also disables any cluster response. You could instead lengthen operation timeouts during the window. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] cofigured trace for Virtual Domains - automatic restart ?

2021-09-20 Thread Ken Gaillot
On Sat, 2021-09-18 at 08:29 +0300, Andrei Borzenkov wrote: > On 17.09.2021 22:13, Ken Gaillot wrote: > > On Fri, 2021-09-17 at 15:54 +0200, Lentes, Bernd wrote: > > > Hi, > > > > > > today i configured tracing for some VirtualDomains: > > > > >

Re: [ClusterLabs] cofigured trace for Virtual Domains - automatic restart ?

2021-09-20 Thread Ken Gaillot
; > It > > would be nice to come up with a general solution that all agents > > can > > use rather than modify each agent's meta-data individually, but > > either > > approach would work. > > Bernd -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] cofigured trace for Virtual Domains - automatic restart ?

2021-09-17 Thread Ken Gaillot
configuration change that requires a restart. With the new > > OCF > > 1.1 standard, the trace parameter could be marked reloadable, but > > the > > agents need to be updated to do that. > > -- > > Ken Gaillot > > Hi Ken, > > but does pacemaker do it

[ClusterLabs] Pacemaker 2.1.1 final release now available

2021-09-09 Thread Ken Gaillot
thanks to all contributors of source code to this release, including Chris Lumens, Christine Caulfield, Emil Penchev, Gao,Yan, Grace Chin, Hideo Yamauchi, José Guilherme Vanz, Ken Gaillot, Klaus Wenninger, and Oyvind Albrigtsen. -- Ken Gaillot ___ Manage

Re: [ClusterLabs] 回复: (no subject)

2021-09-09 Thread Ken Gaillot
1 to other network segments, the same test cluster can > switch normally. Does the systemd journal or /var/log/pacemaker/pacemaker.log show any interesting messages around the time of the test? > > ------ 原始邮件 -- > 发件人: "Ken Gaillot" ; > 发送时间: 2021

Re: [ClusterLabs] (no subject)

2021-09-03 Thread Ken Gaillot
figure a recurring monitor on the IP resource? Without a monitor, the cluster will only start the resource. > journalctl will tell you a lot of cluster messages and might tell you > the > reason behind. > > [1] https://access.redhat.com/solutions/30564 > > Cheers, > Roger &g

[ClusterLabs] Pacemaker 2.1.1-rc3 now available (last chance to test!)

2021-08-16 Thread Ken Gaillot
-2.1.1-rc3 For details, see the ChangeLog in the source repository. Everyone is encouraged to download, compile and test the new release. We do many regression tests and simulations, but we can't cover all possible use cases, so your feedback is important and appreciated. -- Ken Gaillot

[ClusterLabs] Pacemaker 2.1.0-rc1 now available

2021-04-30 Thread Ken Gaillot
ulations, but we can't cover all possible use cases, so your feedback is important and appreciated. Many thanks to all contributors of source code to this release, including Aleksei Burlakov, Chris Lumens, Christine Caulfield, Gao,Yan, Grace Chin, Hideo Yamauchi, Jan Pokorný, Kazunori INOUE, Ke

Re: [ClusterLabs] Resolving cart before the horse with mounted filesystems.

2021-04-30 Thread Ken Gaillot
atthew, can you give more details about your use case? If you only need one NFS server, then it's fine not to clone it. The groups can be ordered after the NFS server without requiring them to be colocated with it. > > On Fri, Apr 30, 2021 at 18:44, Matthew Schumacher > > wrote: > >

Re: [ClusterLabs] VirtualDomain & "deeper" monitors - what/how?

2021-04-30 Thread Ken Gaillot
On Fri, 2021-04-30 at 19:24 +0300, Andrei Borzenkov wrote: > On 30.04.2021 17:57, Ken Gaillot wrote: > > On Fri, 2021-04-30 at 11:00 +0100, lejeczek wrote: > > > Hi guys > > > > > > I'd like to ask around for thoughts & suggestions on any > >

Re: [ClusterLabs] VirtualDomain & "deeper" monitors - what/how?

2021-04-30 Thread Ken Gaillot
t only makes sense if restarting whatever resource is listed as the container can make the dependent resource OK again. > > > On Fri, Apr 30, 2021 at 17:57, Ken Gaillot > > wrote: > > On Fri, 2021-04-30 at 11:00 +0100, lejeczek wrote: > > > Hi guys > > >

Re: [ClusterLabs] Resolving cart before the horse with mounted filesystems.

2021-04-30 Thread Ken Gaillot
t; ___________ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] VirtualDomain & "deeper" monitors - what/how?

2021-04-30 Thread Ken Gaillot
ce being monitored is inside a VirtualDomain, named vm1 for example, you can configure the nagios resource with the resource meta- attribute container="vm1". If the nagios check fails, Pacemaker will restart vm1. (This feature was added before OS containers became popular, so the naming

Re: [ClusterLabs] Antw: [EXT] Re: [Problem] In RHEL8.4beta, pgsql resource control fails.

2021-04-29 Thread Ken Gaillot
On Thu, 2021-04-29 at 09:14 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 28.04.2021 um > > > > 19:00 in > > Nachricht > : > > On Wed, 2021‑04‑28 at 18:14 +0200, Jehan‑Guillaume de Rorthais > > wrote: > > > Hi all, > > >

Re: [ClusterLabs] [Problem] In RHEL8.4beta, pgsql resource control fails.

2021-04-29 Thread Ken Gaillot
On Wed, 2021-04-28 at 19:19 +0200, Jehan-Guillaume de Rorthais wrote: > On Wed, 28 Apr 2021 12:00:40 -0500 > Ken Gaillot wrote: > > > On Wed, 2021-04-28 at 18:14 +0200, Jehan-Guillaume de Rorthais > > wrote: > > > Hi all, > > > > > > It seem

Re: [ClusterLabs] [Problem] In RHEL8.4beta, pgsql resource control fails.

2021-04-28 Thread Ken Gaillot
tire node_state XML subtree; you can add -n/--no-children to return just the node_state element itself. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] VirtualDomain craziness

2021-04-28 Thread Ken Gaillot
ceaa3a- > 3.3.1.21516.1.PTF.1182607.x86_64 > resource-agents-4.4.0+git57.70549516-3.12.1.x86_64 > > Regards, > Ulrich > > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.cluster

Re: [ClusterLabs] Stopping the last node with pcs

2021-04-28 Thread Ken Gaillot
know it's the last node and shut down without > complaint? It knows, it's just not sure you know :) pcs's design philosophy is to hand-hold users by default and give expert users --force. The idea in this case is that (especially in 3-to-5-node clusters) someone migh

Re: [ClusterLabs] Autostart/Enabling of Pacemaker and corosync

2021-04-26 Thread Ken Gaillot
at the problem. The automated approach leans more to quick self-recovery, and is more convenient in larger organizations where not every administrator that has access to the host for applying updates etc. is trained on the cluster software. -- Ken Gaillot __

Re: [ClusterLabs] Preventing multiple resources from moving at the same time.

2021-04-20 Thread Ken Gaillot
g is an optional ordering. If A is ordered after B optionally, and both have to move, A will stop, then B will stop, then B will start, then A will start. If only B has to move, it won't force A to restart. -- Ken Gaillot ___ Manage your subscription

Re: [ClusterLabs] Question about ping nodes

2021-04-19 Thread Ken Gaillot
ens. Apparently negative score completely prevents assigning > > resource to this node, and "should" here probably means "it is > still > > possible that final score may become positive". > > > > As it is not possible to refer to attributes of multiple nodes in a > > rule, you would need something that combines current pingd status > for > > individual nodes and makes it available. Logical place is > > ocf:pacemaker:ping resource agent itself. > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Question about ping nodes

2021-04-19 Thread Ken Gaillot
, and "should" here probably means "it is still > possible that final score may become positive". > > As it is not possible to refer to attributes of multiple nodes in a > rule, you would need something that combines current pingd status for > individual nodes and makes it available. Logical place is > ocf:pacemaker:ping resource agent itself. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Coming in Pacemaker 2.0.1: build-time default for resource-stickiness

2021-04-14 Thread Ken Gaillot
set the configured default via a rsc_defaults entry in new CIBs, so that it won't affect existing clusters or rolling upgrades (current users won't see behavior change), and unlike implicit defaults, users can query and remove resource defaults. -- Ken Gaillot

Re: [ClusterLabs] Single-node automated startup question

2021-04-14 Thread Ken Gaillot
er version of post_join_delay? > > B) is there a compelling argument NOT to use post_join_delay > > behaviour > > in pacemaker I am not seeing? > > > > Thanks! > > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] best practice for scripting

2021-04-13 Thread Ken Gaillot
ml has continued to use --as-xml, but will switch to --output-as=xml, thus Tomas's warning. > I will try to start with "pcs status xml" in the future. > > Regards, > tbskyd -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] VirtualDomain - monitor misses to report & plays up

2021-04-12 Thread Ken Gaillot
tch but, > naturally the obvious question would be - why that is even > allowed to happen? > many thanks, L. > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Fwd: Issue with resource-agents ocf:heartbeat:mariadb

2021-04-09 Thread Ken Gaillot
name="clone-max" value="2"/> > > name="master-max" value="1"/> > > name="notify" value="true"/> >

Re: [ClusterLabs] how to setup single node cluster

2021-04-09 Thread Ken Gaillot
use the same cluster management commands they're familiar with, and certain resources can always run at the DR site to keep it ready (e.g. shared storage or a database replicant). There are some ideas about making such a setup easier to manage, such as being able

Re: [ClusterLabs] Why my node1 couldn't back to the clustering chain?

2021-04-08 Thread Ken Gaillot
prefer a particular node, you can use location constraints to express that. However there is rarely a need to do so; in most clusters, nodes are equally interchangeable. > > On Thursday, April 8, 2021, 06:49:38 PM GMT+4:30, Ken Gaillot < > kgail...@redhat.com> wrote: > > &g

Re: [ClusterLabs] Why my node1 couldn't back to the clustering chain?

2021-04-08 Thread Ken Gaillot
he above status output, the web server is running on node2, using the IP address specified by the httpd_vip resource. Are you trying to contact the web server at a name corresponding to that IP? -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Re: cluster-recheck-interval and failure-timeout

2021-04-06 Thread Ken Gaillot
On Tue, 2021-04-06 at 09:15 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 31.03.2021 um > > > > 15:48 in > > Nachricht > <7dfc7c46442db17d9645854081f1269261518f84.ca...@redhat.com>: > > On Wed, 2021‑03‑31 at 14:32 +0200, Antony S

Re: [ClusterLabs] "iscsi.service: Unit cannot be reloaded because it is inactive."

2021-04-05 Thread Ken Gaillot
it, but it > couldn't start! > How to solve it? > > Thanks. > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] cluster-recheck-interval and failure-timeout

2021-03-31 Thread Ken Gaillot
interactions) and > the fail-count does not go away. > > > Does anyone have suggestions on how to debug this difference in > behaviour > between pacemaker 1.1.16 and 2.0.1, because at present it prevents me > being > able to upgrade an operational cluster, as the

Re: [ClusterLabs] cluster-recheck-interval and failure-timeout

2021-03-31 Thread Ken Gaillot
gt; seconds ago"? Unfortunately, there is no way. failure-timeout expires *all* failures once the *most recent* is that old. It's a bit counter-intuitive but currently, Pacemaker only remembers a resource's most recent failure and the total count of failures, and changing that would be a big project. > Thanks, > > > Antony. > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Colocation per site ?

2021-03-30 Thread Ken Gaillot
On Tue, 2021-03-30 at 08:01 +0300, Andrei Borzenkov wrote: > On 29.03.2021 20:12, Ken Gaillot wrote: > > On Sun, 2021-03-28 at 09:20 +0300, Andrei Borzenkov wrote: > > > On 28.03.2021 07:16, Strahil Nikolov wrote: > > > > I didn't mean DC as a designated

Re: [ClusterLabs] Antw: [EXT] Re: What a "high priority"?

2021-03-30 Thread Ken Gaillot
On Tue, 2021-03-30 at 08:26 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 29.03.2021 um > > > > 19:23 in > > Nachricht > : > > Scores are in the range ‑1,000,000 to +1,000,000 (also known as > > "infinity"). > > > &

Re: [ClusterLabs] Antw: [EXT] Colocation per site ?

2021-03-30 Thread Ken Gaillot
re="INFINITY" with-rsc- > > role="Master" node-attribute="hana__site"/> > > > > On Mon, Mar 29, 2021 at 9:07 PM Strahil Nikolov < > > hunter86...@yahoo.com> wrote: > > > Hi Ken, can you provide a prototype code example. > &g

Re: [ClusterLabs] What a "high priority"?

2021-03-29 Thread Ken Gaillot
_ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: Re: Antw: [EXT] staggered resource start/stop

2021-03-29 Thread Ken Gaillot
anks for help! > > > > ___ > > > > Manage your subscription: > > > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > > > > > ClusterLabs home: https://www.clusterlabs.org/ > > &g

Re: [ClusterLabs] Antw: [EXT] Colocation per site ?

2021-03-29 Thread Ken Gaillot
e essentially duplicating your RA > > functionality because you will somehow need to find out which site > > is > > currently primary. > > > > There is no guarantee that pacemaker DC wil be on the same site as > > SAP > > HANA primary system. > > > > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] [EXT] Re: Feedback wanted: OCF Resource Agent API 1.1 proposed for adoption

2021-03-29 Thread Ken Gaillot
I've made a note of these as ideas for 1.2/2.0 :) On Sun, 2021-03-28 at 03:03 +0200, Ulrich Windl wrote: > On 3/26/21 11:17 PM, Ken Gaillot wrote: > > OCF 1.1 is now formally adopted! > > > > https://github.com/ClusterLabs/OCF-spec/blob/master/ra/1.1/resource-agent-a

Re: [ClusterLabs] Feedback wanted: OCF Resource Agent API 1.1 proposed for adoption

2021-03-26 Thread Ken Gaillot
OCF 1.1 is now formally adopted! https://github.com/ClusterLabs/OCF-spec/blob/master/ra/1.1/resource-agent-api.md Thanks to everyone who gave feedback. Now to add support for it ... On Tue, 2021-03-09 at 17:07 -0600, Ken Gaillot wrote: > Hi all, > > After many false starts over the

<    1   2   3   4   5   6   7   8   9   10   >