Re: [ClusterLabs] fence_mpath and failed IP

2020-03-30 Thread Ken Gaillot
disk fencing with network access fencing via a smart switch. However there is a bug with that setup. I'm not sure what people have traditionally done about the problem. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Fedora 31 - systemd based resources don't start

2020-03-30 Thread Ken Gaillot
rst. > > > > > > SELinux is disabled: > > > > > > # getenforce > > > Disabled > > > > > > All systemd services controlled by the cluster are disabled from > > > starting at boot: > > > > > > # systemctl is-enabled http

Re: [ClusterLabs] Default resource stickiness issue with colocation constraint

2020-03-31 Thread Ken Gaillot
On Tue, 2020-03-31 at 07:37 +0300, Strahil Nikolov wrote: > On March 31, 2020 6:01:35 AM GMT+03:00, Ken Gaillot < > kgail...@redhat.com> wrote: > > On Sun, 2020-03-08 at 18:11 +, Strahil Nikolov wrote: > > > Hello All, > > > > > > can someone help

Re: [ClusterLabs] fence_mpath and failed IP

2020-03-31 Thread Ken Gaillot
On Tue, 2020-03-31 at 08:56 +0300, Andrei Borzenkov wrote: > 31.03.2020 05:56, Ken Gaillot пишет: > > On Sat, 2020-02-22 at 03:50 +0200, Strahil Nikolov wrote: > > > Hello community, > > > > > > Recently I have started playing with fence_mpath and I have &g

Re: [ClusterLabs] Resource Parameter Change Not Honoring Constraints

2020-04-01 Thread Ken Gaillot
On Thu, 2020-03-19 at 13:39 -0400, Marc Smith wrote: > On Mon, Mar 16, 2020 at 1:26 PM Marc Smith > wrote: > > > > On Thu, Mar 12, 2020 at 10:51 AM Ken Gaillot > > wrote: > > > > > > On Wed, 2020-03-11 at 17:24 -0400, Marc Smith wrote: > > >

Re: [ClusterLabs] Retrofit MySQL with pacemaker?

2020-05-04 Thread Ken Gaillot
d take over deciding which host is primary - add a colocation constraint for the IP with the mysql master role - drop the IP on the old primary, and edit your cluster IP resource to have the correct IP address; the cluster should drop the dummy IP and add the live one on the new primary - everything we

Re: [ClusterLabs] Adding node in existing cluster pcs constraint not setting properly

2020-05-15 Thread Ken Gaillot
ollowed the right procedure. If not, > kindly suggest an alternative. > > Thanks and Regards, > S Sathish S Configuration can be done from any node and will be sync'd to all nodes, but the nodes have to form a corosync membership first. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] resource-agent stack has dependencies with samba

2020-05-15 Thread Ken Gaillot
/ClusterLabs/resource-agents > > Thanks and Regards, > S Sathish It depends on samba so that the Filesystem resource can mount samba volumes. If you don't use that capability, you can safely remove samba. -- Ken Gaillot ___ M

Re: [ClusterLabs] Parallel execution of resources in resource group

2020-05-15 Thread Ken Gaillot
s crm_resource --move -r rsc1 you can add this XML to the configuration: > I will need to test the behavior of cluster while moving, clearing, > cleanup,.. > All my co-workers are used to "resource/service groups" as reference > points, so I will need to change the pro

Re: [ClusterLabs] Antw: Antw: [EXT] Re: resource-agent stack has dependencies with samba

2020-05-18 Thread Ken Gaillot
> "Ulrich Windl" schrieb am > > > > 18.05.2020 > > um > 08:01 in Nachricht <5ec2249f02a100039...@gwsmtp.uni-regensburg.de > >: > > > > > Ken Gaillot schrieb am 16.05.2020 um > > > > > 00:09 in > > > > Nachri

Re: [ClusterLabs] Coming in Pacemaker 2.0.4: crm_mon --include/--exclude

2020-03-18 Thread Ken Gaillot
On Wed, 2020-03-18 at 11:14 +0100, wf...@niif.hu wrote: > Ken Gaillot writes: > > > The crm_mon tool for showing cluster status will have --include and > > -- > > exclude options to pick and choose which types of information you > > want > > it to display. &

[ClusterLabs] Solidarity during these extraordinary times

2020-03-18 Thread Ken Gaillot
at the time, it is important. Open source has always been about more than creating and using software. It is about community, and how we can accomplish more together. Wishing you and your loved ones the best, -- Ken Gaillot ___ Manage your subscription: https

Re: [ClusterLabs] Q: All the versions

2020-03-18 Thread Ken Gaillot
t that's probably not a good idea due to backward compatibility issues. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Coming in Pacemaker 2.0.4: crm_mon --include/--exclude

2020-03-18 Thread Ken Gaillot
able to just specify something on the command line directly. The advantage is it lets you save common groupings for easy reuse and has the potential to let the same groupings be used with other tools in the future. -- Ken Gaillot ___ Manage your subscr

Re: [ClusterLabs] Antw: [EXT] Re: Q: All the versions

2020-03-19 Thread Ken Gaillot
On Thu, 2020-03-19 at 15:26 +0100, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 18.03.2020 um > > > > 17:59 in > > Nachricht > <5851_1584550779_5E72537B_5851_802_1_3128245adeaa62fd81c983c301a5acf3 > 6e59ff62.ca > e...@redhat.com>: > > On

Re: [ClusterLabs] Q: Implementing "reload" operation

2020-03-19 Thread Ken Gaillot
arameter that is required, and all the parsing and validation > checking would happen in the RA. An ugly solution! > > (Such issues appear if a resource instance does reflect some kind of > setting (like a firewall rule) instead of a process that is running) > > Regards, > Ulrich -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Coming in Pacemaker 2.0.4: crm_mon --include/--exclude

2020-03-17 Thread Ken Gaillot
-friendly). -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Re: Serialize and symmetrical=true does not work together

2020-03-17 Thread Ken Gaillot
On Tue, 2020-03-17 at 07:45 +0100, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 17.03.2020 um > > > > 00:58 in > > Nachricht > <1227_1584403147_5E7012CB_1227_2106_1_3a13cd73f8738e6967d6b1c399e994a > 1155495c3.c > m...@redhat.com>: > >

[ClusterLabs] Coming in Pacemaker 2.0.4: fencing delay based on what resources are where

2020-03-21 Thread Ken Gaillot
to recognize the primary authors of the 2.0.4 features announced so far: - shutdown locks: myself - switch to clock_gettime() for monotonic clock: Jan Pokorný - crm_mon --include/--exclude: Chris Lumens - priority-fencing-delay: Gao,Yan -- Ken Gaillot

Re: [ClusterLabs] I want to have some resource monitored and based on that make an acton. Is it possible?

2020-03-10 Thread Ken Gaillot
ted > > again. > > > > Also you can consider a colocation rule that all apps are > > started where the master DB is running - so the lattency will > > be minimal. > > > > Best Regards, > > Strahil Nikolov > > ___

Re: [ClusterLabs] Serialize and symmetrical=true does not work together

2020-03-16 Thread Ken Gaillot
rtified Enterprise Architect > IBM Services for Managed Applications > +91 98450 22258 Mobile > dilen...@in.ibm.com > > IBM Services > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/use

Re: [ClusterLabs] Parallel execution of resources in resource group

2020-05-07 Thread Ken Gaillot
would > start in parallel and rely on Ordering constraints, not their > resource group order? > We have many logical resource groups, so we don't want to have > resources without being added to any resource group. > > Regards > > Jan -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Tuchanka

2020-09-03 Thread Ken Gaillot
illaume noted, there are other cluster test platforms already, but none of them really cover everybody's desired scenarios (or is easily extensible). -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Tuchanka

2020-09-03 Thread Ken Gaillot
On Thu, 2020-09-03 at 18:10 +0200, Jehan-Guillaume de Rorthais wrote: > On Thu, 03 Sep 2020 10:58:54 -0500 > Ken Gaillot wrote: > > [...] there are other cluster test platforms already, but none of > > them really > > cover everybody's desired scenarios (or is easily ext

Re: [ClusterLabs] ovndb-servers resource agent doesn't work on pcs 0.9.164

2020-09-03 Thread Ken Gaillot
thing on my > configuration? > > > Regards Check the system log and pacemaker detail log for errors. You can also try "crm_resource --why -r ovndb_servers" to see if there's an obvious reason it's stopped. If none of that helps, try "pcs resource debug- start ovndb_servers --full" on one node to see if that gives additional info (that will launch the resource outside pacemaker's control, so it's a good idea to unmanage it in pacemaker first). -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] attrd/cib out of sync, master scores not updated in CIB after crmd "Respawn" after internal error [NOT cluster partition/rejoin]

2020-09-10 Thread Ken Gaillot
e. > > But that apparently will never reach the CIB. > > > > So. > > Question is: anyone seen anything like that before? > > Could that be fixed already? > > Version in that scenario was: 1.1.20+ (almost .21). > > > > Obviously "stonith&qu

[ClusterLabs] Coming in Pacemaker 2.0.5: limit crm_mon display to specified resources

2020-09-09 Thread Ken Gaillot
history. Happy clustering! -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Data location

2020-09-08 Thread Ken Gaillot
tion and data are shared. Another possibility is to create containers or virtual machines for each service, then use shared storage for the images. You can use bundles (for containers) or Pacemaker Remote guest nodes (for VMs) to monitor the service inside. -- Ken Gaillot

Re: [ClusterLabs] mess in the CIB

2020-10-07 Thread Ken Gaillot
, but you can modify the XML with them or cibadmin). There will still be downtime as pacemaker will see that as deleting one resource and adding another, so it will be restarted. > These domains can be stopped for a short time. > > Bernd > Helmholtz Zentrum München -- Ken Gaillot

Re: [ClusterLabs] mess in the CIB

2020-10-06 Thread Ken Gaillot
vm_snipanalysis-instance_attributes-5-migration_transport"/> > > >id="vm_snipanalysis-instance_attributes-6-migrate_options"/> > > >id="vm_snipanalysis-start-0-0"/> >id="vm_snipanalysis-stop-0-0"/> >id="vm_snipanalysis-monitor-30-0"/> >id="vm_snipanalysis-migrate_from-0-0"/> >id="vm_snipanalysis-migrate_to-0-0"/> > > >id="vm_snipanalysis-meta_attributes-0-allow-migrate"/> >id="vm_snipanalysis-meta_attributes-0-target-role"/> >id="vm_snipanalysis-meta_attributes-0-is-managed"/> >id="vm_snipanalysis-meta_attributes-0-maintenance"/> > > > > The config of vm_snipanalysis seems to be ok. > But vm_ssh ... why are some instance-attributes of it named with > snapanalysis? > I didn't change the configuration of both in the last weeks. It's unlikely that changed at any time; more likely it was created like that. Whatever was used to create the initial configuration would be where to look for clues. As long as the IDs are unique, their content doesn't matter to pacemaker, so it's just a cosmetic issue. > > Does anyone have a clue ? > Thanks. > > Bernd > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Behavior of corosync kill

2020-08-25 Thread Ken Gaillot
t; pacemaker-1.1.19-8.el7.x86_64 > centos 7.6.1810 > > Thanks, > Rohit > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manag

Re: [ClusterLabs] Antw: [EXT] Stonith failing

2020-08-17 Thread Ken Gaillot
On Mon, 2020-08-17 at 22:39 +0200, Jehan-Guillaume de Rorthais wrote: > On Mon, 17 Aug 2020 10:19:45 -0500 > Ken Gaillot wrote: > > > On Fri, 2020-08-14 at 15:09 +0200, Gabriele Bulfon wrote: > > > Thanks to all your suggestions, I now have the systems with > > >

Re: [ClusterLabs] node utilization attributes are lost during upgrade

2020-08-17 Thread Ken Gaillot
d when a resource must be > stopped/cannot > be started because the utilization constrains cannot be satisfied. > > Best regards, > Jozsef > -- > E-mail : kadlecsik.joz...@wigner.hu > PGP key: https://wigner.hu/~kadlec/pgp_public_key.txt > Address: Wigner Research Centre f

Re: [ClusterLabs] Antw: [EXT] Stonith failing

2020-08-18 Thread Ken Gaillot
On Tue, 2020-08-18 at 08:21 +0200, Klaus Wenninger wrote: > On 8/18/20 7:49 AM, Andrei Borzenkov wrote: > > 17.08.2020 23:39, Jehan-Guillaume de Rorthais пишет: > > > On Mon, 17 Aug 2020 10:19:45 -0500 > > > Ken Gaillot wrote: > > > > > > > On

Re: [ClusterLabs] node utilization attributes are lost during upgrade

2020-08-18 Thread Ken Gaillot
On Tue, 2020-08-18 at 14:35 +0200, Kadlecsik József wrote: > Hi, > > On Mon, 17 Aug 2020, Ken Gaillot wrote: > > > On Mon, 2020-08-17 at 12:12 +0200, Kadlecsik József wrote: > > > > > > At upgrading a corosync/pacemaker/libvirt/KVM cluster from > &

Re: [ClusterLabs] node utilization attributes are lost during upgrade

2020-08-18 Thread Ken Gaillot
n corosync.conf before the upgrade, so they don't change. > > Best Regards, > Strahil Nikolov > > На 18 август 2020 г. 17:15:49 GMT+03:00, Ken Gaillot < > kgail...@redhat.com> написа: > > On Tue, 2020-08-18 at 14:35 +0200, Kadlecsik József wrote: > > > H

Re: [ClusterLabs] why is node fenced ?

2020-08-18 Thread Ken Gaillot
ine: [ ha-idg-2 ] > > > vm_nextcloud (ocf::heartbeat:VirtualDomain): Stopped > > > > > > I don't understand why the cluster tries to stop a resource which > > > is > > > already stopped. > > Bernd > Helmholtz Zentrum München >

Re: [ClusterLabs] why is node fenced ?

2020-08-19 Thread Ken Gaillot
On Tue, 2020-08-18 at 12:30 -0500, Ken Gaillot wrote: > On Tue, 2020-08-18 at 16:47 +0200, Lentes, Bernd wrote: > > > > - On Aug 17, 2020, at 5:09 PM, kgaillot kgail...@redhat.com > > wrote: > > > > > > > > I checked all relevant pe-files in t

[ClusterLabs] Coming in Pacemaker 2.0.5: better start-up/shutdown coordination with sbd

2020-08-21 Thread Ken Gaillot
er or vice versa. Distributions may change the value to "yes" since they can ensure both sbd and pacemaker versions support it; users who build their own installations can set it themselves if both versions support it. -- Ken Gaillot ___ Mana

Re: [ClusterLabs] Active-Active cluster CentOS 8

2020-08-21 Thread Ken Gaillot
if there were any answers but I > cant find anything > > Thanks, > Mark Looks like it's a known CentOS packaging issue: https://bugs.centos.org/view.php?id=16939 -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.

Re: [ClusterLabs] why is node fenced ?

2020-08-19 Thread Ken Gaillot
heit und Umwelt (GmbH) > Ingolstaedter Landstr. 1 > 85764 Neuherberg > www.helmholtz-muenchen.de > Aufsichtsratsvorsitzende: MinDir.in Prof. Dr. Veronika von Messling > Geschaeftsfuehrung: Prof. Dr. med. Dr. h.c. Matthias Tschoep, Kerstin > Guenther > Registergericht: Amtsgeri

Re: [ClusterLabs] why is node fenced ?

2020-08-17 Thread Ken Gaillot
atsvorsitzende: MinDir.in Prof. Dr. Veronika von Messling > Geschaeftsfuehrung: Prof. Dr. med. Dr. h.c. Matthias Tschoep, Kerstin > Guenther > Registergericht: Amtsgericht Muenchen HRB 6466 > USt-IdNr: DE 129521671 -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] why is node fenced ?

2020-08-17 Thread Ken Gaillot
um München > > Helmholtz Zentrum Muenchen > Deutsches Forschungszentrum fuer Gesundheit und Umwelt (GmbH) > Ingolstaedter Landstr. 1 > 85764 Neuherberg > www.helmholtz-muenchen.de > Aufsichtsratsvorsitzende: MinDir.in Prof. Dr. Veronika von Messling > Geschaeftsfuehrung: Prof. Dr. med. Dr. h.c. Matthias Tschoep, Kerstin > Guenther > Registergericht: Amtsgericht Muenchen HRB 6466 > USt-IdNr: DE 129521671 > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] Stonith failing

2020-08-17 Thread Ken Gaillot
ion: > > > > > > > > > > > > > ssh based "stonith" cannot guarantee it. > > > > > > > node 1 will be perferred for pool 1, node 2 for pool 2, only in > > > > case one of the

[ClusterLabs] Pacemaker 1.1 series is now officially retired

2020-09-23 Thread Ken Gaillot
backporting fixes from Pacemaker 2 to the 1.1 branch in case anyone wants to make them readily available, but there will be no more official releases. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users

Re: [ClusterLabs] pacemaker and cluster hostname reconfiguration

2020-10-01 Thread Ken Gaillot
d this email by error, > please notify us immediately and delete this email from your system. > Email transmission cannot be guaranteed to be secured or error-free > or not to contain viruses. Athonet S.r.l. processes any personal data > exchanged in email correspondence in accordance with EU Reg. 679/2016 > (GDPR) - you may find here the privacy policy with information on > such processing and your rights. Any views or opinions presented in > this email are solely those of the sender and do not necessarily > represent those of Athonet S.r.l. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] How to stop removed resources when replacing cib.xml via cibadmin or crm_shadow

2020-10-01 Thread Ken Gaillot
Create shadow replace cib.xml with/without status and commit. > > Indeed crm_simulate -LS shows intention to stop vip-1.1.1.1, but in > > fact it > > will not after shadow commit. > > > > Sometimes I can manage to automatically clear removed/replaced VIP > > addresses f

[ClusterLabs] Coming in Pacemaker 2.1.0 (!)

2020-10-01 Thread Ken Gaillot
I compatibility in 2.1.0, so there would be no need to keep the 2.0 series alive with backports. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] attrd/cib out of sync, master scores not updated in CIB after crmd "Respawn" after internal error [NOT cluster partition/rejoin]

2020-09-17 Thread Ken Gaillot
On Tue, 2020-09-15 at 13:25 +0200, Lars Ellenberg wrote: > On Fri, Sep 11, 2020 at 11:42:46AM +0200, Lars Ellenberg wrote: > > On Thu, Sep 10, 2020 at 11:18:58AM -0500, Ken Gaillot wrote: > > > > But for some unrelated reason (stress on the cib, IPC timeout), > > &

[ClusterLabs] Coming in Pacemaker 2.0.5: integer or floating-point comparisons for node attribute rules

2020-09-17 Thread Ken Gaillot
"integer" to compare numerically. However, no one must have used that, since the configuration and code actually only accepted "number"! In 2.0.5, not only are we fixing that, but you will be able to specify "integer" for 64-bit integer comparisons or "numbe

Re: [ClusterLabs] Antw: [EXT] How to stop removed resources when replacing cib.xml via cibadmin or crm_shadow

2020-10-02 Thread Ken Gaillot
On Fri, 2020-10-02 at 21:35 +0300, Igor Tverdovskiy wrote: > > > On Thu, Oct 1, 2020 at 5:55 PM Ken Gaillot > wrote: > > There's no harm on the Pacemaker side in doing so. > > > > A resource that's running but removed from the configuration is > >

Re: [ClusterLabs] Determine a resource's current host in the CIB

2020-09-24 Thread Ken Gaillot
ot; > > > > exit-reason="" on_node="mk-a02n02" call-id="61" rc-code="0" > > > > op-status="0" interval="6" last-rc-change="1600925173" > > > > exec-time="539" queue-time="0" > > > > op-digest

[ClusterLabs] Pacemaker 1.1.23-rc1, and the future of Pacemaker 1.1

2020-05-27 Thread Ken Gaillot
on from the 1.1 series. My plan is to do one final 1.1 release at the end of this year. We could still accept backports after that time if anyone wants to keep using the 1.1 branch, but we wouldn't do any more releases, and would reduce or stop 1.1 testing. -- Ken Gaillot

Re: [ClusterLabs] Adding a node to an active cluster

2020-10-21 Thread Ken Gaillot
> They automate all steps needed to make cluster recognize new > > node > > online. > > > > > 2. which config file crm_node command reads? > > > > > > > CIB > > _________

[ClusterLabs] FYI: Pacemaker vulnerability CVE-2020-25654

2020-10-27 Thread Ken Gaillot
. It will also be fixed in the 1.1 branch along with a 1.1.24-rc1 release that includes just this. I will also post patches for the 2.0.3 and 2.0.4 releases to the develop...@clusterlabs.org list. -- Ken Gaillot ___ Manage your subscription: https

Re: [ClusterLabs] pacemaker systemd resource

2020-07-22 Thread Ken Gaillot
e#stop_0[node2.local]: (unset) -> INFINITY > Jul 21 15:53:42 node2.local pacemaker-attrd[1809]: notice: Setting > last-failure-dummy.service#stop_0[node2.local]: (unset) -> > 1595336022 > Jul 21 15:53:42 node2.local systemd[1]: dummy.service: Succeeded. > Jul 21 15:53

Re: [ClusterLabs] Antw: [EXT] Coming in Pacemaker 2.0.5: finer control over resource and operation defaults

2020-08-04 Thread Ken Gaillot
On Fri, 2020-07-24 at 09:15 +0200, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 23.07.2020 um > > > > 23:54 in > > Nachricht > <99c11c73d59560fccd472d09c3b76073dab1b73e.ca...@redhat.com>: > > Hi all, > > > > Pacemaker 2.0.4 is

[ClusterLabs] Coming in Pacemaker 2.0.5: on-fail=demote / no-quorum-policy=demote

2020-08-10 Thread Ken Gaillot
may be useful in a demoted role even if there is no quorum. A database that operates read-only when demoted and doesn't depend on any non-promotable resources might be an example. Happy clustering :) -- Ken Gaillot ___ Manage your s

Re: [ClusterLabs] Automatic recover from split brain ?

2020-08-10 Thread Ken Gaillot
> is currently assigned to two different hosts... > > > Can you help me configuring the cluster correctly so this cannot > occurs ? > > > Thanks in advance, > > Adam. > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/ma

[ClusterLabs] Coming in Pacemaker 2.0.5: on-fail=demote / no-quorum-policy=demote

2020-08-12 Thread Ken Gaillot
and doesn't depend on any non-promotable resources might be an example. Happy clustering :) -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] why is node fenced ?

2020-08-10 Thread Ken Gaillot
; although > > > "Jul 20 17:04:06 [23768] ha-idg-1 crmd: notice: > > > process_lrm_event: Result of stop operation for > > > vm_nextcloud on > > > ha-idg-1: 0 (ok) | call=3197 key=vm_nextcloud_stop_0 > > > confirmed=true > > > ci

Re: [ClusterLabs] Users Digest, Vol 44, Issue 11

2020-07-02 Thread Ken Gaillot
igure it out. Unfortunately the > creator and my mentor is dearly departed and, in times like this, > sorely missed.) My condolences ... > Any replies will be read and responded to early tomorrow AM. thanks > for understanding. > -- > Jeff Westgate -- Ken Gaillot ___

Re: [ClusterLabs] About the log indicating RA execution

2020-07-02 Thread Ken Gaillot
ult of start operation for dummy1 on r81-1: ok (in detail log) Received result of start operation for dummy1 on r81-1: ok | Transition 2 action 7 (dummy1_start_0) rc=0 call-id=10 > What do you think about this? (Do you have a better idea?) > > Best Regards, > Kazunori INOUE -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Users Digest, Vol 44, Issue 11

2020-07-02 Thread Ken Gaillot
LOL, somehow I clicked on an ancient message in my list folder ... well the advice stands if anyone has a similar issue ;) I plead a migraine, they make me miss little details like dates ... On Thu, 2020-07-02 at 09:45 -0500, Ken Gaillot wrote: > On Thu, 2018-09-06 at 00:59 +, Jeff

Re: [ClusterLabs] ethmonitor resource - for iface which does not exist yet - how?

2020-06-30 Thread Ken Gaillot
On Tue, 2020-06-30 at 15:09 +0100, lejeczek wrote: > > On 09/06/2020 15:22, Ken Gaillot wrote: > > On Wed, 2020-06-03 at 12:33 +0100, lejeczek wrote: > > > hi guys > > > > > > I wonder about an idea of 'ethmonitor' watching a net iface > > >

Re: [ClusterLabs] Still Beginner STONITH Problem

2020-07-02 Thread Ken Gaillot
stonith_cfg stonith create stonith_id_1 external/libvirt > hostlist="Host4,host2" > hypervisor_uri="qemu+ssh://192.168.1.21/system" > > > But as you can see in in the pcs status output, stonith is stopped > and > exits with an unkown error. > > Can

Re: [ClusterLabs] Antw: [EXT] Failed fencing monitor process (fence_vmware_soap) RHEL 8

2020-06-18 Thread Ken Gaillot
edulerd[26725] (unpack_rsc_op_failure) warning: > > Processing > > failed start of vmfence on srv1: OCF_TIMEOUT | rc=198 > > /var/log/pacemaker/pacemaker.log:Jun 17 08:34:36 srv1 > > pacemaker-schedulerd[26725] (check_migration_threshold)

Re: [ClusterLabs] Antw: [EXT] Failed fencing monitor process (fence_vmware_soap) RHEL 8

2020-06-18 Thread Ken Gaillot
On Thu, 2020-06-18 at 21:32 +0300, Andrei Borzenkov wrote: > 18.06.2020 18:24, Ken Gaillot пишет: > > Note that a failed start of a stonith device will not prevent the > > cluster from using that device for fencing. It just prevents the > > cluster from monitoring the

Re: [ClusterLabs] Antw: [EXT] Failed fencing monitor process (fence_vmware_soap) RHEL 8

2020-06-22 Thread Ken Gaillot
a node to execute the device, but that is planned. The priority list for selecting a node to execute a device is described above. For selecting between multiple fence devices when there is no topology, there is a priority meta-attribute for stonith devices, but

Re: [ClusterLabs] Beginner with STONITH Problem

2020-06-24 Thread Ken Gaillot
t; exec=74ms > > > > > > Daemon Status: > > corosync: active/disabled > > pacemaker: active/disabled > > pcsd: active/enabled > > > > > > > > I have researched the shown dlm Problem but eve

Re: [ClusterLabs] Antw: [EXT] Suggestions for multiple NFS mounts as LSB script

2020-06-29 Thread Ken Gaillot
ectory for the https/ftps > file > server operations should be operational, or else it's all moot. > > Is ocf_tester still available? I installed via 'yum' from the High > Availability repository and don't see it. I also did a 'yum > whatprovides *bin/ocf-tester' and no

Re: [ClusterLabs] How to reload pacemaker_remote service?

2020-06-09 Thread Ken Gaillot
gt; anyone know what signal needs to get sent to the > pacemaker_remoted > > service to reload its config? Sending a SIGHUP appears to kill > the > > process. > > > > Thanks for any help! > > Mike > -- > Ken Gaillot > > __

Re: [ClusterLabs] How to reload pacemaker_remote service?

2020-06-09 Thread Ken Gaillot
now what signal needs to get sent to the pacemaker_remoted > service to reload its config? Sending a SIGHUP appears to kill the > process. > > Thanks for any help! > Mike -- Ken Gaillot ___ Manage your subscription: https://lists.clusterl

Re: [ClusterLabs] ethmonitor resource - for iface which does not exist yet - how?

2020-06-09 Thread Ken Gaillot
ethmonitor- by default). You can then use location constraints with an attribute-based rule to keep resources where the interface is. See the man page for examples: https://www.mankier.com/7/ocf_heartbeat_ethmonitor -- Ken Gaillot ___ Manage your s

Re: [ClusterLabs] custom cluster module

2020-06-11 Thread Ken Gaillot
to override". the script supports meta-data|metadata|meta_data. I'm > not sure how to know what is valid metadat. I know i saw a utility to > check the scripts, but that does not appear to be installed/available > on redhat. -- Ken Gaillot

Re: [ClusterLabs] Resource start and stop run into timeout

2020-06-05 Thread Ken Gaillot
89965 > Jun 04 13:48:38 node1 pacemaker-execd [1159] > (action_complete) > notice: Giving up on nmb stop (rc=0): timeout (elapsed=647561ms, > remaining=-47561ms) > Jun 04 13:48:38 node1 pacemaker-based [1157] > (cib_p

Re: [ClusterLabs] pacemaker startup problem

2020-07-24 Thread Ken Gaillot
. > Jul 24 18:21:42 [968] stonith-ng: warning: stonith_ipc_server_init: > Verify pacemaker and pacemaker_remote are not both enabled. > > Any idea what's happening? > Gabriele > > > > > Sonicle S.r.l. : http://www.sonicle.com > Music: http://www.gabrielebul

Re: [ClusterLabs] why is node fenced ?

2020-07-29 Thread Ken Gaillot
runnable' > > > Why does it say "Jul 20 17:05:35 [10690] ha-idg- > 2pengine: warning: custom_action: Action vm_nextcloud_stop_0 > on ha-idg-1 is unrunnable (offline)" although > "Jul 20 17:04:06 [23768] ha-idg-1 crmd: notice: > process_lrm_event: Result of stop operation for vm_nextcloud on > ha-idg-1: 0 (ok) | call=3197 key=vm_nextcloud_stop_0 confirmed=true > cib-update=5960" > says that stop was ok ? > > > Bernd > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Coming in Pacemaker 2.0.5: finer control over resource and operation defaults

2020-07-23 Thread Ken Gaillot
, or ocf:heartbeat:podman) and if appropriate, IP resources (ocf:heartbeat:IPaddr2). Previously, there was no way to directly affect these resources, but with these new expressions you can at least configure defaults that apply to them, without having to use those same defaults for all your resources. -- Ken Gaillot

Re: [ClusterLabs] pacemaker systemd resource

2020-07-22 Thread Ken Gaillot
On Wed, 2020-07-22 at 17:04 +0300, Andrei Borzenkov wrote: > > > On Wed, Jul 22, 2020 at 4:58 PM Ken Gaillot > wrote: > > On Wed, 2020-07-22 at 10:59 +0300, Хиль Эдуард wrote: > > > Hi there! I have 2 nodes with Pacemaker 2.0.3, corosync 3.0.3 on > > > ubu

Re: [ClusterLabs] Maximum cluster size with Pacemaker 2.x and Corosync 3.x, and scaling to hundreds of nodes

2020-07-30 Thread Ken Gaillot
what are the best design approaches, especially if there is no > clear hierarchy to the nodes in use (i.e. all of the hosts are > important!). > > Are there performance implications when comparing the operation of a > pacemaker remote node to a full stack pacemaker n

Re: [ClusterLabs] Antw: Re: Antw: [EXT] Re: A bug? (SLES15 SP2 with "crm resource refresh")

2021-01-11 Thread Ken Gaillot
On Mon, 2021-01-11 at 16:31 +0100, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 11.01.2021 um > > > > 15:46 in > > Nachricht > <3df79a20eb4440357759cca4fe5b0e0729e47085.ca...@redhat.com>: > > On Mon, 2021-01-11 at 08:25 +0100, Ulrich Win

Re: [ClusterLabs] Antw: [EXT] Re: A bug? (SLES15 SP2 with "crm resource refresh")

2021-01-11 Thread Ken Gaillot
On Mon, 2021-01-11 at 08:25 +0100, Ulrich Windl wrote: > > > > Ken Gaillot schrieb am 08.01.2021 um > > > > 17:38 in > > Nachricht > <662b69bff331fae41771cf8833e819c2d5b18044.ca...@redhat.com>: > > On Fri, 2021‑01‑08 at 11:46 +0100, Ulrich Windl wrot

[ClusterLabs] Coming in Pacemaker 2.1.0: newer versions of build dependencies

2021-01-11 Thread Ken Gaillot
s.org/wiki/Pacemaker_2.1_Changes -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Configuring millisecond timestamps in pacemaker.log.

2021-01-11 Thread Ken Gaillot
n corosync.conf. I was hoping > Pacemaker has something similar but I don't see anything in > '/etc/sysconfig/pacemaker' or the Pacemaker documentation regarding > hi-res timestamps. > > Gerry Sommerville > Db2 Development, pureScale Domain > E

Re: [ClusterLabs] Antw: [EXT] Final Pacemaker 2.0.5 release now available

2020-12-03 Thread Ken Gaillot
Of cource: Likewise for the nodes > > > > > > clones and master/slave probably would need some special care. > > > > > > Opinions on that? > > > > > > Regards, > > > Ulrich > > > > > > > > > ___ > > > Manage your subscription: > > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > > > ClusterLabs home: https://www.clusterlabs.org/ > > > > > > > > > -- > > Regards, > > > > Reid Wahl, RHCA > > Senior Software Maintenance Engineer, Red Hat > > CEE - Platform Support Delivery - ClusterHA -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] resource management of standby node

2020-12-09 Thread Ken Gaillot
On Wed, 2020-12-09 at 10:57 +0800, Roger Zhou wrote: > On 12/1/20 4:03 PM, Ulrich Windl wrote: > > > > > Ken Gaillot schrieb am 30.11.2020 um > > > > > 19:52 in Nachricht > > > > : > > > > ... > > > > > > Th

Re: [ClusterLabs] Antw: [EXT] Re: resource management of standby node

2020-11-30 Thread Ken Gaillot
n active, > pacemaker will probe and discover them when existing maintenance > mode. Currently, Pacemaker can't detect renames as such. It will consider the old name as an orphan resource that must be stopped, and the new name as a new resource to be started. > > > > Maybe I should have d

Re: [ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ken Gaillot
defaults to 2 seconds. The best thing would be to do some manual testing using ipmitool or whatnot to turn off the power, and observe how long it takes between when the command returns and the server actually is powered down. Then set power_wait to a comfortable margin above that. Or just keep raising power_wait until the problem goes away :) -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Q: crm_mon "Statck:" columns

2020-12-17 Thread Ken Gaillot
gt; use_mgmtd: yes > } > > and then service-directive parameters are mandatory sections for > configurations? > > best regards. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Q: validate for VirtualDomain

2020-12-10 Thread Ken Gaillot
rich > > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] A word of warning regarding VirtualDomain and utilization

2020-12-11 Thread Ken Gaillot
em statically instead. (It might also be a good RFE for NodeUtilization to offer a CPU factor parameter, e.g. a factor of 10 would mean it would set the node attribute to 10 times the actual number.) > Regards, > Ulrich > > > > > >

Re: [ClusterLabs] Antw: [EXT] Recoveing from node failure

2020-12-11 Thread Ken Gaillot
opped > > > > I tried restarting zpool_data or other resources: > > > > # crm resource start zpool_data > > > > but nothing happens! > > > > How can I recover from this state? Node2 needs to stay down, > > > but I want

Re: [ClusterLabs] Running shell command on remote node via corosync messaging infrastructure

2020-12-18 Thread Ken Gaillot
o either way. :) Of course, you can configure sshd to listen on the cluster interface. If you give the cluster interface on each node a unique name in DNS (or hosts or whatever), you can ssh to that name. -- Ken Gaillot ___ Manage your subscription: https://l

Re: [ClusterLabs] query on pacemaker monitor timeout

2020-12-21 Thread Ken Gaillot
pted behavior or our resource operation setting is invalid.(refer > above config settings).? > 2)Any other parameter that can help to avoid this issue..? > > Thanks and Regards, > S Sathish S -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Antw: [EXT] delaying start of a resource

2020-12-16 Thread Ken Gaillot
is > > > done? Or how can I just delay the resource start so I can make it > > larger than > > > its pcmk_delay_base? > > > > We probably need to see logs and configs to understand. > > > > > > > &g

Re: [ClusterLabs] Antw: Re: Antw: [EXT] Recoveing from node failure

2020-12-16 Thread Ken Gaillot
the second node? > > > > ___ > > > > Manage your subscription: > > > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > > > > > ClusterLabs home: https://www.clusterlabs.org/ > &g

Re: [ClusterLabs] Antw: Another word of warning regarding VirtualDomain and Live Migration

2020-12-16 Thread Ken Gaillot
urce > > > refresh" (rebprobe) the cluster tried to fix the problem. > > > Well at some point the VM wouldn't start any more, because the > > > BtrFS used > > > for all (SLES default) was corrupted in a way that seems > > > unrecoverable, > > > independenlty of how many subvolumes and snapshots of those may > > > exist. > > > > > > Initially I would guess the libvirt stack and VirtualDomain is > > > less > > > > reliable > > > than the old Xen method and RA. > > > > > > Regards, > > > Ulrich > > > > > > > > > > > > > > > > > > > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Best way to create a floating identity file

2020-12-16 Thread Ken Gaillot
On Wed, 2020-12-16 at 04:46 -0500, Tony Stocker wrote: > On Tue, Dec 15, 2020 at 12:29 PM Ken Gaillot > wrote: > > > > On Tue, 2020-12-15 at 17:02 +0300, Andrei Borzenkov wrote: > > > On Tue, Dec 15, 2020 at 4:58 PM Tony Stocker < > > > akostoc...@gmail.c

Re: [ClusterLabs] Best way to create a floating identity file

2020-12-15 Thread Ken Gaillot
cate it with the workload resources. Or you could write a systemd timer unit to call your script when desired, and colocate that with the workload as a systemd resource in the cluster. Or similar to the crm_resource method, you could colocate an oc

<    8   9   10   11   12   13   14   15   16   17   >