[ClusterLabs] FYI: clusterlabs.org planned outages

2024-05-07 Thread Ken Gaillot
Hi all, We are in the process of changing the OS on the servers used to run the clusterlabs.org sites. There is an expected outage of all services from 4AM to 9AM UTC this Thursday. If problems arise, there may be more outages later Thursday and Friday. -- Ken Gaillot

Re: [ClusterLabs] Fast-failover on 2 nodes + qnetd: qdevice connenction disrupted.

2024-05-06 Thread Ken Gaillot
On Mon, 2024-05-06 at 10:05 -0500, Ken Gaillot wrote: > On Fri, 2024-05-03 at 16:18 +0300, ale...@pavlyuts.ru wrote: > > Hi, > > > > > > Thanks great for your suggestion, probably I need to think > > > > about > > > > this > > > &

Re: [ClusterLabs] Fast-failover on 2 nodes + qnetd: qdevice connenction disrupted.

2024-05-06 Thread Ken Gaillot
on. Ideally there would be some way > > to request > > the VM to be immediately destroyed (whether via fence_xvm, a cloud > > provider > > API, or similar). > What you mean by "destroyed"? Mean get down? Correct. For fencing purposes, it should not be a clean shutdo

Re: [ClusterLabs] Fast-failover on 2 nodes + qnetd: qdevice connenction disrupted.

2024-05-02 Thread Ken Gaillot
mind all the above is from my common sense and quite poor > fundamental knowledge in clustering. And please be so kind to correct > me if I am wrong at any point. > > Sincerely, > > Alex > -Original Message- > From: Users On Behalf Of Ken Gaillot > Sent: Thursda

Re: [ClusterLabs] Fast-failover on 2 nodes + qnetd: qdevice connenction disrupted.

2024-05-02 Thread Ken Gaillot
t reply > May 01 23:30:56 node2 corosync-qdevice[781]: seq = 102 > May 01 23:30:56 node2 corosync-qdevice[781]: vote = ACK > May 01 23:30:56 node2 corosync-qdevice[781]: ring id = (2.801) > May 01 23:30:56 node2 corosync-qdevice[781]: Algorithm result vote is > ACK > May 01 23:30:56 node2 corosync-qdevice[781]: Cast vote timer remains > scheduled every 250ms voting ACK. > >>> Here everything become OK and resource started on Node2 > > Also, I’ve done wireshark capture and found great mess in TCP, it > seems like connection between qdevice and qnetd really stops for some > time and packets won’t deliver. > > For my guess, it match corosync syncing activities, and I suspect > that corosync prevent any other traffic on the interface it use for > rings. > > As I switch qnetd and qdevice to use different interface it seems to > work fine. > > So, the question is: does corosync really temporary blocks any other > traffic on the interface it uses? Or it is just a coincidence? If it > blocks, is there a way to manage it? > > Thank you for any suggest on that! > > Sincerely, > > Alex > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-18 Thread Ken Gaillot
PEN} > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Likely deprecation: ocf:pacemaker:o2cb resource agent

2024-04-17 Thread Ken Gaillot
the agent for the Pacemaker 2.1.8 release and drop it for 3.0.0. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Potential deprecation: Node-attribute-based rules in operation meta-attributes

2024-04-02 Thread Ken Gaillot
ttribute expressions for operation meta-attributes, now is the time to speak up! -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Potential deprecation: Disabling schema validation for the CIB

2024-04-02 Thread Ken Gaillot
s a valid use case for this feature, now is the time to speak up! -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] resources cluster stoped with one node

2024-03-20 Thread Ken Gaillot
> ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Can I add a pacemaker v2 node to a v2 cluster ?

2024-03-04 Thread Ken Gaillot
ault-sbd-sync generated- > manpages monotonic nagios ncurses remote systemd > > If this is not possible I will have to think of another solution. > > Thanks > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Is it possible to downgrade feature-set in 2.1.6-8

2024-02-26 Thread Ken Gaillot
ssible to force new node with Pacemaker 2.1.6 to use older > feature-set (3.15.0) for a while until second node is upgraded and is > able to work with Pacemaker 2.1.6? No > > Thank you very much! > _Vitaly > -- Ken Gaillot __

Re: [ClusterLabs] clone_op_key pcmk__notify_key - Triggered fatal assertion

2024-02-19 Thread Ken Gaillot
er input that causes these messages? Also, what version are you using, and how did you get it? -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] pacemaker resource configure issue

2024-02-08 Thread Ken Gaillot
approaches. It's a longstanding goal to allow more flexibility in failure handling, but there hasn't been time to deal with it. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] how to disable pacemaker throttle mode

2024-02-05 Thread Ken Gaillot
at is what my mind recalls. > > I easily get loadavg of 128 on iscsi storage servers with almost free > CPU, no thermal reaction at all. > > Best, > Vlad > > On February 5, 2024 19:22:11 Ken Gaillot wrote: > > > On Mon, 2024-02-05 at 18:08 +0800, hywang via Use

Re: [ClusterLabs] how to disable pacemaker throttle mode

2024-02-05 Thread Ken Gaillot
able (typically in /etc/sysconfig/pacemaker, /etc/default/pacemaker, etc. depending on distro). -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Gracefully Failing Live Migrations

2024-02-01 Thread Ken Gaillot
On Thu, 2024-02-01 at 12:57 -0600, Billy Croan wrote: > How do I figure out which of the three steps failed and why? They're normal resource actions: migrate_to, migrate_from, and stop. You can investigate them in the usual way (status, logs). > > On Thu, Feb 1, 2024 at 11:15 AM Ke

Re: [ClusterLabs] Gracefully Failing Live Migrations

2024-02-01 Thread Ken Gaillot
, by definition the resource must move one way or another. Also, live migration involves three steps, and if one of them fails, the resource is in an unknown state, so it must be restarted anyway. -- Ken Gaillot ___ Manage your subscription: https://

Re: [ClusterLabs] trigger something at ?

2024-02-01 Thread Ken Gaillot
On Thu, 2024-02-01 at 14:31 +0100, lejeczek via Users wrote: > > On 31/01/2024 18:11, Ken Gaillot wrote: > > On Wed, 2024-01-31 at 16:37 +0100, lejeczek via Users wrote: > > > On 31/01/2024 16:06, Jehan-Guillaume de Rorthais wrote: > > > > On Wed, 31 Jan 2024 16

Re: [ClusterLabs] trigger something at ?

2024-01-31 Thread Ken Gaillot
On Wed, 2024-01-31 at 16:37 +0100, lejeczek via Users wrote: > > On 31/01/2024 16:06, Jehan-Guillaume de Rorthais wrote: > > On Wed, 31 Jan 2024 16:02:12 +0100 > > lejeczek via Users wrote: > > > > > On 29/01/2024 17:22, Ken Gaillot wrote: > > > > On

Re: [ClusterLabs] controlling cluster behavior on startup

2024-01-30 Thread Ken Gaillot
DC election happens quickly once both nodes are up. That makes sense > Thanks, > Chris > > From: Users on behalf of Faaland, > Olaf P. via Users > Date: Monday, January 29, 2024 at 7:46 PM > To: Ken Gaillot , Cluster Labs - All topics > related to open-source clustering

Re: [ClusterLabs] controlling cluster behavior on startup

2024-01-29 Thread Ken Gaillot
On Mon, 2024-01-29 at 14:35 -0800, Reid Wahl wrote: > > > On Monday, January 29, 2024, Ken Gaillot wrote: > > On Mon, 2024-01-29 at 18:05 +, Faaland, Olaf P. via Users > wrote: > >> Hi, > >> > >> I have configured clusters of node pairs, s

Re: [ClusterLabs] controlling cluster behavior on startup

2024-01-29 Thread Ken Gaillot
;0" dc-uuid="2"> > > > > name="stonith-action" value="off"/> > > > name="cluster-infrastructure" value="corosync"/> > name="cluster-name" value=&quo

Re: [ClusterLabs] controlling cluster behavior on startup

2024-01-29 Thread Ken Gaillot
'cib- > bootstrap-options'] complete=true > Jan 25 17:56:01 gopher12 pacemaker-controld [116040] > (controld_execute_fence_action) notice: Requesting fencing (off) > targeting node gopher11 | action=11 timeout=60 > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] trigger something at ?

2024-01-29 Thread Ken Gaillot
sky for a resource agent to modify the configuration. Finally you could write a systemd unit to do what you want and order it after pacemaker. What's wrong with leaving the constraints permanently configured? -- Ken Gaillot ___ Manage your subscription: htt

Re: [ClusterLabs] Planning for Pacemaker 3

2024-01-25 Thread Ken Gaillot
On Thu, 2024-01-25 at 10:31 +0100, Jehan-Guillaume de Rorthais wrote: > On Wed, 24 Jan 2024 16:47:54 -0600 > Ken Gaillot wrote: > ... > > > Erm. Well, as this is a major upgrade where we can affect > > > people's > > > conf and > > > break old

Re: [ClusterLabs] Planning for Pacemaker 3

2024-01-24 Thread Ken Gaillot
On Tue, 2024-01-23 at 18:49 +0100, Jehan-Guillaume de Rorthais wrote: > Hi there ! > > On Wed, 03 Jan 2024 11:06:27 -0600 > Ken Gaillot wrote: > > > Hi all, > > > > I'd like to release Pacemaker 3.0.0 around the middle of this > > year.

[ClusterLabs] New ClusterLabs wiki

2024-01-23 Thread Ken Gaillot
if they didn't apply to current software and OSes. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Beginner lost with promotable "group" design

2024-01-17 Thread Ken Gaillot
don't understand what type > of > elements I should create... > > > Thanks in advance, > > Regards, Adam. > > > PS: Bonus question should I use "pcs" or "crm" ? It seems both > command > seem to be equivalent and documentations use someti

Re: [ClusterLabs] Migrating off CentOS

2024-01-15 Thread Ken Gaillot
could blow up though > so I'm not sure that should play a factor in the decision. > > I can't be the first person to go down this path. So what do you all > think? how have you done it in the past? -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Planning for Pacemaker 3

2024-01-04 Thread Ken Gaillot
Thanks, I hadn't heard that! On Thu, 2024-01-04 at 01:13 +0100, Valentin Vidić via Users wrote: > On Wed, Jan 03, 2024 at 11:06:27AM -0600, Ken Gaillot wrote: > > I'd like to release Pacemaker 3.0.0 around the middle of this > > year. > > I'm gathering proposed changes here:

[ClusterLabs] Planning for Pacemaker 3

2024-01-03 Thread Ken Gaillot
the changes will be backward-incompatible, we will continue to make 2.1 releases for a few years, with backports of compatible fixes, to help distribution packagers who need to keep backward compatibility. -- Ken Gaillot ___ Manage your subscription: https

Re: [ClusterLabs] colocate Redis - weird

2024-01-01 Thread Ken Gaillot
e > code/internals) > thanks, L. > Transient attributes are the same as permanent ones except they get cleared when a node leaves the cluster. The constraint says that the masters must be located together, but they each still need to be enabled on a given node with either a master score attribute (permanent or transient) or a location constraint. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] colocation constraint - do I get it all wrong?

2024-01-01 Thread Ken Gaillot
a node, the primary resource might move to allow the colocated resource to run. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Pacemaker 2.1.7 final release now available

2023-12-19 Thread Ken Gaillot
Chris Lumens, Gao,Yan, Grace Chin, Hideo Yamauchi, Jan Pokorný, Ken Gaillot, liupei, Oyvind Albrigtsen, Reid Wahl, xin liang, and xuezhixin. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs

Re: [ClusterLabs] Build cluster one node at a time

2023-12-19 Thread Ken Gaillot
on the first original > node, not on the node being added? Only corosync, pacemaker and pcsd > needs to run on the node to be added and the commands being run on > the original node will speak to these on the new node? > > On Tue, 19 Dec 2023, 21:39 Ken Gaillot, wrote: > > On Tue

Re: [ClusterLabs] Build cluster one node at a time

2023-12-19 Thread Ken Gaillot
de to integrate into the cluster and once done, pcs status shows > two nodes on-line ? > Thanks Yes, you can use pcs cluster setup with the first node, then pcs cluster node add for each additional node. -- Ken Gaillot ___ Manage you

Re: [ClusterLabs] cluster doesn't do HA as expected, pingd doesn't help

2023-12-18 Thread Ken Gaillot
re3 > * OST4(ocf::lustre:Lustre):Stopped > > Again lustre3 seems unable to overrule due to lower score and pingd > DOESN'T help at all! > > > 4) Can I make a reliable HA failover without pingd to keep things as > simple as possible? > 5) Pings might help to affect cluster decisions in

[ClusterLabs] Pacemaker 2.1.7-rc4 now available (likely final for real)

2023-12-12 Thread Ken Gaillot
your last chance to test before the final release, which I expect will be next Tuesday. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] ocf:pacemaker:ping works strange

2023-12-12 Thread Ken Gaillot
should be put in front of lt/gt ? It is > possible that VM goes down, pingd to not_defined, then the rule > evaluates "lt 1" first, catches an error and doesn't evaluate next > part (after OR)? No, the order of and/or clauses doesn't matter. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] ocf:pacemaker:ping works strange

2023-12-12 Thread Ken Gaillot
On Mon, 2023-12-11 at 21:05 +0300, Artem wrote: > Hi Ken, > > On Mon, 11 Dec 2023 at 19:00, Ken Gaillot > wrote: > > > Question #2) I shut lustre3 VM down and leave it like that > > How did you shut it down? Outside cluster control, or with > > someth

Re: [ClusterLabs] resource fails manual failover

2023-12-12 Thread Ken Gaillot
(kind:Optional) (id:order-MDT00-OST1- > Optional) > start MDT00 then start OST2 (kind:Optional) (id:order-MDT00-OST2- > Optional) > > with regards to ordering constraint: OST1 and OST2 are started now, > while I'm exercising MDT00 failover. > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] ocf:pacemaker:ping works strange

2023-12-11 Thread Ken Gaillot
ot; while lustre3 > doesn't > All is according to documentation but results are strange. > Then I tried to add meta target-role="started" to pcs resource create > ping and this time ping started after node rebooted. Can I expect > that it was just missing from official setup documentation, and now > everything will work fine? -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Pacemaker 2.1.7-rc3 now available (likely final)

2023-12-07 Thread Ken Gaillot
in about two weeks. If anyone needs more time, let me know and I can delay it till early January. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Prevent cluster transition when resource unavailable on both nodes

2023-12-06 Thread Ken Gaillot
timeout is configured, then it will try once per node then wait for manual cleanup. If the colocation is made optional or reversed, the other resources can continue to run. > > Any pointers or advice will be much appreciated! > > Thank you and kind regards, > > Alex Eastwood --

Re: [ClusterLabs] Redundant entries in log

2023-12-05 Thread Ken Gaillot
at can be corrected with a new transition, this will be the maximum time until that happens. > > Cheers, > > JB > > > On Nov 29, 2023, at 18:52, Ken Gaillot wrote: > > > > Hi, > > > > Something is triggering a new transition. The most likely candida

Re: [ClusterLabs] RemoteOFFLINE status, permanently

2023-12-04 Thread Ken Gaillot
ey="lustre1_monitor_0" operation="monitor" crm-debug- > origin="controld_update_resource_history" crm_feature_set="3.17.4" > transition-key="5:88:7:288b2e10-0bee-498d-b9eb-4bc5f0f8d5bf" > transition-magic="-1:193;5:88:7:288b2e10-0bee-498d-b9eb-4bc5f0f8d5bf" > exit-reason="&quo

Re: [ClusterLabs] Redundant entries in log

2023-11-29 Thread Ken Gaillot
luster. (/var/lib/pacemaker/pengine/pe-input-250.bz2) > > I noticed the option to restrict the logging to higher levels however > some valuable information is logged under the `notice` level and I > would like to keep it in the logs. > > Please let me know if I am doing something w

[ClusterLabs] Pacemaker 2.1.7-rc1 now available

2023-10-31 Thread Ken Gaillot
Pokorný, Ken Gaillot, liupei, Oyvind Albrigtsen, Reid Wahl, and xuezhixin. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] How to output debug messages in the log file?

2023-10-03 Thread Ken Gaillot
How can I output debug messages? > Hi, Set PCMK_debug=true wherever your distro keeps environment variables for daemons (/etc/sysconfig/pacemaker, /etc/default/pacemaker, etc.). Debug messages will show up in the Pacemaker detail log (typically /var/log/pacemaker/pacema

Re: [ClusterLabs] Mutually exclusive resources ?

2023-09-27 Thread Ken Gaillot
On Wed, 2023-09-27 at 16:24 +0200, Adam Cecile wrote: > On 9/27/23 16:02, Ken Gaillot wrote: > > On Wed, 2023-09-27 at 15:42 +0300, Andrei Borzenkov wrote: > > > On Wed, Sep 27, 2023 at 3:21 PM Adam Cecile > > > wrote: > > > > Hello, > > > > &g

Re: [ClusterLabs] Mutually exclusive resources ?

2023-09-27 Thread Ken Gaillot
e of them has to stop). If you *prefer* that they run on different nodes, but want to allow them to run on the same node in a degraded cluster, use a finite negative score. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] pacemaker-remote

2023-09-18 Thread Ken Gaillot
offline, stop the resource that creates it. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Limit the number of resources starting/stoping in parallel possible?

2023-09-18 Thread Ken Gaillot
> Thanks & greets > > Steffen > Hi, Yes, see the batch-limit cluster option: https://clusterlabs.org/pacemaker/doc/2.1/Pacemaker_Explained/html/options.html#cluster-options -- Ken Gaillot ___ Manage your subscription: https://lists.c

Re: [ClusterLabs] [EXTERNE] Re: Centreon HA Cluster - VIP issue

2023-09-18 Thread Ken Gaillot
; > > Adil BOUAZZAOUI > Ingénieur Infrastructures & Technologies > GSM : +212 703 165 758 > E-mail : adil.bouazza...@tmandis.ma > > > -Message d'origine- > De : Adil BOUAZZAOUI > Envoyé : Friday, September 8, 2023 5:15 PM > À : Ken Gaill

Re: [ClusterLabs] PostgreSQL HA on EL9

2023-09-18 Thread Ken Gaillot
anage your subscription: > > https://urldefense.proofpoint.com/v2/url?u=https- > > 3A__lists.clusterlabs.org_mailman_listinfo_users=DwICAg=gRgGjJ3 > > BkIsb > > 5y6s49QqsA=- > > 46XreMySVoZzxM8t8YcpIX4ayXVWYLvAe0EnGHidNE=VO4147YbENDjp3d > > xoJeWclZ_EfLrehCht5CgW4_stkgPmryQN0kBA6G12wBwYztD=2Rx_74MVv > > kAWfZLyMhZw5GCY_37uyRffB2HV4_zkvOY= > > > > ClusterLabs home: https://urldefense.proofpoint.com/v2/url?u=https- > > 3A__www.clusterlabs.org_=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=- > > 46XreMySVoZzxM8t8YcpIX4ayXVWYLvAe0EnGHidNE=VO4147YbENDjp3d > > xoJeWclZ_EfLrehCht5CgW4_stkgPmryQN0kBA6G12wBwYztD=lofFF14IrTG > > 21epUbKbV0oUl-IrXZDSuNcaM1GM7FvU= > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] PostgreSQL HA on EL9

2023-09-13 Thread Ken Gaillot
that DB still thinks it’s a primary), and the > node is fenced. > > Is this an intended behavior for the versions of pacemaker/corosync > that I’m running, or a regression? It may be possible to put an > override into the systemd unit file for corosync to force t

Re: [ClusterLabs] MySQL cluster with auto failover

2023-09-13 Thread Ken Gaillot
cumentation about clones. There are some regression tests in the code base that include galera resources. Some use clones and others bundles (containerized). For example: https://github.com/ClusterLabs/pacemaker/blob/main/cts/scheduler/xml/unrunnable-2.xml > > Il giorno lun 11 set 2023 alle o

Re: [ClusterLabs] MySQL cluster with auto failover

2023-09-11 Thread Ken Gaillot
run > all the > time. > > > Do you have any guide that pack this everything together? > > No; I've largely made this stuff up myself as I've needed it. > > > Antony. > -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Centreon HA Cluster - VIP issue

2023-09-05 Thread Ken Gaillot
might be sufficient to use a booth ticket for just the DNS resource, and let everything else stay running all the time. For example it doesn't hurt anything for both sites' floating IPs to stay up. > Regards > Adil Bouazzaoui > > Le mar. 5 sept. 2023 à 16:48, Ken Gaillot a > écri

Re: [ClusterLabs] Centreon HA Cluster - VIP issue

2023-09-05 Thread Ken Gaillot
BOUAZZAOUI Ingénieur Infrastructures & Technologies > GSM : +212 703 165 758 E-mail : adil.bouazza...@tmandis.ma > > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users &

Re: [ClusterLabs] corosync 2.4 and 3.0 in one cluster.

2023-09-05 Thread Ken Gaillot
gt; with version 2 and 3, if corosync version 2 is configured with > crypto_hash sha256. > > > Thanks, > Anton. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Coming in Pacemaker 2.1.7: Pacemaker Remote nodes honor PCMK_node_start_state

2023-08-28 Thread Ken Gaillot
for full cluster nodes. It lets you tell the cluster that a new node should start in standby mode when it is added. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] Asking about Clusterlabs cross DC cluster

2023-08-21 Thread Ken Gaillot
ple: > Node 1 (Master) in VLAN 1: 172.30.100.10 /24 > Node 2 (slave) in VLAN 2: 172.30.200.10 /24 > > Note: i deployed Centeron HA Cluster with Corosync/Pacemaker on same > VLAN and it's working fine. > my idea is to move Slave node on another site (VLAN 2). > > Thank you in adv

Re: [ClusterLabs] DRBD Cluster Problem

2023-08-10 Thread Ken Gaillot
ure-response The cluster has no way of knowing ahead of time whether the situation is resolved -- it just cleans up the failure at the failure-timeout and tries again. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listi

Re: [ClusterLabs] Need a help with "(crm_glib_handler) crit: GLib: g_hash_table_lookup: assertion 'hash_table != NULL' failed"

2023-08-03 Thread Ken Gaillot
be straightforward. A workaround in the meantime would be to shut down nodes in sequence rather than in parallel, when shutting down just some nodes. (Shutting down the entire cluster shouldn't be subject to the race condition.) On Wed, 2023-08-02 at 16:53 -0500, Ken Gaillot wrote: > Ha! I didn't real

Re: [ClusterLabs] Need a help with "(crm_glib_handler) crit: GLib: g_hash_table_lookup: assertion 'hash_table != NULL' failed"

2023-08-02 Thread Ken Gaillot
.vm03.bz2 > 2023-07-21_pacemaker_debug.log.vm04.bz2 > blackbox_txt_vm04.tar.bz2 > On Thu, Jul 27 12:06:42 EDT 2023, Ken Gaillot kgaillot at redhat.com > wrote: > > > Running "qb-blackbox /var/lib/pacemaker/blackbox/pacemaker- > controld- > > 4257.1" (my versi

Re: [ClusterLabs] pcs node removal still crm_node it is removed node is listing as lost node

2023-07-27 Thread Ken Gaillot
On Thu, 2023-07-13 at 11:03 -0500, Ken Gaillot wrote: > On Thu, 2023-07-13 at 09:58 +, S Sathish S via Users wrote: > > Hi Team, > > > > Problem Statement : we are trying to remove node on pcs cluster, > > post > > execution also still crm_node > > it

Re: [ClusterLabs] Need a help with "(crm_glib_handler) crit: GLib: g_hash_table_lookup: assertion 'hash_table != NULL' failed"

2023-07-27 Thread Ken Gaillot
05:47 vm01 pacemaker-schedulerd[4028]: notice: Calculated > transition 17, saving inputs in > /var/lib/pacemaker/pengine/pe-input-940.bz2 > > 939: > crm-debug-origin="do_state_transition" join="down" expected="down"> > > >

Re: [ClusterLabs] Pacemaker fatal shutdown

2023-07-25 Thread Ken Gaillot
On Thu, 2023-07-20 at 12:43 +0530, Priyanka Balotra wrote: > What I mainly want to understand is that: > - why "fatal failure" is coming The logs so far don't show that. The earliest sign is: Jul 17 14:18:20.085 FILE-6 pacemaker-fenced[19411] (remote_op_done) notice: Operation 'reboot'

Re: [ClusterLabs] Fencing issue with dlm resources in pacemaker cluster.

2023-07-25 Thread Ken Gaillot
using any device > > I need the node to be powered off from fencing operations rather than > a reboot. Disabling fencing on dlm resources is not an option. Is > there any other way to solve this and make dlm issue a poweroff > action instead of a reboot as part of fencing. > &g

Re: [ClusterLabs] Pacemaker fatal shutdown

2023-07-19 Thread Ken Gaillot
invoking handler) > > Could you please help me understand the issue here. > > Regards > Priyanka > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] FEEDBACK WANTED: possible deprecation of nagios-class resources

2023-07-17 Thread Ken Gaillot
action, so it would be just a few lines of > coding. > > >If custom agents are not convenient enough, we could consider "un- > deprecating" nagios resources if there is demand to keep them. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] pcs node removal still crm_node it is removed node is listing as lost node

2023-07-13 Thread Ken Gaillot
here. > pacemaker-2.1.6-1.el8 > corosync-3.1.7-1.el8 > pcs-0.10.16-1.el8 > > Thanks and Regards, > S Sathish S -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] newly created clone waits for off-lined node

2023-07-13 Thread Ken Gaillot
de, it's based on a node attribute the agent sets, so it is only empty for newly created resources that haven't yet run on a node. I'm not sure if there's a way around it. (Anyone else have experience with that?) -- Ken Gaillot ___ Manage your subscriptio

Re: [ClusterLabs] location constraint does not move promoted resource ?

2023-07-03 Thread Ken Gaillot
On Mon, 2023-07-03 at 19:22 +0300, Andrei Borzenkov wrote: > On 03.07.2023 18:07, Ken Gaillot wrote: > > On Mon, 2023-07-03 at 12:20 +0200, lejeczek via Users wrote: > > > On 03/07/2023 11:16, Andrei Borzenkov wrote: > > > > On 03.07.2023 12:05, lejeczek via

Re: [ClusterLabs] location constraint does not move promoted resource ?

2023-07-03 Thread Ken Gaillot
ny placement strategy configured. These mainly include stickiness, location constraints, colocation constraints, and node health. Nodes may be eliminated from consideration by resource migration thresholds, standby/maintenance mode, etc. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] FEEDBACK WANTED: possible deprecation of nagios-class resources

2023-07-03 Thread Ken Gaillot
could consider "un- deprecating" nagios resources if there is demand to keep them. On Fri, 2023-06-30 at 10:51 +0800, Mr.R via Users wrote: > Hi Ken Gaillot > > There are a few questions about nagios, > In pacemaker-2.1.6, the nagios-class resource may be deprecated.

Re: [ClusterLabs] no-quorum-policy=ignore is (Deprecated ) and replaced with other options but not an effective solution

2023-06-27 Thread Ken Gaillot
t; >>> stonith-enabled=true \stonith-timeout=172 \ > > > >>> stonith-action=reboot \stop-all-resources=false \ > > > >>> no-quorum-policy=ignorersc_defaults build-resource-defaults: > > > \ > > > >>> resource-stic

Re: [ClusterLabs] no-quorum-policy=ignore is (Deprecated ) and replaced with other options but not an effective solution

2023-06-27 Thread Ken Gaillot
warning: Node FILE-4 is unclean!* > > > > > > > According to this output FILE-1 lost connection to three other > > nodes, in > > which case it cannot be quorate. > > > > > > > > Kindly help correct the configuration to make the system function > > normally > > > with all re

Re: [ClusterLabs] Pacemaker logs written on message which is not expected as per configuration

2023-06-26 Thread Ken Gaillot
; > nodelist { > > node { > > ring0_addr: node1 > > name: node1 > > nodeid: 1 > > } > > } > > > > quorum { > > provider: corosync_votequorum > > } > > > > logging { > > to_logfile: yes > > logfile: /var/log/cluster/corosync.log > > to_syslog: no > > timestamp: on > > } > > > > Thanks and Regards, > > S Sathish S -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] host in standby causes havoc

2023-06-15 Thread Ken Gaillot
ed, it's definitely not running. > > Best regards, > Jozsef > -- > E-mail : kadlecsik.joz...@wigner.hu > PGP key: https://wigner.hu/~kadlec/pgp_public_key.txt > Address: Wigner Research Centre for Physics > H-1525 Budapest 114, POB. 49, Hungary -- Ken Gaillo

Re: [ClusterLabs] cluster okey but errors when tried to move resource - ?

2023-06-12 Thread Ken Gaillot
Invalid configuration > > > > > > ___ > > > Manage your subscription: > > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > > > ClusterLabs home: https://www.clusterlabs.org/ > > > > > > -- > > Regards, > > > > Reid Wahl (He/Him) > > Senior Software Engineer, Red Hat > > RHEL High Availability - Pacemaker > > > > ___ > > Manage your subscription: > > https://lists.clusterlabs.org/mailman/listinfo/users > > > > ClusterLabs home: https://www.clusterlabs.org/ > > ___ > Manage your subscription: > https://lists.clusterlabs.org/mailman/listinfo/users > > ClusterLabs home: https://www.clusterlabs.org/ -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

Re: [ClusterLabs] cluster okey but errors when tried to move resource - ?

2023-06-05 Thread Ken Gaillot
present > in property set 'cib-bootstrap-options' That's because the custom options are in their own cluster_property_set. I believe pcs can only manage the options in the cluster_property_set with id="cib-bootstrap-options", so you'd have to use "pcs cluster edit" or crm_attribute to remove the custom ones. > > Any & all suggestions on how to fix this are much appreciated. > many thanks, L. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Pacemaker 2.1.6 final release now available

2023-05-24 Thread Ken Gaillot
, Gao,Yan, Grace Chin, Ken Gaillot, Klaus Wenninger, lihaipeng, liupei, liutong, Reid Wahl, Tahlia Richardson, wanglujun, WangMengabc, xuezhixin, and zhanghuanhuan. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman

Re: [ClusterLabs] Hyperconverged 3 Node Cluster

2023-05-16 Thread Ken Gaillot
all that active and I am looking to simplify my solution > for > reliability and ability to troubleshoot. > > > Appreciate any guidance to point me the right direction. > > Thanks, > > Adam -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] Pacemaker 2.1.6-rc2 now available

2023-05-02 Thread Ken Gaillot
ted. Many thanks to all contributors of source code and language translations to this release, including Chris Lumens, Gao,Yan, Ken Gaillot, and Klaus Wenninger. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/list

Re: [ClusterLabs] Best DRBD Setup

2023-04-26 Thread Ken Gaillot
o > restart correctly. Another thing I have noticed is that it will > sometimes take as long as 10-12 minutes to mount one of the DRBD > filesystems (XFS) so I have extended the start timeout for each *- > mount to 15 minutes. > > Thanks in advance for any advice to improve the set

Re: [ClusterLabs] How to block/stop a resource from running twice?

2023-04-26 Thread Ken Gaillot
-US/Pacemaker/1.1/html/Pacemaker_Explained/s-resource-options.html Since Pacemaker 2.1.4, multiple-active can be set to "stop_unexpected" to do what you want. It's not the default because some services may no longer operate correctly if an extra instance was started on the same host, s

Re: [ClusterLabs] Corosync 3.1.5 Fails to Autostart

2023-04-24 Thread Ken Gaillot
n > this one? Mostly checking to see if changing the After dependency > will harm us in the future. > > Thanks! > > Respectfully, > Tyler Phillippe -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/listinfo/users ClusterLabs home: https://www.clusterlabs.org/

[ClusterLabs] FEEDBACK WANTED: possible deprecation of nagios-class resources

2023-04-19 Thread Ken Gaillot
even call a nagios plugin). Does anyone here use nagios-class resources? If it's actively being used, I'm willing to keep it around. But if there's no demand, we'd rather not have to maintain that (poorly tested) code forever. -- Ken Gaillot ___ Manage

Re: [ClusterLabs] manner in which cluster migrates VirtualDomain - ?

2023-04-19 Thread Ken Gaillot
On Wed, 2023-04-19 at 08:00 +0200, lejeczek via Users wrote: > > On 18/04/2023 21:02, Ken Gaillot wrote: > > On Tue, 2023-04-18 at 19:36 +0200, lejeczek via Users wrote: > > > On 18/04/2023 18:22, Ken Gaillot wrote: > > > > On Tue, 2023-04-18 at 14:58 +0200, lejec

Re: [ClusterLabs] Offtopic - role migration

2023-04-18 Thread Ken Gaillot
which case call this special agent action to prepare it for running with a demoted instance, then demote the instance, then migrate the resource, then promote the new instance, then call this other agent action to return it to normal operation". -- Ken Gaillot __

Re: [ClusterLabs] manner in which cluster migrates VirtualDomain - ?

2023-04-18 Thread Ken Gaillot
On Tue, 2023-04-18 at 19:36 +0200, lejeczek via Users wrote: > > On 18/04/2023 18:22, Ken Gaillot wrote: > > On Tue, 2023-04-18 at 14:58 +0200, lejeczek via Users wrote: > > > Hi guys. > > > > > > When it's done by the cluster itself, eg. a node goes

Re: [ClusterLabs] manner in which cluster migrates VirtualDomain - ?

2023-04-18 Thread Ken Gaillot
s(always?) a kind of 'swarm' migration? The migration-limit cluster property specifies how many live migrations may be initiated at once (the default of -1 means unlimited). -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.org/mailman/

[ClusterLabs] Pacemaker 2.1.6-rc1 now available

2023-04-17 Thread Ken Gaillot
. Di Nitto, Gao,Yan, Grace Chin, Ken Gaillot, Klaus Wenninger, lihaipeng, liupei, liutong, Reid Wahl, Tahlia Richardson, wanglujun, WangMengabc, xuezhixin, and zhanghuanhuan. -- Ken Gaillot ___ Manage your subscription: https://lists.clusterlabs.o

Re: [ClusterLabs] ClusterMon resource creation getting illegal option -- E in ClusterMon

2023-04-12 Thread Ken Gaillot
sync-3.1.7-1.el8.x86_64 > pacemaker-2.1.4-1.2.1.4.git.el8.x86_64 > pacemaker-libs-2.1.4-1.2.1.4.git.el8.x86_64 > pacemaker-schemas-2.1.4-1.2.1.4.git.el8.noarch > > Thanks and Regards, > S Sathish S > ___ > Ma

Re: [ClusterLabs] Location not working [FIXED]

2023-04-11 Thread Ken Gaillot
the addressee whose name is specified above. Should you receive > this message by mistake, we would be most grateful if you informed us > that the message has been sent to you. In this case, we also ask that > you delete this message from your mailbox, and do not forw

Re: [ClusterLabs] Location not working

2023-04-10 Thread Ken Gaillot
On Mon, 2023-04-10 at 16:33 +0300, Andrei Borzenkov wrote: > On Mon, Apr 10, 2023 at 4:26 PM Ken Gaillot > wrote: > > On Mon, 2023-04-10 at 14:18 +0300, Miro Igov wrote: > > > Hello, > > > I have a resource with location constraint set to: > > > > >

Re: [ClusterLabs] Location not working

2023-04-10 Thread Ken Gaillot
ssage by mistake, we would be most grateful if you informed us > that the message has been sent to you. In this case, we also ask that > you delete this message from your mailbox, and do not forward it or > any part of it to anyone else. > Thank you for your cooperation

  1   2   3   4   5   6   7   8   9   10   >