Re: [ClusterLabs] no-quorum-policy=ignore is (Deprecated ) and replaced with other options but not an effective solution

2023-06-28 Thread Klaus Wenninger
On Wed, Jun 28, 2023 at 7:38 AM Klaus Wenninger wrote: > > > On Wed, Jun 28, 2023 at 3:30 AM Priyanka Balotra < > priyanka.14balo...@gmail.com> wrote: > >> I am using SLES 15 SP4. Is the no-quorum-policy still supported? >> >> > Thanks >> Pri

Re: [ClusterLabs] no-quorum-policy=ignore is (Deprecated ) and replaced with other options but not an effective solution

2023-06-27 Thread Klaus Wenninger
On Tue, Jun 27, 2023 at 5:24 PM Andrei Borzenkov wrote: > On 27.06.2023 07:21, Priyanka Balotra wrote: > > Hi Andrei, > > After this state the system went through some more fencings and we saw > the > > following state: > > > > :~ # crm status > > Cluster Summary: > >* Stack: corosync > >

Re: [ClusterLabs] cluster doesn't do HA as expected, pingd doesn't help

2023-12-19 Thread Klaus Wenninger
On Tue, Dec 19, 2023 at 10:00 AM Andrei Borzenkov wrote: > On Tue, Dec 19, 2023 at 10:41 AM Artem wrote: > ... > > Dec 19 09:48:13 lustre-mds2.ntslab.ru pacemaker-schedulerd[785107] > (update_resource_action_runnable)warning: OST4_stop_0 on lustre4 is > unrunnable (node is offline) > > Dec

[ClusterLabs] Pacemaker 2.1.7-rc2 now available

2023-11-24 Thread Klaus Wenninger
Hi all, Source code for the 2nd release candidate for Pacemaker version 2.1.7 is available at: https://github.com/ClusterLabs/pacemaker/releases/tag/Pacemaker-2.1.7-rc2 This is primarily a bug fix release. See the ChangeLog or the link above for details. Everyone is encouraged to download,

Re: [ClusterLabs] trigger something at ?

2024-01-29 Thread Klaus Wenninger
On Mon, Jan 29, 2024 at 5:22 PM Ken Gaillot wrote: > On Fri, 2024-01-26 at 13:55 +0100, lejeczek via Users wrote: > > Hi guys. > > > > Is it possible to trigger some... action - I'm thinking specifically > > at shutdown/start. > > If not within the cluster then - if you do that - perhaps

Re: [ClusterLabs] controlling cluster behavior on startup

2024-01-30 Thread Klaus Wenninger
On Tue, Jan 30, 2024 at 2:21 PM Walker, Chris wrote: > >>> However, now it seems to wait that amount of time before it elects a > >>> DC, even when quorum is acquired earlier. In my log snippet below, > >>> with dc-deadtime 300s, > >> > >> The dc-deadtime is not waiting for quorum, but for

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-18 Thread Klaus Wenninger
On Thu, Apr 18, 2024 at 5:07 PM NOLIBOS Christophe via Users < users@clusterlabs.org> wrote: > Classified as: {OPEN} > > I'm using RedHat 8.8 (4.18.0-477.21.1.el8_8.x86_64). > When I kill Corosync, no new corosync process is created and pacemaker is > in failure. > The only solution is to restart

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-18 Thread Klaus Wenninger
*De la part de* NOLIBOS > Christophe via Users > *Envoyé :* jeudi 18 avril 2024 18:34 > *À :* Klaus Wenninger ; Cluster Labs - All topics > related to open-source clustering welcomed > *Cc :* NOLIBOS Christophe > *Objet :* Re: [ClusterLabs] "pacemakerd: recover properly

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-18 Thread Klaus Wenninger
On Thu, Apr 18, 2024 at 6:09 PM Klaus Wenninger wrote: > > > On Thu, Apr 18, 2024 at 6:06 PM NOLIBOS Christophe < > christophe.noli...@thalesgroup.com> wrote: > >> Classified as: {OPEN} >> >> >> >> Well… why do you say that « Well if c

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-22 Thread Klaus Wenninger
rocess - so that the exit-code could be set to 0 - should be fine. Klaus > > *De :* Klaus Wenninger > *Envoyé :* jeudi 18 avril 2024 20:17 > *À :* NOLIBOS Christophe > *Cc :* Cluster Labs - All topics related to open-source clustering > welcomed > *Objet :* Re: [ClusterLabs]

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-22 Thread Klaus Wenninger
Maybe pacemaker changed behavior here without syncing enough with corosync behavior. We'll look into that to see which approach is better - restart corosync on failure - or have pacemaker be restarted by systemd which should in turn restart corosync as well. Klaus > > > Thanks a lot. >

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-23 Thread Klaus Wenninger
On Tue, Apr 23, 2024 at 10:34 AM Klaus Wenninger wrote: > > > On Tue, Apr 23, 2024 at 9:53 AM NOLIBOS Christophe < > christophe.noli...@thalesgroup.com> wrote: > >> Classified as: {OPEN} >> >> >> >> Other strange thing. >> >> On RHE

Re: [ClusterLabs] "pacemakerd: recover properly from Corosync crash" fix

2024-04-23 Thread Klaus Wenninger
it would be restarted. Klaus > > > *De :* Klaus Wenninger > *Envoyé :* lundi 22 avril 2024 12:41 > *À :* NOLIBOS Christophe > *Cc :* Cluster Labs - All topics related to open-source clustering > welcomed > *Objet :* Re: [ClusterLabs] "pacemakerd: recover properly from

Re: [ClusterLabs] Fast-failover on 2 nodes + qnetd: qdevice connenction disrupted.

2024-05-06 Thread Klaus Wenninger
On Fri, May 3, 2024 at 8:59 PM wrote: > Hi, > > > > Also, I've done wireshark capture and found great mess in TCP, it > > > seems like connection between qdevice and qnetd really stops for some > > > time and packets won't deliver. > > > > Could you check UDP? I guess there is a lot of UDP

Re: [ClusterLabs] crm services not getting started after upgrading to snmp40

2024-05-23 Thread Klaus Wenninger
On Wed, May 22, 2024 at 5:16 PM ., Anoop wrote: > What is the "certain filesystem"? If cluster services require it, that > would explain why they can't start. > - Here we have btrfs and xfs filesystems. Yes cluster services require > these filesystem to be mounted. > What do the systemd

Re: [ClusterLabs] Strange behavior of Resource stickiness

2024-05-28 Thread Klaus Wenninger
On Tue, May 28, 2024 at 12:34 PM Александр Руденко wrote: > Andrei, thank you! > > I tried to find node's scores and have found location constraints for > these 3 resources: > > pcs constraint > Location Constraints: > Resource: fsmt-28085F00 > Enabled on: > Node: vdc16

Re: [ClusterLabs] Strange behavior of Resource stickiness

2024-05-28 Thread Klaus Wenninger
On Tue, May 28, 2024 at 10:40 AM Александр Руденко wrote: > Hi! > > I can't understand this strange behavior, help me please. > > I have 3 nodes in my cluster, 4 vCPU/8GB RAM each. And about 70 groups, 2 > resources in each group. First one resource is our custom resource which > configures

Re: [ClusterLabs] Need advice: deep pacemaker integration, best approach?

2024-06-10 Thread Klaus Wenninger
On Mon, Jun 10, 2024 at 6:12 PM Ken Gaillot wrote: > On Sun, 2024-06-09 at 23:13 +0300, ale...@pavlyuts.ru wrote: > > Hi All, > > > > We intend to integrate Pacemaker as failover engine into a very > > specific product. The handmade prototype works pretty well. It > > includes a couple of dozens

Re: [ClusterLabs] Users Digest, Vol 104, Issue 5

2023-09-05 Thread Klaus Wenninger via Users
r body 'help' to >> users-requ...@clusterlabs.org >> >> You can reach the person managing the list at >> users-ow...@clusterlabs.org >> >> When replying, please edit your Subject line so it is more specific >> than "Re: Contents of Users d

<    1   2   3   4   5   6