[ClusterLabs] Antw: Re: Antw: Re: Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ulrich Windl
>>> Andrei Borzenkov schrieb am 18.12.2020 um 08:21 in Nachricht <58579c3b-33ce-a121-5d67-00305f3d7...@gmail.com>: > 18.12.2020 10:09, Ulrich Windl пишет: > Andrei Borzenkov schrieb am 18.12.2020 um 08:01 in >> Nachricht : >>> 17.12.2020 21:30, Ken Gaillot пишет: This reminded me

[ClusterLabs] FYI: crm shell enhancement proposal #699

2020-12-17 Thread Ulrich Windl
Hi! For those using crm shell might be interested in the enhancement proposal I just had made: https://github.com/ClusterLabs/crmsh/issues/699 Enhancements to the enhancement welcome ;-) Regards, Ulrich ___ Manage your subscription:

Re: [ClusterLabs] Antw: Re: Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Andrei Borzenkov
18.12.2020 10:09, Ulrich Windl пишет: Andrei Borzenkov schrieb am 18.12.2020 um 08:01 in > Nachricht : >> 17.12.2020 21:30, Ken Gaillot пишет: >>> >>> This reminded me that some IPMI implementations return "success" for >>> commands before they've actually been completed. This is why >>>

[ClusterLabs] Antw: Re: Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ulrich Windl
>>> Andrei Borzenkov schrieb am 18.12.2020 um 08:01 in Nachricht : > 17.12.2020 21:30, Ken Gaillot пишет: >> >> This reminded me that some IPMI implementations return "success" for >> commands before they've actually been completed. This is why >> fence_ipmilan has a "power_wait" parameter that

Re: [ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Andrei Borzenkov
17.12.2020 21:30, Ken Gaillot пишет: > > This reminded me that some IPMI implementations return "success" for > commands before they've actually been completed. This is why > fence_ipmilan has a "power_wait" parameter that defaults to 2 seconds. > But on this case we also do not know whether

Re: [ClusterLabs] Antw: [EXT] Changing order in resource group after it's created

2020-12-17 Thread Reid Wahl
I agree that it is somewhat counter-intuitive. It is in the `--help` output, however -- although you have to already have an idea of where to look in order to find it ;). [root@fastvm-rhel-7-6-21 ~]# pcs resource group --help Usage: pcs resource group... group add [resource id] ...

Re: [ClusterLabs] Antw: Re: Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Gabriele Bulfon
Would a change of network class on one node be ok?     Sonicle S.r.l. : http://www.sonicle.com Music: http://www.gabrielebulfon.com eXoplanets : https://gabrielebulfon.bandcamp.com/album/exoplanets   -- Da: Ulrich

Re: [ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ken Gaillot
On Thu, 2020-12-17 at 19:13 +0300, Andrei Borzenkov wrote: > 17.12.2020 14:02, Ulrich Windl пишет: > > > > > Andrei Borzenkov schrieb am 17.12.2020 > > > > > um 09:50 in > > > > Nachricht > > > >: > > > > ... > > > According to logs from xstha1, it started to activate resources > > > only > >

Re: [ClusterLabs] Q: crm_mon "Statck:" columns

2020-12-17 Thread Ken Gaillot
On Thu, 2020-12-17 at 21:20 +0900, Techs-yama wrote: > Hello there, > > This is probably a fairly rudimentary question. > I recently started studying corosync and pacemaker. > > I have a question about how to recognize heartbeat or corosync status > type. > Where does the "Stack:" columns

Re: [ClusterLabs] Changing order in resource group after it's created

2020-12-17 Thread Strahil Nikolov
Use the syntax as if your resource was never in a group and use '--before/--after' to specify the new location. Best Regards, Strahil Nikolov В четвъртък, 17 декември 2020 г., 13:21:55 Гринуич+2, Tony Stocker написа: I have a resource group that has a number of entries. If I want

Re: [ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Andrei Borzenkov
17.12.2020 14:02, Ulrich Windl пишет: Andrei Borzenkov schrieb am 17.12.2020 um 09:50 in > Nachricht > : > > ... >> According to logs from xstha1, it started to activate resources only >> after stonith was confirmed >> >> Dec 16 15:08:12 [708] stonith‑ng: notice: log_operation: >>

Re: [ClusterLabs] Antw: [EXT] delaying start of a resource

2020-12-17 Thread Gabriele Bulfon
Sorry, somtimes I want to make it simpler, and maybe I'm missing informations. I think I found what happened, and actually xstha2 DID NOT mount the zpool, nor start the IP address.   Let's make a step back. We have two ip resources, one is normally for xstha1, the other is normally for xstha2.

Re: [ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Roger Zhou
Here is a tool intend to standardize the approach to simulate split-brain https://software.opensuse.org/package/python3-cluster-preflight-check After installation, simply run the comand: `ha-cluster-preflight-check --split-brain-iptables` Thanks, Roger On 12/17/20 4:14 PM, Gabriele Bulfon

[ClusterLabs] Q: crm_mon "Statck:" columns

2020-12-17 Thread Techs-yama
Hello there, This is probably a fairly rudimentary question. I recently started studying corosync and pacemaker. I have a question about how to recognize heartbeat or corosync status type. Where does the "Stack:" columns recognize for configurations when run the crm_mon command? [root@host1 ~]#

Re: [ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Gabriele Bulfon
I see, but then I have to issues:   1. it is a dual node server, the HA interface is internal, I have no way to unplug it, that's why I tried turning it down   2. even in case I could test it by unplugging it, there is still the possibility that someone turns the interface down, causing a bad

Re: [ClusterLabs] Antw: [EXT] delaying start of a resource

2020-12-17 Thread Gabriele Bulfon
Yes, sorry took same bash by mistake...here are the correct logs.   Yes, xstha1 has delay 10s so that I'm giving him precedence, xstha2 has delay 1s and will be stonished earlier. During the short time before xstha2 got powered off, I saw it had time to turn on NFS IP (I saw duplicated IP on

Re: [ClusterLabs] Antw: Re: Antw: [EXT] Recoveing from node failure

2020-12-17 Thread Gabriele Bulfon
These one:   https://github.com/OpenIndiana/oi-userland/tree/oi/hipster/components/cluster/pacemaker/patches   and some specifics of the build are here:   https://github.com/OpenIndiana/oi-userland/blob/oi/hipster/components/cluster/pacemaker/Makefile   Gabriele     Sonicle S.r.l. : 

Re: [ClusterLabs] Antw: [EXT] Changing order in resource group after it's created

2020-12-17 Thread Tony Stocker
On Thu, Dec 17, 2020 at 6:29 AM Ulrich Windl wrote: > > >>> Tony Stocker schrieb am 17.12.2020 um 12:21 in > Nachricht > : > > I have a resource group that has a number of entries. If I want to > > reorder them, how do I do that? > > > > I tried doing this: > > > > pcs resource update FileMount

[ClusterLabs] Antw: Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ulrich Windl
>>> "Ulrich Windl" schrieb am 17.12.2020 um 12:23 in Nachricht <5fdb3fbe02a10003d...@gwsmtp.uni-regensburg.de>: ... > > I wonder: Did you remove the hostnames from the log messages? Also are the > times in sync, wondering that at the same second a resource is fallged s/fallged/flagged/ >

[ClusterLabs] Antw: [EXT] Changing order in resource group after it's created

2020-12-17 Thread Ulrich Windl
>>> Tony Stocker schrieb am 17.12.2020 um 12:21 in Nachricht : > I have a resource group that has a number of entries. If I want to > reorder them, how do I do that? > > I tried doing this: > > pcs resource update FileMount ‑‑after InternalIP > > but got this error: > > Error: Specified

[ClusterLabs] Antw: Re: Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ulrich Windl
>>> Gabriele Bulfon schrieb am 17.12.2020 um 09:14 in Nachricht <2080536991.1106.1608192888030@www>: > I see, but then I have to issues: > > 1. it is a dual node server, the HA interface is internal, I have no way to > unplug it, that's why I tried turning it down You could block traffic

[ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ulrich Windl
>>> Gabriele Bulfon schrieb am 17.12.2020 um 09:11 in Nachricht <2129123894.1061.1608192712316@www>: > Yes, sorry took same bash by mistake...here are the correct logs. > > Yes, xstha1 has delay 10s so that I'm giving him precedence, xstha2 has > delay 1s and will be stonished earlier. >

[ClusterLabs] Changing order in resource group after it's created

2020-12-17 Thread Tony Stocker
I have a resource group that has a number of entries. If I want to reorder them, how do I do that? I tried doing this: pcs resource update FileMount --after InternalIP but got this error: Error: Specified option '--after' is not supported in this command Is there a way to change this? I

[ClusterLabs] Antw: Re: Antw: [EXT] delaying start of a resource

2020-12-17 Thread Ulrich Windl
>>> Andrei Borzenkov schrieb am 17.12.2020 um 09:50 in Nachricht : ... > According to logs from xstha1, it started to activate resources only > after stonith was confirmed > > Dec 16 15:08:12 [708] stonith‑ng: notice: log_operation: > Operation 'off' [1273] (call 4 from crmd.712) for host

Re: [ClusterLabs] Antw: [EXT] delaying start of a resource

2020-12-17 Thread Andrei Borzenkov
On Thu, Dec 17, 2020 at 1:17 PM Gabriele Bulfon wrote: > Actually, reading again the "duplicated IP" message, it was xstha1 that > (having the pool mounted and not seeing xstha2 anymore) got the xstha2 IP for > NFS. Which confirms my conclusions. > > So I think there is no worry about the

Re: [ClusterLabs] Antw: [EXT] delaying start of a resource

2020-12-17 Thread Andrei Borzenkov
On Thu, Dec 17, 2020 at 11:11 AM Gabriele Bulfon wrote: > > Yes, sorry took same bash by mistake...here are the correct logs. > > Yes, xstha1 has delay 10s so that I'm giving him precedence, xstha2 has delay > 1s and will be stonished earlier. > During the short time before xstha2 got powered