Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Gang He
Hello Digimer, >>> > On 2018-03-08 12:10 PM, David Teigland wrote: >>> I use active rrp_mode in corosync.conf and reboot the cluster to let the > configuration effective. >>> But, the about 5 mins hang in new_lockspace() function is still here. >> >> The last time I tested connection

Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Digimer
On 2018-03-08 12:10 PM, David Teigland wrote: >> I use active rrp_mode in corosync.conf and reboot the cluster to let the >> configuration effective. >> But, the about 5 mins hang in new_lockspace() function is still here. > > The last time I tested connection failures with sctp was several

Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Gang He
Hello David, If sctp implementation did not fix this problem, there is any workaround for a two-rings cluster? Could we use TCP protocol in DLM under a two-rings cluster to by-pass connection channel switch issue? Thanks Gang >>> >> I use active rrp_mode in corosync.conf and reboot the

[ClusterLabs] 答复: 答复: 答复: 答复: How to configure to make each slave resource has one VIP

2018-03-08 Thread 范国腾
Thanks Rorthais, Got it. The following command could make sure that it move to the master if there is no standby alive: pcs constraint colocation add pgsql-ip-stby1 with slave pgsql-ha 100 pcs constraint colocation add pgsql-ip-stby1 with pgsql-ha 50 -邮件原件- 发件人: Jehan-Guillaume de

Re: [ClusterLabs] copy file

2018-03-08 Thread Ken Gaillot
On Thu, 2018-03-08 at 18:49 +0100, Mevo Govo wrote: > Hi,  > thanks for advice and your intrest. > We would use oracle database over DRBD. Datafiles (and control and > redo files) will be on DRBD. FRA also (on an other DRBD device). But > we are new in DRBD, and DRBD is also a component what can

Re: [ClusterLabs] copy file

2018-03-08 Thread Mevo Govo
Hi, thanks for advice and your intrest. We would use oracle database over DRBD. Datafiles (and control and redo files) will be on DRBD. FRA also (on an other DRBD device). But we are new in DRBD, and DRBD is also a component what can fails. We plan a scenario to recover the database without DRBD

Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread David Teigland
> I use active rrp_mode in corosync.conf and reboot the cluster to let the > configuration effective. > But, the about 5 mins hang in new_lockspace() function is still here. The last time I tested connection failures with sctp was several years ago, but I recall seeing similar problems. I had

Re: [ClusterLabs] Antw: Re: [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Gang He
>>> "Ulrich Windl" 03/08/18 7:24 PM >>> Hi! What surprises me most is that a connect(...O_NONBLOCK) actually blocks: EINPROGRESS The socket is non-blocking and the connection cannot be com- pleted immediately. Yes, the

[ClusterLabs] Antw: Re: [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Ulrich Windl
Hi! What surprises me most is that a connect(...O_NONBLOCK) actually blocks: EINPROGRESS The socket is non-blocking and the connection cannot be com- pleted immediately. Regards, Ulrich >>> "Gang He" schrieb am 08.03.2018 um 10:48 in Nachricht

Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Gang He
Hi Feldhost, I use active rrp_mode in corosync.conf and reboot the cluster to let the configuration effective. But, the about 5 mins hang in new_lockspace() function is still here. Thanks Gang >>> > Hi, so try to use active mode. > >

Re: [ClusterLabs] 答复: 答复: 答复: How to configure to make each slave resource has one VIP

2018-03-08 Thread Jehan-Guillaume de Rorthais
On Thu, 8 Mar 2018 01:45:43 + 范国腾 wrote: > Sorry, Rorthais, I have thought that the link and the attachment was the same > document yesterday. No problem. For your information, I merged the draft in the official documentation yesterday. > I just read the attachment

Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread FeldHost™ Admin
Hi, so try to use active mode. https://www.suse.com/documentation/sle_ha/book_sleha/data/sec_ha_installation_terms.html That fixes I saw in 4.14.* > On 8 Mar 2018, at 09:12, Gang He wrote: > > Hi Feldhost, > > >> Hello Gang He, >> >> which type of corosync rrp_mode you

Re: [ClusterLabs] [Cluster-devel] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread Gang He
Hi Feldhost, >>> > Hello Gang He, > > which type of corosync rrp_mode you use? Passive or Active? clvm1:/etc/corosync # cat corosync.conf | grep rrp_mode rrp_mode: passive Did you try test both? No, only this mode. Also, what kernel version you use? I see some SCTP fixes in

Re: [ClusterLabs] DLM connection channel switch take too long time (> 5mins)

2018-03-08 Thread FeldHost™ Admin
Hello Gang He, which type of corosync rrp_mode you use? Passive or Active? Did you try test both? Also, what kernel version you use? I see some SCTP fixes in latest kernels. > On 8 Mar 2018, at 08:52, Gang He wrote: > > Hello list and David Teigland, > > I got a problem under