Re: [ClusterLabs] Replicated PGSQL woes

2016-10-14 Thread Jehan-Guillaume de Rorthais
On Fri, 14 Oct 2016 08:10:08 -0800 Israel Brewster wrote: > On Oct 14, 2016, at 1:39 AM, Jehan-Guillaume de Rorthais > wrote: > > > > On Thu, 13 Oct 2016 14:11:06 -0800 > > Israel Brewster wrote: [...] > > I **guess** if you

Re: [ClusterLabs] Can't do anything right; how do I start over?

2016-10-14 Thread Dimitri Maziuk
On 10/14/2016 02:48 PM, Jay Scott wrote: > When I "start over" I stop all the services, delete the packages, > empty the configs and logs as best I know how. But this doesn't > completely clear everything: the drbd metadata is evidently still > on the partitions I've set aside for it. If it's

[ClusterLabs] Trouble with drbd/pacemaker: switch to secondary/secondary

2016-10-14 Thread Anne Nicolas
Hi! I'm having trouble with a two-node cluster used for DRBD / Apache / Samba and some other services. Whatever I do, it always goes to the following state: Last updated: Fri Oct 14 17:41:38 2016 Last change: Thu Oct 13 10:42:29 2016 via cibadmin on bzvairsvr Stack: corosync Current DC:

Re: [ClusterLabs] Can't do anything right; how do I start over?

2016-10-14 Thread Ken Gaillot
On 10/14/2016 02:48 PM, Jay Scott wrote: > I've been trying a lot of things from the introductory manual. > I have updated the instructions (on my hardcopy) to the versions > of corosync etc. that I'm using. I can't get hardly anything to > work reliably beyond the ClusterIP. > > So I start over

Re: [ClusterLabs] Antw: Trying this question again re: arp_interval

2016-10-14 Thread Ken Gaillot
On 10/14/2016 03:36 AM, Ulrich Windl wrote: Eric Robinson wrote on 14.10.2016 at 09:15 in > message > > >> Does anyone know how many arp_intervals must pass without a reply before

Re: [ClusterLabs] Antw: Re: Antw: Unexpected Resource movement after failover

2016-10-14 Thread Ken Gaillot
On 10/14/2016 06:56 AM, Nikhil Utane wrote: > Hi, > > Thank you for the responses so far. > I added reverse colocation as well. However, I am seeing another issue in > resource movement that I am analyzing. > > Thinking further on this, why doesn't "a not with b" imply "b > not with a"?

Re: [ClusterLabs] Replicated PGSQL woes [solved]

2016-10-14 Thread Israel Brewster
> > On Oct 14, 2016, at 12:30 AM, Keisuke MORI > wrote: >> >> 2016-10-14 2:04 GMT+09:00 Israel Brewster > >: >>> Summary: Two-node cluster setup with latest pgsql resource

Re: [ClusterLabs] Antw: Replicated PGSQL woes

2016-10-14 Thread Israel Brewster
On Oct 13, 2016, at 11:36 PM, Ulrich Windl wrote: > Israel Brewster wrote on 13.10.2016 at 19:04 in > message <34091524-d35e-4e28-9c3e-dda6c6a1e...@ravnalaska.net>: > [...] >> Oct 13 08:29:39 CentTest1 crmd[30096]: notice:

Re: [ClusterLabs] Replicated PGSQL woes

2016-10-14 Thread Israel Brewster
On Oct 14, 2016, at 12:30 AM, Keisuke MORI wrote: > > 2016-10-14 2:04 GMT+09:00 Israel Brewster : >> Summary: Two-node cluster setup with latest pgsql resource agent. Postgresql >> starts initially, but failover never happens. > >> Oct 13

Re: [ClusterLabs] Replicated PGSQL woes

2016-10-14 Thread Israel Brewster
On Oct 14, 2016, at 1:39 AM, Jehan-Guillaume de Rorthais wrote: > > On Thu, 13 Oct 2016 14:11:06 -0800 > Israel Brewster wrote: > >> On Oct 13, 2016, at 1:56 PM, Jehan-Guillaume de Rorthais >> wrote: >>> >>> On Thu, 13 Oct 2016

Re: [ClusterLabs] Antw: Re: Antw: Unexpected Resource movement after failover

2016-10-14 Thread Nikhil Utane
I feel the behavior has become worse after adding the reverse colocation constraint. I started with this, and it was all I wanted it to be: cu_5 <-> Redund_CU1_WB30, cu_4 <-> Redund_CU2_WB30, cu_3 <-> Redund_CU3_WB30, cu_2 <-> Redund_CU5_WB30. However, for some reason Pacemaker decided to move cu_2 from

Re: [ClusterLabs] Antw: Re: Antw: Unexpected Resource movement after failover

2016-10-14 Thread Nikhil Utane
Hi, Thank you for the responses so far. I added reverse colocation as well. However, I am seeing another issue in resource movement that I am analyzing. Thinking further on this, why doesn't "a not with b" imply "b not with a"? Wouldn't putting "b with a" violate "a not with b"? Can

Re: [ClusterLabs] Antw: Re: Replicated PGSQL woes

2016-10-14 Thread Jehan-Guillaume de Rorthais
On Fri, 14 Oct 2016 09:59:04 +0200 "Ulrich Windl" wrote: > >>> Jehan-Guillaume de Rorthais wrote on 13.10.2016 at > >>> 23:56 in > message <20161013235606.007018eb@firost>: > > [...] > > As far as I know, the pgsql resource agent creates

Re: [ClusterLabs] Replicated PGSQL woes

2016-10-14 Thread Jehan-Guillaume de Rorthais
On Thu, 13 Oct 2016 14:11:06 -0800 Israel Brewster wrote: > On Oct 13, 2016, at 1:56 PM, Jehan-Guillaume de Rorthais > wrote: > > > > On Thu, 13 Oct 2016 10:05:33 -0800 > > Israel Brewster wrote: > > > >> On Oct 13, 2016, at

Re: [ClusterLabs] Antw: Re: Antw: Re: Antw: Re: When the DC crmd is frozen, cluster decisions are delayed infinitely

2016-10-14 Thread renayama19661014
Hi Klaus, Hi All, I tried a prototype of a watchdog using the WD service: https://github.com/HideoYamauchi/pacemaker/commit/3ee97b76e0212b1790226864dfcacd1a327dbcc9 Please comment. Best Regards, Hideo Yamauchi. - Original Message - > From: "renayama19661...@ybb.ne.jp"

[ClusterLabs] Antw: Trying this question again re: arp_interval

2016-10-14 Thread Ulrich Windl
>>> Eric Robinson wrote on 14.10.2016 at 09:15 in message > Does anyone know how many arp_intervals must pass without a reply before the > bonding driver downs the primary NIC? Just

Re: [ClusterLabs] Antw: Replicated PGSQL woes

2016-10-14 Thread Keisuke MORI
2016-10-14 16:36 GMT+09:00 Ulrich Windl : Israel Brewster wrote on 13.10.2016 at 19:04 in > message <34091524-d35e-4e28-9c3e-dda6c6a1e...@ravnalaska.net>: > [...] >> Oct 13 08:29:39 CentTest1 crmd[30096]: notice: State

Re: [ClusterLabs] Replicated PGSQL woes

2016-10-14 Thread Keisuke MORI
2016-10-14 2:04 GMT+09:00 Israel Brewster : > Summary: Two-node cluster setup with latest pgsql resource agent. Postgresql > starts initially, but failover never happens. > Oct 13 08:29:47 CentTest1 pgsql(pgsql_96)[19602]: INFO: Master does not > exist. > Oct 13 08:29:47

[ClusterLabs] Antw: Re: Replicated PGSQL woes

2016-10-14 Thread Ulrich Windl
>>> Jehan-Guillaume de Rorthais wrote on 13.10.2016 at >>> 23:56 in message <20161013235606.007018eb@firost>: [...] > As far as I know, the pgsql resource agent creates such a lock file on > promote > and deletes it on graceful stop. If the PostgreSQL instance couldn't be >

[ClusterLabs] Antw: Re: cross DC cluster using public ip?

2016-10-14 Thread Ulrich Windl
Hi! The misconception that a "node" is an "IP address" originates from the times when each Internet host had one IP address. Hardware was considered to be so expensive that nobody would ever afford a second NIC ;-) Today it's different: No system should try to derive a node id from an ID of

Re: [ClusterLabs] Antw: Re: Antw: Unexpected Resource movement after failover

2016-10-14 Thread Vladislav Bogdanov
On October 14, 2016 10:13:17 AM GMT+03:00, Ulrich Windl wrote: Nikhil Utane wrote on 13.10.2016 at >16:43 in >message >: >> Ulrich, >> >> I have 4

[ClusterLabs] Antw: Replicated PGSQL woes

2016-10-14 Thread Ulrich Windl
>>> Israel Brewster wrote on 13.10.2016 at 19:04 in message <34091524-d35e-4e28-9c3e-dda6c6a1e...@ravnalaska.net>: [...] > Oct 13 08:29:39 CentTest1 crmd[30096]: notice: State transition S_IDLE -> > S_POLICY_ENGINE [ input=I_PE_CALC cause=C_FSA_INTERNAL >

[ClusterLabs] Antw: Re: Antw: Re: Antw: Re: OCFS2 on cLVM with node waiting for fencing timeout

2016-10-14 Thread Ulrich Windl
>>> Ken Gaillot wrote on 13.10.2016 at 16:49 in >>> message <97fdafc7-7efe-41d8-99fa-20abb2050...@redhat.com>: > On 10/13/2016 03:36 AM, Ulrich Windl wrote: >> That's what I'm talking about: If 1 of 3 nodes is rebooting (or the cluster > is split-brain 1:2), the single

[ClusterLabs] Trying this question again re: arp_interval

2016-10-14 Thread Eric Robinson
Does anyone know how many arp_intervals must pass without a reply before the bonding driver downs the primary NIC? Just one? -- Eric Robinson ___ Users mailing list: Users@clusterlabs.org http://clusterlabs.org/mailman/listinfo/users Project Home: