Re: [Linux-HA] unable to recover from split-brain in a two-node cluster

2014-06-20 Thread fank
Thanks, Digimer. This is an existing setup so I'm stuck with them. Currently my workaround is to increase the dead time so it won't flap and cause all these issues. Best, -Kaiwei - Original Message - From: "Digimer" To: "General Linux-HA mailing list" Sent: Friday, June 20, 2014 4:19:

Re: [Linux-HA] unable to recover from split-brain in a two-node cluster

2014-06-20 Thread Digimer
On 20/06/14 03:18 PM, f...@vmware.com wrote: Hi, New to this list and hope I can get some help here. I'm using pacemaker 1.0.10 and heartbeat 3.0.5 for a two-node cluster. I'm having split-brain problem when heartbeat messages sometimes get dropped when system is under high load. However the

[Linux-HA] unable to recover from split-brain in a two-node cluster

2014-06-20 Thread fank
Hi, New to this list and hope I can get some help here. I'm using pacemaker 1.0.10 and heartbeat 3.0.5 for a two-node cluster. I'm having split-brain problem when heartbeat messages sometimes get dropped when system is under high load. However the problem is it never recover back when system l

Re: [Linux-HA] Hawk error message on CentOS 6.5

2014-06-20 Thread Dejan Muhamedagic
Hi, On Fri, Jun 20, 2014 at 10:43:55AM +0200, Bart Coninckx wrote: > > On 05 Jun 2014, at 11:17, Bart Coninckx wrote: > > > > > On 04 Jun 2014, at 12:18, Kristoffer Grönlund wrote: > > > >> On Wed, 4 Jun 2014 12:08:13 +0200 > >> Bart Coninckx wrote: > >> > >>> > >>> I just found another i

Re: [Linux-HA] Hawk error message on CentOS 6.5

2014-06-20 Thread Bart Coninckx
On 05 Jun 2014, at 11:17, Bart Coninckx wrote: > > On 04 Jun 2014, at 12:18, Kristoffer Grönlund wrote: > >> On Wed, 4 Jun 2014 12:08:13 +0200 >> Bart Coninckx wrote: >> >>> >>> I just found another issue while clicking the history explorer: it >>> reports: /usr/sbin/hb_report: No such fil