Re: [ClusterLabs] BUG in crmd/ccm membership handling when crmd registers with cluster a bit late

2015-11-28 Thread Shyam
pacemaker-1.1.6? At this point it will be hard for us to shift to corosync & will be better to continue with heartbeat. So your inputs are greatly appreciated. Thank you! --Shyam > Date: Fri, 27 Nov 2015 12:58:15 +0100 > From: Lars Ellenberg > To: users@clusterlabs.org > Sub

[ClusterLabs] BUG in crmd/ccm membership handling when crmd registers with cluster a bit late

2015-11-26 Thread Shyam
rmd_ha_msg_callback: Ignoring HA message (op=join_announce) from node1: not in our membership list (size=1) --Shyam On Mon, Nov 23, 2015 at 9:30 PM, Shyam wrote: > One note on this. > > This problem doesnt happen if > > Nov 19 08:36:30 node1 crmd[3298]: notice: crmd_client_stat

Re: [ClusterLabs] Timing issue with CRMD on one node not joining cluster/remains in S_PENDING

2015-11-23 Thread Shyam
all hosts for: master-MYSQL:0 (100) Nov 19 08:36:41 node1 pengine[3299]: notice: unpack_config: On loss of CCM Quorum: Ignore Nov 19 08:36:41 node1 pengine[3299]: notice: LogActions: Start IPADDR:1#011(node2) Any help/pointers greatly apprecited. Thanks. --Shyam On Mon, Nov 23, 2015 at 12:14

[ClusterLabs] Timing issue with CRMD on one node not joining cluster/remains in S_PENDING

2015-11-22 Thread Shyam
roblem. Can anyone suggest if this issue has already been fixed in latest pacemaker or any other suggestions how to debug this issue? If I enable higher debug level (both in heartbeat/pacemaker), this problem doesnt show up. Any help/pointers on how to go forwa

[ClusterLabs] heartbeat/pacemaker: running without DC when having a split-brain like situation due to linux crash

2015-11-11 Thread Shyam
uery 74: Requesting the current CIB: S_POLICY_ENGINE Nov 9 17:29:32 vm-1 cib: [2474]: info: cib_process_request: Operation complete: op cib_modify for section nodes (origin=local/crmd/70, version=1.30.183): ok (rc=0) Nov 9 17:29:32 vm-1 cib: [2474]: info: cib_pr

Re: [ClusterLabs] heartbeat/pacemaker: stonith-ng does a broadcast that doesnt reach it back resulting stonith_async_timeout_handler() called

2015-10-15 Thread Shyam
Hi Dejan, Thanks a lot for your input! I cherry picked this commit & this solves the problem. I will raise a ubuntu launchpad bug for them to pull this correction in trusty stable. Thanks! --Shyam On Thu, Oct 15, 2015 at 1:21 PM, Dejan Muhamedagic wrote: > Hi, > > On Wed, Oct 1

[ClusterLabs] heartbeat/pacemaker: stonith-ng does a broadcast that doesnt reach it back resulting stonith_async_timeout_handler() called

2015-10-14 Thread Shyam
s. eventually crmd gives up on timeout Oct 14 14:59:48 node0 crmd[14483]:error: stonith_async_timeout_handler: Async call 2 timed out after 168000ms Thanks. --Shyam ___ Users mailing list: Users@clusterlabs.org http://clusterlabs.org/mailman/listinfo/users P