[ClusterLabs Developers] MariaDB resource-agent - help with choosing a master

Nils Carlson Tue, 14 Feb 2017 12:53:04 -0800

Hi,

I'm working on implementing a MariaDB resource-agent based on the mysql one.

The idea is to take advantage of new features in MariaDB, especially semi-synchronous replication and GTID.

GTID (Global Transaction ID) means that there is a counter that applies to the replicated databases, which is unique within the cluster (there can be multiple replication clusters with overlapping IDs).

Semi-synchronous replication means that the master will replicate synchronously to AT LEAST ONE slave before actually performing the transaction. In theory there can be no data loss due to a single node failure, a big improvement compared to the normal async replication in MariaDB.

These two sets of technologies should allow for quite a straightforward set of semantics in the resource-agent. On master failure, the node with the highest GTID must be the one that was replicating synchronously, and should be promoted to be the new master. The question is how to relay the information to crmd.
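
As a minimal sketch of the "highest GTID wins" rule, the Python below parses MariaDB-style GTIDs of the form domain-server_id-sequence and picks the node with the largest sequence number. It assumes a single replication domain and one GTID per node; the node names and the function names are hypothetical, and this is an illustration rather than resource-agent code.

```python
def parse_gtid(gtid):
    """Split a MariaDB GTID 'domain-server_id-sequence' into three ints."""
    domain, server_id, seq = (int(part) for part in gtid.strip().split("-"))
    return domain, server_id, seq


def pick_promotion_candidate(gtids_by_node):
    """Return the node whose GTID has the highest sequence number.

    Assumption: all nodes report a GTID from the same replication domain.
    Ties go to the lexicographically smallest node name, so every node
    running this comparison reaches the same answer.
    """
    parsed = {node: parse_gtid(g) for node, g in gtids_by_node.items()}
    domains = {domain for domain, _, _ in parsed.values()}
    if len(domains) != 1:
        raise ValueError("this sketch only handles a single replication domain")
    # max() keeps the first maximum it sees, so iterate in sorted name order.
    return max(sorted(parsed), key=lambda node: parsed[node][2])


# Hypothetical three-node cluster.
candidate = pick_promotion_candidate(
    {"node1": "0-1-100", "node2": "0-1-105", "node3": "0-1-99"}
)
```

With these inputs the candidate is node2, the node holding sequence number 105.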

My current working hypothesis is that I can place the GTID as a crm-attribute both when starting the resource-agent and in a post-demote notify. During the subsequent monitor operation the resource-agents can then scan the crm-attributes from other nodes and simply prioritise themselves in relation to others (some relative scoring?).
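
One possible shape for the relative scoring, sketched in Python under the assumption that each node already knows its own GTID sequence number and has read the peers' values from their attributes. The function name and the `top` and `step` values are hypothetical; a real agent would presumably pass the result on to the cluster, for example through crm_master.

```python
def promotion_score(own_seq, peer_seqs, top=100, step=10):
    """Score this node relative to its peers.

    The node starts at `top` and loses `step` for every peer whose GTID
    sequence number is strictly ahead of its own. The floor of 1 keeps
    the node eligible for promotion even when many peers are ahead.
    """
    peers_ahead = sum(1 for seq in peer_seqs if seq > own_seq)
    return max(1, top - step * peers_ahead)


# Hypothetical cluster with sequence numbers 105, 100 and 99.
score_a = promotion_score(105, [100, 99])
score_b = promotion_score(100, [105, 99])
score_c = promotion_score(99, [105, 100])
```

Nodes with identical sequence numbers get identical scores here, so a tie-break would still be needed if that case can occur.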


This requires a few things though:

- If there is no master when the resource agent starts (i.e. the cluster is just starting), we need to wait for all nodes to come online before promoting any to master, so they can read GTID from the attributes.
- There must be a monitor step after start and demote and before the promotion of any resource to master, and this must execute on all nodes so they can set their priority for promotion.
- The post-demote notifier must complete execution before a node can start the monitor operation. I THINK that it is ok for not all nodes to have completed the post-demote notifier before the monitor operation starts. Probably this can work by creating a sparse priority distribution, i.e. the first node to execute monitor sets a priority of 100, the next one down 90, the next one in the middle 95, based on the number of nodes etc.
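
The sparse priority distribution in the last point could be sketched as follows. This Python is an illustration only: it assumes each node can see the (GTID sequence number, priority) pairs that earlier nodes have already published, and the function name and the `first` and `gap` values are hypothetical.

```python
def sparse_priority(new_seq, assigned, first=100, gap=10):
    """Pick a priority for a node that runs monitor after some others.

    `assigned` is a list of (seq, priority) pairs already published by
    nodes that ran monitor earlier. Priorities are assumed to increase
    with seq, so the new node is slotted in at the matching position.
    """
    if not assigned:
        return first
    lower = [prio for seq, prio in assigned if seq <= new_seq]
    higher = [prio for seq, prio in assigned if seq > new_seq]
    if not higher:
        return max(lower) + gap   # ahead of every node seen so far
    if not lower:
        return min(higher) - gap  # behind every node seen so far
    # Between two neighbours: take the midpoint of their priorities.
    return (max(lower) + min(higher)) // 2


# The order from the example above: 100 first, then 90, then 95.
p1 = sparse_priority(105, [])
p2 = sparse_priority(99, [(105, p1)])
p3 = sparse_priority(100, [(105, p1), (99, p2)])
```

The midpoint can collide with a neighbour once the gap between two adjacent priorities shrinks to 1, so the initial gap would have to be chosen with the number of nodes in mind.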

I hope this doesn't sound too tangled. I will try this out, but I can't find any clear documentation on the ordering and completion of start, notifiers, monitor and promote operations as well as master selection, so all pointers are very much welcome.


Completely alternative suggestions are also very much welcome.

Thanks for any and all assistance,
Nils


_______________________________________________
Developers mailing list
[email protected]
http://lists.clusterlabs.org/mailman/listinfo/developers
