Tompkins Neil wrote:
Hi
We are looking to upgrade to the latest version of MySQL 5.  One of the
main features we are thinking about using is replication for our website
data.  Basically we have two websites, located in the UK and US, which
share similar information, and we plan to use replication to keep the
data up to date and in sync.

Based on your experiences, is there anything we should be aware of before
investigating this route and putting it into practice?

I've had to take servers out for bad RAID controllers, bad RAM, bad mobos; disks have been the least of my problems. So make sure your architecture tolerates taking members of your pool out without load-spiking the remaining members. And if you're doing filesystem snapshots from a master to a replicant, you'll need either a policy or extra servers available to maintain your uptime, because you have to interrupt the master to flush all the tables, sync the filesystem, and take an LVM snapshot; InnoDB would require a shutdown. Don't forget that LVM snapshots are copy-on-write, so when that master comes back up and starts processing queries that modify tables, you'll see amazing system load on a busy box as the filesystem starts madly copying extents into the snapshot volume.
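
For MyISAM, the interrupt-the-master dance is short but the lock has to be held the whole time. A minimal sketch (volume names and sizes are examples, assuming the datadir sits on an LVM logical volume):

  mysql> FLUSH TABLES WITH READ LOCK;
  mysql> SHOW MASTER STATUS;    -- note File/Position if you're seeding a replicant
  mysql> system sync
  mysql> system lvcreate --snapshot --size 5G --name mysql-snap /dev/vg00/mysql
  mysql> UNLOCK TABLES;

The "system" lines run from inside the mysql client precisely so the read lock is still held while the snapshot is created.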

Define a procedure for junior staff on how to properly take a pool member down and bring it back up. Like, if you get a disk-full on one member and it borks replication, what's the step-by-step for (a) determining whether replication can re-establish after you do a FLUSH LOGS, and (b) deciding under what conditions you have to re-copy all the data from one master to another because your replication window has expired and the logs have been purged. Replication binlogs get really big if you're regularly pushing large materialized views through replication; if your servers have fast disks but not enough space to hold more than a weekend, or even a whole day, of binlogs, that's all the neglect it takes.
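
The triage usually starts with SHOW SLAVE STATUS; a sketch of the two outcomes (log file name and positions are examples):

  mysql> SHOW SLAVE STATUS\G    -- check Slave_IO_Running, Slave_SQL_Running, Last_Error
  mysql> START SLAVE;           -- often enough once you've freed the disk
  -- but if the master has already purged the binlog the slave needs
  -- (the "Got fatal error 1236 from master" case), you're re-copying
  -- the data and re-pointing the slave at the new baseline:
  mysql> CHANGE MASTER TO MASTER_LOG_FILE='mysql-bin.000123', MASTER_LOG_POS=4;
  mysql> START SLAVE;

Setting expire_logs_days in my.cnf at least makes your replication window explicit instead of whenever-the-disk-fills.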

Define a procedure for checking your my.cnf files for correct auto-increment-* and server-id settings. Junior staff, and even senior staff, rarely add members to the pool, so these settings often get botched during a midnight maintenance hour. A procedure for adding members and changing master replication settings is very important; often your DBA is not the one racking and cabling the equipment.
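
For a two-site UK/US pair, if you end up doing master-master, the interlocking settings look roughly like this (values are examples; the point is unique server-ids and non-colliding auto-increments):

  # my.cnf on the UK box
  [mysqld]
  server-id                = 1
  auto_increment_increment = 2   # total number of masters
  auto_increment_offset    = 1   # this box's slot, 1..increment

  # my.cnf on the US box
  [mysqld]
  server-id                = 2
  auto_increment_increment = 2
  auto_increment_offset    = 2

One side hands out odd ids and the other even, so inserts can't collide.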

Make sure you have a good understanding of what kind of capacity you're growing at. I started a project with two four-core boxes with plenty of 15krpm disk, and when they got into production they regularly spiked to load 20 and 30. Not pretty. My old architecture had been refusing traffic to lighten the load, which masked the real demand; my new architecture didn't refuse anything. My data set was growing so fast that the sort-buffer settings tuned for the old servers were too small for the new ones. I ended up with four DL380s with 8 cores per box, and I really had to scramble to get the extra servers in there. The addition of two more read-only members really helped, with backups handled by replication to an off-site replicant.
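
If you add read-only members, it's worth enforcing that in my.cnf so a misconfigured web node can't write to them. A sketch (values are examples):

  [mysqld]
  server-id        = 3
  read_only        = 1    # the slave SQL thread and SUPER users can still write
  sort_buffer_size = 4M   # allocated per connection, so size it against max_connections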

Another load-capacity warning: if your traffic is very spiky and you get high-load conditions, I've seen reset/dropped connections and also plain old connection timeouts. So even if you have RAM for 1024 connections, you probably can't service 1024 connections once you've got table contention, and connections from your web nodes just start failing. If they fail for too long, you have to do a FLUSH HOSTS to reset the connection-attempt counters.
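
Concretely, once a web node racks up max_connect_errors failed attempts, mysqld blocks that host outright until you flush. A sketch (hostname and values are examples):

  -- web nodes start seeing:
  --   Host 'web01' is blocked because of many connection errors
  mysql> FLUSH HOSTS;

  # the my.cnf ceilings involved:
  max_connections    = 1024
  max_connect_errors = 10000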

I don't know what your application does, but I certainly monitor replication lag; load spikes can certainly increase lag. I've had to move from a single instance of mysqld to mysqld_multi and separate databases by replication rate. Your monitoring should also track the slave SQL and I/O threads. You might need to define a procedure for pooling-out members that fall too far behind in replication.
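
A crude cron-able lag check, as a sketch (host, threshold, and address are examples):

  #!/bin/sh
  LAG=$(mysql -h replica1 -e 'SHOW SLAVE STATUS\G' | awk '/Seconds_Behind_Master/ {print $2}')
  if [ "$LAG" = "NULL" ] || [ "$LAG" -gt 300 ]; then
      echo "replica1 behind by $LAG seconds" | mail -s "replication lag" oncall@example.com
  fi

Seconds_Behind_Master goes NULL when the SQL thread has died, which is exactly when you want the page.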

I've written an iptables script to block web-node connections but allow SQL pool-member connections. I use this to take a member out to run table repairs, or to lighten the load while it does replication catch-up.
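
The core of it is just a chain on port 3306, roughly (addresses are examples):

  iptables -N mysql-maint
  iptables -A INPUT -p tcp --dport 3306 -j mysql-maint
  iptables -A mysql-maint -s 10.0.1.0/24 -j ACCEPT                  # sql pool members
  iptables -A mysql-maint -p tcp -j REJECT --reject-with tcp-reset  # everyone else

The tcp-reset makes the web nodes fail fast instead of hanging in connect timeouts.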

WAN connectivity for replication is interesting! I did the site-to-site transfer using stunnel, and had to negotiate weird Cisco 5502 VPN behavior. Copying gigs of MyISAM files between sites would knock over my VPN, so I had to rate-limit with rsync --bwlimit. The bursting bandwidth charges were still brutal, though. Later we ended up configuring CBQ (search freshmeat.net for cbq-init) on my backup replicant to limit bandwidth so it wouldn't provoke bursting charges.
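
The rate-limited copy is just this (paths and the limit are examples; --bwlimit is in KB/s, so 2000 is about 16 Mbit/s sustained):

  rsync -av --bwlimit=2000 /var/lib/mysql/mydb/ replica.example.com:/var/lib/mysql/mydb/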

Jed

