Re: [HACKERS] Synchronous Standalone Master Redoux

Shaun Thomas Wed, 11 Jul 2012 06:42:34 -0700

On 07/10/2012 06:02 PM, Daniel Farina wrote:

For example, what if DRBD can only complete one page per second for
some reason?  Does it it simply have the primary wait at this glacial
pace, or drop synchronous replication and go degraded?  Or does it do
something more clever than just a timeout?

That's a good question, and way beyond what I know about the internals.:) In practice though, there are configurable thresholds, and ifexceeded, it will invalidate the secondary. When using Pacemaker, we'veactually had instances where the 10G link we had between the serversdied, so each node thought the other was down. That lead to thesecondary node self-promoting and trying to steal the VIP from theprimary. Throw in a gratuitous arp, and you get a huge mess.

That lead to what DRBD calls split-brain, because both nodes wererunning and writing to the block device. Thankfully, you can actuallytell one node to discard its changes and re-subscribe. Doing that willreplay the transactions from the "good" node on the "bad" one. And eventhen, it's a good idea to run an online verify to do a block-by-blockchecksum and correct any differences.

Of course, all of that's only possible because it's a block-levelreplication. I can't even imagine PG doing anything like that. It wouldhave to know the last good transaction from the primary and do animplied PIT recovery to reach that state, then re-attach for sync commits.

Regardless of what DRBD does, I think the problem with the
async/sync duality as-is is there is no nice way to manage exposure
to transaction loss under various situations and requirements.

Which would be handy. With synchronous commits, it's given that theprotocol is bi-directional. Then again, PG can detect when clientsdisconnect the instant they do so, and having such an event implicitlydisable synchronous_standby_names until reconnect would be an easy fix.The database already keeps transaction logs, so replaying would stillhappen on re-attach. It could easily throw a warning for everysync-required commit so long as it's in "degraded" mode. Those alone arevery small changes that don't really harm the intent of sync commit.

That's basically what a RAID-1 does, and people have been fine with thatfor decades.


--
Shaun Thomas
OptionsHouse | 141 W. Jackson Blvd. | Suite 500 | Chicago IL, 60604
312-444-8534
stho...@optionshouse.com



______________________________________________

See http://www.peak6.com/email_disclaimer/ for terms and conditions related to 
this email

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Synchronous Standalone Master Redoux

Reply via email to