Hi,

Jens-Wolfhard Schicke wrote:
* Does WAL get forced to disk on primary at commit time?
* Does WAL get forced across link to standby at commit time?
* Does WAL get forced to disk on standby at commit time?
* Does WAL get applied [and synced] to disk on standby at commit time?
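For illustration, those four levels could be collapsed into a single per-transaction knob; here's a sketch in SQL, where the setting name and value names are purely illustrative, not an existing interface:

```sql
-- Hypothetical per-transaction setting; names are illustrative only.
-- Each value answers "yes" to the corresponding question above,
-- and implicitly to all the questions before it.
SET synchronous_commit = local;         -- 1: WAL fsynced on primary only
SET synchronous_commit = remote_ship;   -- 2: WAL also shipped to standby
SET synchronous_commit = remote_fsync;  -- 3: WAL also fsynced on standby
SET synchronous_commit = remote_apply;  -- 4: WAL also applied on standby
```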

I think that's what Simon means by his question no. 3; it wouldn't make much sense to me otherwise.

I'm assuming the standby node has its own physical format, so the changes from the remote WAL need to be transformed into a local WAL, which then needs to be written to disk. For Postgres, this pretty much means applying the changes and committing them. You never need to store the remote WAL on physical storage; what would that be good for?

I think that questions 2 and 3 are trivially bundled together. Once the
user can specify 2, implementing 3 should be trivial and vice versa.

That might well be, yes. The code to collect changes from a transaction and then apply them remotely is pretty much the same, no matter when it is being executed. But it certainly makes a difference in the balance between performance and availability, which is a decision the user should be able to make for his specific application (or even better, per transaction, as proposed here and in Postgres-R).

I am not even convinced that these need to be two different parameters.

Consider a standby heavily loaded (i/o) with some OLAP queries. Why should the master wait until the standby has written anything to disk for him?

Also please note that an answer of "yes" to 3 means that 2 must also
be answered "yes".

Agreed. There's no 'AS' mode possible, only 'SS', 'SA' and 'AA'.

How about creating named modes? This would give the user the ability to define more fine-grained control, especially in larger clusters of fail-over/read-only servers, without totally clogging the parameter space and application code. Whether this should be done SQL-style or in some config file is not so clear to me, although I'd prefer SQL-style, like:

CREATE SYNCHRONIZING MODE immediate_readonly AS
  LOCAL        SYNCHRONOUS APPLY
  192.168.0.10 SYNCHRONOUS APPLY        -- read-only slave
  192.168.0.11 SYNCHRONOUS APPLY        -- read-only slave
  192.168.0.20 SYNCHRONOUS SHIP         -- backup-server
  192.168.0.21 SYNCHRONOUS SHIP         -- backup-server
  192.168.0.30 SYNCHRONOUS FSYNC        -- backup-server with fast disks
;

Hm, that's an interesting idea, especially considering the number of options that arise with more than two or three nodes, where you may also want to specify how many nodes must have written the changes to disk before confirming the commit.
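Such a quorum requirement might be written as an extension of the named-mode syntax above; purely a sketch, and the ANY k (...) clause is my invention here, not part of the proposal:

```sql
-- Hypothetical extension of the named-mode syntax: the commit
-- confirms once any two of the listed standbys have fsynced the
-- changes, regardless of which two.
CREATE SYNCHRONIZING MODE quorum_two AS
  LOCAL SYNCHRONOUS FSYNC
  ANY 2 (192.168.0.10, 192.168.0.11, 192.168.0.20) SYNCHRONOUS FSYNC
;
```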

In Postgres-R, I've added a TRANSACTION REPLICATION LEVEL, which can be either SYNC, EAGER or LAZY. Maybe that's not quite sufficient. On the other hand, I don't think any other option here makes any sense. (Above, you yourself doubt that sync is different enough from eager).
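Per transaction, that looks roughly like the following; a sketch only, the exact grammar in Postgres-R may differ:

```sql
-- LAZY: confirm the commit locally, replicate the changes afterwards;
-- the other levels (SYNC, EAGER) would be set the same way.
BEGIN;
SET TRANSACTION REPLICATION LEVEL LAZY;
UPDATE accounts SET balance = balance - 100 WHERE id = 42;
COMMIT;
```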

Regards

Markus


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers