Hi,

Bill Moran wrote:
> I'm curious as to how Postgres-R would handle a situation where the
> constant throughput exceeded the processing speed of one of the nodes.

Well, what do you expect to happen? This case is easily detectable, but I can only see two possible solutions: either stop the node which is too slow, or stop accepting new transactions for a while.
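
For illustration, here is a minimal sketch of those two reactions in Python (not actual Postgres-R code; every name, threshold and structure in it is an assumption of mine, not taken from the implementation):

MAX_LAG = 1000  # writesets a node may fall behind before we react

class Node:
    def __init__(self, name):
        self.name = name
        self.applied = 0  # writesets this node has applied so far

class Cluster:
    def __init__(self, nodes, policy="throttle"):
        self.nodes = nodes
        self.policy = policy  # "throttle" or "evict"
        self.delivered = 0    # writesets delivered in total order

    def lag(self, node):
        return self.delivered - node.applied

    def admit(self):
        """Decide whether a new transaction may enter the system."""
        slow = [n for n in self.nodes if self.lag(n) > MAX_LAG]
        if not slow:
            self.delivered += 1
            return True
        if self.policy == "evict":
            # Option 1: drop the node which is too slow; the rest go on.
            for n in slow:
                self.nodes.remove(n)
            self.delivered += 1
            return True
        # Option 2: stop accepting new transactions for a while, until
        # the slow node has caught up again.
        return False

The throttle policy keeps every replica at the cost of write availability; the evict policy keeps write availability at the cost of a replica.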

This technique is not meant to let nodes lag behind by several thousands of transactions - that is better avoided. Rather, it's meant to decrease the commit delay necessary for synchronous replication.
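
To put rough numbers on that, a toy comparison (the timings below are made-up assumptions, not Postgres-R measurements):

ORDER_DELAY = 2    # ms until the group communication fixes the total order
APPLY_DELAY = 15   # ms until the slowest replica has applied the writeset

def commit_delay(wait_for_remote_apply):
    if wait_for_remote_apply:
        # Fully synchronous apply: the commit waits for every replica.
        return ORDER_DELAY + APPLY_DELAY
    # Commit as soon as the total order is fixed; replicas apply
    # asynchronously - the bounded lag is the price for this.
    return ORDER_DELAY

print(commit_delay(True))   # 17 (ms)
print(commit_delay(False))  # 2 (ms)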

> I can see your system working if it's just spike loads and the slow
> nodes can catch up during slow periods, but I'm wondering about the
> scenarios where an admin has underestimated the hardware requirements
> and one or more nodes is unable to keep up.

Please keep in mind that replication per se does not speed up your database; rather, it adds a layer of reliability, which *costs* some performance. To increase transactional throughput, you would need to add partitioning to the mix. Or you could make use of the gained reliability and abandon WAL - you won't need it as long as at least one replica is running - which should increase each single node's throughput and therefore the cluster's throughput, too.

When replication meets partitioning and load balancing, you get into a whole new world where new trade-offs need to be considered. Some look similar to those of RAID storage - Sequoia's term RAIDb probably isn't bad at all.

Regards

Markus

