Simon Riggs wrote:
You scare me that you see failover as sufficiently frequent that you are
worried that being without one of the servers for an extra 60 seconds
during a failover is a problem. And then say you're not going to add the
feature after all. I really don't understand. If it's important, add the
feature, the whole feature that is. If not, don't.

My expectation is that most failovers are serious ones, that the primary
system is down and not coming back very fast. Your worries seem to come
from a scenario where the primary system is still up but Postgres
bounces/crashes, we can diagnose the cause of the crash, decide the
crashed server is safe and then wish to recommence operations on it
again as quickly as possible, where seconds count in doing so.

Are failovers going to be common? Why?

Hi Simon:

I agree with most of your criticism of the "fail over only" approach - but I don't agree that failover frequency should really impact expectations for the failed system to return to service. I see "soft" fails (*not* serious ones) as potentially common: somewhere on the network, something went down or some packet was lost, and the system took a few too many seconds to respond.

My expectation is that the cluster can quickly detect that the node is out of service and remove it from the pool; then, when the situation is resolved (often automatically, outside of my control), the node can automatically "catch up" and be put back into the pool. Having to run some other process such as rsync seems unreliable when we already have a mechanism for streaming the data. All that is missing is streaming from an earlier point in time to catch up efficiently and reliably.
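To make the cycle I have in mind concrete, here is a minimal sketch (hypothetical Python, not PostgreSQL code - the Node/Pool names and the integer "WAL position" are illustrative assumptions, not any real API): a node that fails a health check is dropped from the pool, later catches up by streaming from the point where it fell behind rather than re-copying everything, and is then put back into the pool.

```python
class Node:
    """A replica, tracked by the last WAL position it has applied."""
    def __init__(self, name):
        self.name = name
        self.applied_pos = 0   # last WAL position applied by this node
        self.in_pool = True

class Pool:
    """Hypothetical pool manager for the detect/remove/catch-up/rejoin cycle."""
    def __init__(self, nodes):
        self.nodes = list(nodes)
        self.master_pos = 0    # current WAL position on the primary

    def advance(self, n):
        # Primary generates n units of WAL; nodes still in the pool keep up.
        self.master_pos += n
        for node in self.nodes:
            if node.in_pool:
                node.applied_pos = self.master_pos

    def health_check(self, node, reachable):
        # A "soft" fail (timeout, dropped packet): remove the node from
        # the pool, but keep its last applied position.
        if not reachable:
            node.in_pool = False

    def rejoin(self, node):
        # Catch up by streaming from node.applied_pos - an earlier point
        # in time - instead of running rsync over the whole data directory.
        if not node.in_pool:
            node.applied_pos = self.master_pos
            node.in_pool = True
```

The point of the sketch is only the last step: because the node remembers where it stopped, rejoining is a stream from that position forward, not a full re-sync.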

I think I'm talking more about the complete solution though which is in line with what you are saying? :-)

Cheers,
mark

--
Mark Mielke <m...@mielke.cc>


--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
