Re: [HACKERS] Timeline following for logical slots

Petr Jelinek Thu, 07 Apr 2016 11:34:00 -0700

On 07/04/16 17:32, Robert Haas wrote:

Second, I'm not sure whether it was a good design decision to make
logical slots a special kind of object that sit off to the side,
neither configuration (like postgresql.conf) nor WAL-protected data
(like pg_clog and the data files themselves), but it was certainly a
very deliberate decision.  I sort of expected them to be WAL-logged,
but Andres argued (not unconvincingly) that we'd want to have slots on
standbys, and making them WAL-logged would preclude that.


Yeah. I understand the reasons for that decision. Per an earlier reply I
think we can avoid making them WAL-logged so they can be used on standbys
and still achieve usable failover support on physical replicas.


I think at one point we may have discussed doing this via additional
side-channel protocol messages.  Is that what you are thinking about
now, or something else?

I think the most promising idea was to use pull model instead of pushmodel for slot updates and using feedback to tell master how far theoldest slot on standby is. This has the additional advantage of solvingthe biggest problem with decoding on standby (keeping old enough catalogxid and lsn). And also solves your concern of propagating through wholecascade as this can be done in more controllable way (from bottom up inthe replication chain).

For version one, I would cut all of
the stuff that allows data to be sent in any format other than text,
and just use in/outfuncs all the time.

Agreed here, I think doing the binary transfer for base types, etc isjust optimization, not necessity. This is one of the reasons why Iwanted to get bit broader feedback on the protocol - to get some kind ofconsensus about what we want now, what we might want in the future andhow to handle getting from now to future without too much complexity orbreakage. For example currently we have flags for every protocol messagewhich I am not sure are completely necessary there, OTOH we probablydon't want to bump protocol version with every new version of Postgreseither.


I do generally think that logical decoding relies too much on trying
to set up situations where it will never fail, and I've said from the
beginning that it should make more provision to cope with failure
rather than just trying to avoid it.  If logical decoding never
breaks, great.  But the approach I would favor is to set things up so
that it automatically reclones if there is a replication break, and
then as an optimization project, try to eliminate those cases one by
one.

Well that really depends. I've seen way too many cases where people uselogical replication as transport mechanism, rather than replicationwhere the destination is same as source, and in those scenarios there isoften no way to "reclone" either because the historical data are nolonger on the source or because the data on the target were alreadyupdated after they've been replicated. But in general the idea ofrecovering from error rather than being hell bent on preventing it issomething I am pushing as well. For example it should be easier to lookat what's in replication queue and remove things from there if needed.


--
  Petr Jelinek                  http://www.2ndQuadrant.com/
  PostgreSQL Development, 24x7 Support, Training & Services


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Timeline following for logical slots

Reply via email to