Re: [HACKERS] Sync Rep: First Thoughts on Code

Mark Mielke Sun, 14 Dec 2008 10:04:37 -0800

Simon Riggs wrote:

I am truly lost to understand why the *name* "synchronous replication"
causes so much discussion, yet nobody has discussed what they would
actually like the software to *do* (this being a software discussion
list...). AFAICS we can make the software behave like *any* of the
definitions discussed so far.

I think people have talked about 'like' in the context of userexpectations. That is, there seems to exist a set of people (probablythose who've never worked with a multi-replica solution before) whoexpect that once commit completes on one server, they can query anyother master or slave and be guaranteed visibility of the transactionthey just committed. These people may theoretically change theirdecision to not use Postgres-R, or at least change their approach to howthey work with Postgres-R, if the name was in some way more intuitive tothem in terms of what is actually being provided.

"Synchronous replication" itself says only details about replication, itdoes not say anything about visibility, so to some degree, people arefocusing on the wrong term as the problem. Even if it says "asynchronousreplication" - not sure that I care either way - this doesn't improvethe understanding for the casual user of what is happening behind thescenes. Neither synchronous nor asynchronous guarantees that the changewill be immediately visible from other nodes after I type 'commit;'.Asynchronous might err on the side of not immediately visible, wheresynchronous might (incorrectly) imply immediate visibility, but it's notan accurate guarantee to provide.

Synchronous does not guarantee visibility immediately after. Someindefinite but usually short time must normally pass from when my'commit;' completes until when the shared memory visible to my process"sees" the transaction. Multiple replicas with network latency orreliability issues increases the theoretical minimum size of this windowto something that would be normally encountered as opposed to somethingthat is normally not encountered.

The only way to guarantee visibility is to ensure that the newtransaction is guaranteed to be visible from a shared memory perspectiveon every machine in the pool, and every active backend process. If my'commit;' is going to wait for this to occur, first, I think this forcesevery commit to have numerous network round trips to each machine in thepool, it forces each machine in the pool to be network accessible andresponsive, it forces all commits to be serialized in the sense of "theslowest machine in the pool determines the time for my commit tocomplete", and I think it implies some sort of inter-process signalling,or at the very least CPU level signalling about shared memory (in thecase of multiple CPUs).

People such as myself think that a visibility guarantee is unreasonableand certain to cause scalability or reliability problems. So, my 'like'is an efficient multi-master solution where if I put 10 machines in thepool, I expect my normal query/commit loads to approach 10X as fast. Mylike prefers scalability over guarantees that may be difficult toprovide, and probably are not provided today even in a single serverscenario.

It is certainly far too early to say what the final exact behaviour will
be and there is no reason at all to pre-suppose that it need only be a
single behaviour. I'm in favour of options, generally, but I would say
that the distinction between some of these options is mostly very fine
and strongly doubt whether people would use them if they existed. *But*
I think we can add them at a later stage of development if requirements
genuinely exist once all the benefits *and* costs are understood.

The above 'commit;' behaviour difference - whether it completes when thecommit is permanent (it definitely will be applied for certain to allreplicas - it just may take time to apply to all replicas), or when thecommit has actually taken effect (two-phase commit on all replicas - andboth phases have completed on all replicas - what happens if secondphase commit fails on one or more servers?), or when the commit isguaranteed to be visible from all existing and new sessionss (two-phasecommit plus additional signalling required?) might be such an option.

I'm doubtful, though - as the difference in implementation between thefirst and second is pretty significant.

I'm curious about your suggestion to direct queries that need the latestsnapshot to the 'primary'. I might have misunderstood it - but it seemsthat the expectation from some is that *all* sessions see the latestsnapshot, so would this not imply that all sessions would be redirect tothe 'primary'? I don't think it is reasonable myself, but I might bemisunderstanding something...


Cheers,
mark

--
Mark Mielke <[email protected]>


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Sync Rep: First Thoughts on Code

Reply via email to