Re: [HACKERS] max_standby_delay considered harmful

Greg Smith Tue, 04 May 2010 16:27:14 -0700

Tom Lane wrote:

1. The timestamps we are reading from the log might be historical,
if we are replaying from archive rather than reading a live SR stream.
In the current implementation that means zero grace period for standby
queries.  Now if your only interest is catching up as fast as possible,
that could be a sane behavior, but this is clearly not the only possible
interest --- in fact, if that's all you care about, why did you allow
standby queries at all?

If the standby is not current, you may not want people to execute queries against it. In some situations, returning results against obsolete data is worse than not letting the query execute at all. As I see it, the current max_standby_delay implementation includes the expectation that the results you are getting are no more than max_standby_delay behind the master, presuming that new data is still coming in. If the standby has really fallen further behind than that, there are situations where you don't want it doing anything but catching up until that is no longer the case, and you especially don't want it returning stale query data.

The fact that tuning in that direction could mean the standby never actually executes any queries is something you need to monitor for--it suggests the standby isn't powerful enough, or well enough connected to the master, to keep up--but that's not necessarily the wrong behavior. Saying "I only want the standby to execute queries if it's not too far behind the master" is the answer to "why did you allow standby queries at all?" when tuning for that use case.

2. There could be clock skew between the master and slave servers.

Not the database's problem to worry about. Document that time should be carefully sync'd and move on. I'll add that.

3. There could be significant propagation delay from master to slave,
if the WAL stream is being transmitted with pg_standby or some such.
Again this results in cutting into the standby queries' grace period,
for no defensible reason.

Then people should adjust their max_standby_delay upwards to account for that. For high availability purposes, it's vital that the delay number be referenced to the commit records on the master. If lag is eating a portion of that, again it's something people should be monitoring for, but not something we can correct. The whole idea here is that max_standby_delay is an upper bound on how stale the data on the standby can be, and whether or not lag is a component to that doesn't impact how the database is being asked to act.

In addition to these fundamental problems there's a fatal implementation
problem: the actual comparison is not to the master's current clock
reading, but to the latest commit, abort, or checkpoint timestamp read
from the WAL.

Right; this has been documented for months at http://wiki.postgresql.org/wiki/Hot_Standby_TODO and on the list before that, i.e. "If there's little activity in the master, that can lead to surprising results." The suggested long-term fix has been adding keepalive timestamps into SR, which seems to get reinvented every time somebody plays with this for a bit. The HS documentation improvements I'm working on will suggest that you make sure this doesn't happen, that people have some sort of keepalive WAL-generating activity on the master regularly, if they expect max_standby_delay to work reasonably in the face of an idle master. It's not ideal, but it's straightforward to work around in user space.
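
To make the idle-master case concrete, here is a rough sketch in Python. It is a simplified model of the comparison described above, not the committed code; the function name, parameter names, and numbers are made up for illustration.

```python
def grace_remaining(standby_now, last_wal_timestamp, max_standby_delay):
    """Seconds a conflicting standby query may keep running.

    Simplified model: the standby compares its own clock against the
    latest commit/abort/checkpoint timestamp it has read from the WAL,
    not against the master's current clock.
    """
    apparent_lag = standby_now - last_wal_timestamp
    return max(0.0, max_standby_delay - apparent_lag)


# Busy master: the last timestamp seen in the WAL is 2 seconds old.
busy = grace_remaining(standby_now=1000.0, last_wal_timestamp=998.0,
                       max_standby_delay=30.0)

# Idle master: nothing has been written for 10 minutes, so the standby
# looks 600 seconds behind even though it has replayed everything.
idle = grace_remaining(standby_now=1000.0, last_wal_timestamp=400.0,
                       max_standby_delay=30.0)

print(busy, idle)
```

In this model the idle case leaves no grace period at all, which is why regular WAL-generating activity on the master keeps the timestamp fresh enough for max_standby_delay to behave as expected.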

I'm inclined to think that we should throw away all this logic and just
have the slave cancel competing queries if the replay process waits
more than max_standby_delay seconds to acquire a lock.  This is simple,
understandable, and behaves the same whether we're reading live data or
not.

I don't consider something that allows queries to execute when not replaying recent "live" data to be necessarily a step forward, from the perspective of implementations preferring high-availability. It's reasonable for some people to request that the last thing a standby that's not current (where current means <max_standby_delay behind the master, based on the last thing received) should be doing is answering any queries, when it doesn't have current data and it should be working on catchup instead.
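
The difference between the two behaviors can be sketched like this. Again this is only an illustrative model in Python with invented names, assuming the timestamp-based rule compares the standby clock to the last WAL timestamp and the proposed rule compares it to when replay started waiting on a lock.

```python
def cancel_timestamp_based(standby_now, last_wal_timestamp, max_standby_delay):
    """Model of the committed behavior: cancel conflicting queries once
    the replayed data is more than max_standby_delay old."""
    return standby_now - last_wal_timestamp > max_standby_delay


def cancel_lock_wait_based(standby_now, replay_wait_started, max_standby_delay):
    """Model of the proposed behavior: cancel conflicting queries once
    replay has waited more than max_standby_delay for a lock."""
    return standby_now - replay_wait_started > max_standby_delay


# Standby replaying from archive: the WAL being applied is an hour old,
# and replay has been blocked by a query for 5 seconds.
now = 10_000.0
stale = cancel_timestamp_based(now, last_wal_timestamp=now - 3600.0,
                               max_standby_delay=30.0)
waiting = cancel_lock_wait_based(now, replay_wait_started=now - 5.0,
                                 max_standby_delay=30.0)

print(stale, waiting)
```

Under the first rule the query is cancelled right away because the standby is stale; under the second it keeps running against hour-old data for up to the full delay, which is the case I'm saying some high-availability setups don't want.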

Discussion here obviously has wandered past your fundamental objections and onto implementation trivia, but I didn't think the difference between what you expected and what's actually committed already was properly addressed before doing that.


--
Greg Smith  2ndQuadrant US  Baltimore, MD
PostgreSQL Training, Services and Support
[email protected]   www.2ndQuadrant.us

