Re: [HACKERS] Spread checkpoint sync

Greg Smith Sat, 15 Jan 2011 14:57:30 -0800

Robert Haas wrote:

That seems like a bad idea - don't we routinely recommend that people
crank this up to 0.9?  You'd be effectively bounding the upper range
of this setting to a value to the less than the lowest value we
recommend anyone use today.

I was just giving an example of how I might do an initial split.There's a checkpoint happening now at time T; we have a rough idea thatit needs to be finished before some upcoming time T+D. Currently withdefault parameters this becomes:


Write:  0.5 * D; Sync:  0

Even though Sync obviously doesn't take zero. The slop here is enoughthat it usually works anyway.


I was suggesting that a quick reshuffling to:

Write:  0.4 * D; Sync:  0.4 * D

Might be a good first candidate for how to split the time up better.The fact that this gives less writing time than the current biggestspread possible:


Write:  0.9 * D; Sync: 0

Is true. It's also true that in the case where sync time really iszero, this new default would spread writes less than the currentdefault. I think that's optimistic, but it could happen if checkpointsare small and you have a good write cache.

Step back from that a second though. Ultimately, the person who isgetting checkpoints at a 5 minute interval, and is being nailed byspikes, should have the option of just increasing the parameters to makethat interval bigger. First you increase the measly default segments toa reasonable range, then checkpoint_completion_target is the second oneyou can try. But from there, you quickly move onto makingcheckpoint_timeout longer. At some point, there is no option but togive up checkpoints every 5 minutes as being practical, and make theaverage interval longer.

Whether or not a refactoring here makes things slightly worse for casescloser to the default doesn't bother me too much. What bothers me isthe way trying to stretch checkpoints out further fails to deliver aswell as it should. I'd be OK with saying "to get the exact same spreadsituation as in older versions, you may need to retarget for checkpointsevery 6 minutes" *if* in the process I get a much better sync latencydistribution in most cases.

Here's an interesting data point from the customer site this all startedat, one I don't think they'll mind sharing since it helps make thesituation more clear to the community. After applying this code tospread sync out, in order to get their server back to functional we hadto move all the parameters involved up to where checkpoints were spaced35 minutes apart. It just wasn't possible to write any faster than thatwithout disrupting foreground activity.The whole current model where people think of this stuff in terms ofsegments and completion targets is a UI disaster. The direction I wantto go in is where users can say "make sure checkpoints happen every Nminutes", and something reasonable happens without additional parameterfiddling. And if the resulting checkpoint I/O spike is too big, theyjust increase the timeout to N+1 or N*2 to spread the checkpointfurther. Getting too bogged down thinking in terms of the current,really terrible interface is something I'm trying to break myself of.Long-term, I want there to be checkpoint_timeout, and all the otherparameters are gone, replaced by an internal implementation of the bestpractices proven to work even on busy systems. I don't have as muchclarity on exactly what that best practice is the way that, say, I justsuggested exactly how to eliminate wal_buffers as an important thing tomanually set. But that's the dream UI: you shoot for a checkpointinterval, and something reasonable happens; if that's too intense, youjust increase the interval to spread further. There probably will besmall performance regression possible vs. the current code withparameter combination that happen to work well on it. Preserving everyone of those is something that's not as important to me as making thetuning interface simple and clear.


--
Greg Smith   2ndQuadrant US    [email protected]   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support  www.2ndQuadrant.us
"PostgreSQL 9.0 High Performance": http://www.2ndQuadrant.com/books


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Spread checkpoint sync

Reply via email to