On 06.06.2013 15:31, Kevin Grittner wrote:
Heikki Linnakangas<hlinnakan...@vmware.com>  wrote:
On 05.06.2013 22:18, Kevin Grittner wrote:
Heikki Linnakangas<hlinnakan...@vmware.com>   wrote:

I was not thinking of making it a hard limit. It would be just
like checkpoint_segments from that point of view - if a
checkpoint takes a long time, max_wal_size might still be
exceeded.

Then I suggest we not use exactly that name.  I feel quite sure we
would get complaints from people if something labeled as "max" was
exceeded -- especially if they set that to the actual size of a
filesystem dedicated to WAL files.

You're probably right. Any suggestions for a better name?
wal_size_soft_limit?

After reading later posts on the thread, I would be inclined to
support making it a hard limit and adapting the behavior to match.

Well, that's a lot more difficult to implement. And even if we have a
hard limit, I think many people would still want a soft limit that
triggers a checkpoint but does not stop WAL writes from happening. So
what would we call that?

I'd love to see a hard limit too, but I see that as an orthogonal feature.

How about calling the (soft) limit "checkpoint_wal_size"? That goes
well together with checkpoint_timeout, meaning that a checkpoint will
be triggered if you're about to exceed the given size.
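
To make that concrete, in postgresql.conf it could look something like
this (the new name and the values are just placeholders for the
proposal, nothing is set in stone):

checkpoint_timeout = 5min       # time-based trigger, as today
checkpoint_wal_size = 1GB       # soft limit: trigger a checkpoint before
                                # roughly this much WAL accumulates in
                                # one checkpoint cycle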

> I'm also concerned about the "spin up" from idle to high activity.
> Perhaps a "min" should also be present, to mitigate repeated short
> checkpoint cycles for "bursty" environments?

With my proposal, you wouldn't get repeated short checkpoint cycles
with bursts. The checkpoint interval would be controlled by
checkpoint_timeout and checkpoint_wal_size. If there is a lot of
activity, checkpoints will happen more frequently, as
checkpoint_wal_size is reached sooner. But it would depend only on the
activity in the current checkpoint cycle, not on previous ones, so it
would make no difference whether you have a continuously high load or a
bursty one.
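
Roughly, I'm thinking of a trigger condition along these lines (just a
sketch to show the idea; the names are made up and this is not actual
checkpointer code):

#include <stdbool.h>
#include <stdint.h>

/* Made-up GUC-like settings, for illustration only. */
static int64_t checkpoint_wal_size = 1024L * 1024 * 1024;  /* 1 GB soft limit */
static int     checkpoint_timeout  = 300;                  /* seconds */

/*
 * Request a checkpoint when either the timeout elapses or the WAL
 * written in the *current* cycle approaches the soft size limit.
 * Only the current cycle counts; past cycles don't factor in here.
 */
static bool
checkpoint_needed(int64_t wal_bytes_this_cycle, int elapsed_secs)
{
    if (elapsed_secs >= checkpoint_timeout)
        return true;
    if (wal_bytes_this_cycle >= checkpoint_wal_size)
        return true;
    return false;
}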

The history would matter for the calculation of how many segments to
preallocate/recycle, however. Under the proposal, that would be
calculated separately from checkpoint_wal_size, and for that we'd use
some kind of a moving average of how many segments were used in
previous cycles. A min setting might be useful for that. We could also
try to make WAL file creation cheaper, i.e. by using posix_fallocate(),
as was proposed in another thread, and doing it in bgwriter or
walwriter. That would make it less important to get the estimate right
from a performance point of view, although you'd still want to get it
right to avoid running out of disk space (having the segments
preallocated ensures that they are available when needed).
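
Something like the sketch below is what I have in mind: a moving
average of segments used per cycle with a floor (the "min" you
suggested), plus creating a segment with posix_fallocate(). The names
and the smoothing factor are placeholders, not a worked-out design:

#include <fcntl.h>
#include <unistd.h>

#define WAL_SEG_SIZE   (16 * 1024 * 1024)   /* 16 MB WAL segment */

/* Placeholders for illustration; not actual GUCs or xlog.c code. */
static double segments_avg = 0.0;
static int    min_recycle_segments = 5;

/*
 * Update the estimate of how many segments to keep preallocated or
 * recycled, using a moving average of segments consumed in previous
 * cycles, never going below the configured minimum.
 */
static int
update_recycle_target(int segments_used_last_cycle)
{
    segments_avg = 0.9 * segments_avg + 0.1 * segments_used_last_cycle;
    if (segments_avg < min_recycle_segments)
        return min_recycle_segments;
    return (int) segments_avg;
}

/*
 * Create a new WAL segment cheaply with posix_fallocate() instead of
 * writing 16 MB of zeroes; this could be done in bgwriter or walwriter.
 */
static int
preallocate_wal_segment(const char *path)
{
    int fd = open(path, O_CREAT | O_EXCL | O_WRONLY, 0600);

    if (fd < 0)
        return -1;
    if (posix_fallocate(fd, 0, WAL_SEG_SIZE) != 0)
    {
        close(fd);
        unlink(path);
        return -1;
    }
    return close(fd);
}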

- Heikki

