Re: [HACKERS] checkpointer continuous flushing - V18

Fabien COELHO Sun, 21 Feb 2016 00:02:19 -0800


Hallo Andres,

[...] I do think that this whole writeback logic really does make sense*per table space*,
Leads to less regular IO, because if your tablespaces are evenly sized
(somewhat common) you'll sometimes end up issuing sync_file_range's
shortly after each other.  For latency outside checkpoints it's
important to control the total amount of dirty buffers, and that's
obviously independent of tablespaces.
I do not understand/buy this argument.
The underlying IO queue is per device, and table spaces should be per deviceas well (otherwise what the point?), so you should want to coalesce and"writeback" pages per device as wel. Calling sync_file_range on distinctdevices should probably be issued more or less randomly, and should notinterfere one with the other.
If you use just one context, the more table spaces the less performancegains, because there is less and less aggregation thus sequential writes perdevice.
So for me there should really be one context per tablespace. That wouldsuggest a hashtable or some other structure to keep and retrieve them, whichwould not be that bad, and I think that it is what is needed.

Note: I think that an easy way to do that in the "checkpoint sort" patchis simply to keep a WritebackContext in CkptTsStatus structure which isper table space in the checkpointer.

For bgwriter & backends it can wait, there is few "writeback" coalescingbecause IO should be pretty random, so it does not matter much.


--
Fabien.


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] checkpointer continuous flushing - V18

Reply via email to