Re: [HACKERS] checkpointer continuous flushing

Fabien COELHO Sun, 06 Sep 2015 07:05:45 -0700


Hello Andres,

Here's a bunch of comments on this (hopefully the latest?)


Who knows?! :-)

version of the patch:

* I'm not sure I like the FileWrite & FlushBuffer API changes. Do you
 foresee other callsites needing similar logic?

I foresee that the bgwriter should also do something more sensible than generating random I/Os over HDDs, and this is also true for workers... But this is for another time, maybe.

Wouldn't it be just as easy to put this logic into the checkpointing code?

Not sure it would simplify anything, because the checkpointer currently knows about buffers but flushing is about files, which are hidden from view.

Doing it with this API change means that the code does not have to compute twice which file a buffer is in. The buffer/file boundary has to be broken somewhere anyway so that flushing can be done when needed, and the solution I took seems the simplest way to do it, without having to make the checkpointer too file-conscious.

* We don't do one-line ifs;


Ok, I'll add line returns.

function parameters are always in the same line as the function name


Ok, I'll try to improve.

* Wouldn't a binary heap over the tablespaces + progress be nicer?


I'm not sure where it would fit exactly.

Anyway, I think it would complicate the code significantly (compared to the straightforward array), so I would not do anything like that without a strong incentive, such as an actual failing case.

Moreover, such a data structure would probably require some kind of pointer (probably 8 bytes added per node, maybe more), and the amount of memory is already a concern, at least to me. It would also have to reside in shared memory, which does not simplify allocation of tree data structures.

If you make the sorting criterion include the tablespace id you wouldn't need the lookahead loop in NextBufferToWrite().

Yep, I thought of it. It would mean 4 more bytes per buffer, and a bsearch to find the boundaries, so significantly less simple code. I think that the current approach is ok as the number of tablespaces should be small.


It may be improved upon later if there is a motivation to do so.
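
To illustrate the alternative being weighed here, a minimal sketch follows, in Python rather than the patch's C. If the sort key starts with the tablespace id, the buffers of each tablespace form one contiguous run, and a binary search can find its boundaries. The tuple layout (tablespace, relfile, block), the integer ids and the name tablespace_bounds are assumptions made up for this example, not names from the patch.

```python
from bisect import bisect_left

def tablespace_bounds(sorted_tags, tablespace):
    """Return the half-open index range [start, end) of one tablespace.

    sorted_tags is a sorted list of hypothetical (tablespace, relfile, block)
    integer tuples; this layout is an assumption for illustration only.
    """
    # (t,) sorts before every (t, relfile, block), so it marks the run start.
    start = bisect_left(sorted_tags, (tablespace,))
    # The run ends where the next possible tablespace id would begin.
    end = bisect_left(sorted_tags, (tablespace + 1,))
    return start, end

tags = sorted([(2, 10, 0), (1, 5, 3), (1, 5, 1), (2, 9, 7), (3, 1, 0)])
bounds = {t: tablespace_bounds(tags, t) for t in (1, 2, 3)}
```

Each lookup costs O(log NBuffers) comparisons, at the price of carrying the tablespace id in every sort entry.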

Isn't the current approach O(NBuffers^2) in the worst case?

ISTM that the overall lookahead complexity is NBuffers * Ntablespace: buffers are scanned once for each tablespace. I assume that the number of tablespaces is kept low, and having simpler code which uses less memory seems a good idea.

ISTM that using a tablespace in the sorting would reduce the complexity to ln(NBuffers) * Ntablespace for finding the boundaries, and then NBuffers * (Ntablespace/Ntablespace) = NBuffers for scanning, at the expense of more memory and code complexity.


So this is a voluntary design decision.
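
The two estimates above can be compared with a rough counting sketch (Python, not code from the patch). It is a simplification: it assumes each tablespace rescans the whole array in the lookahead case, and it bounds each binary search by the bit length of the array size. The function names and the owners list (the tablespace id of each buffer) are hypothetical.

```python
def lookahead_examined(owners):
    """Entries examined when every tablespace scans the whole array once."""
    examined = 0
    for tablespace in set(owners):
        for owner in owners:
            examined += 1  # one look at each entry, whether it matches or not
    return examined

def sorted_examined(owners):
    """Entries examined when the array is also sorted by tablespace."""
    # A binary search over n entries needs at most n.bit_length() probes.
    probes_per_search = len(owners).bit_length()
    examined = 0
    for tablespace in set(owners):
        examined += 2 * probes_per_search     # find start and end of the run
        examined += owners.count(tablespace)  # then visit only that run
    return examined

# 4096 buffers spread evenly over 4 tablespaces.
owners = [i % 4 for i in range(4096)]
lookahead_cost = lookahead_examined(owners)   # NBuffers * Ntablespace
sorted_cost = sorted_examined(owners)         # about NBuffers + log terms
```

With few tablespaces the two counts differ only by the small factor Ntablespace, which is the assumption behind keeping the simpler array.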

--
Fabien.


