Re: [HACKERS] Partitioned checkpointing

Fabien COELHO Fri, 11 Sep 2015 09:29:32 -0700


Hello Simon,

The idea to do a partial pass through shared buffers and only write a
fraction of dirty buffers, then fsync them is a good one.


Sure.

The key point is that we spread out the fsyncs across the whole checkpoint
period.


Yes, this is really Andres suggestion, as I understood it.

I think we should be writing out all buffers for a particular file in one
pass, then issue one fsync per file.  >1 fsyncs per file seems a bad idea.

This is one of the things done in the "checkpoint continuous flushing"patch, as buffers are sorted, they are written per file, and in orderwithin a file, which help getting sequencial writes instead of randomwrites.


See https://commitfest.postgresql.org/6/260/

However for now the final fsync is not called, but Linux is told that thewritten buffers must be flushed, which is akin to an "asynchronous fsync",i.e. it asks to move data but does not wait for the data to be actuallywritten, as a blocking fsync would.

Andres suggestion, which has some common points to Takashi-san patch, isto also integrate the fsync in the buffer writing process. There are somedetails to think about, because probably it is not a a good to issue anfsync right after the corresponding writes, it is better to wait for somedelay before doing so, so the implementation is not straightforward.


--
Fabien.


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Partitioned checkpointing

Reply via email to