Re: [HACKERS] Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule?

Heikki Linnakangas Tue, 12 May 2015 05:15:45 -0700

On 05/12/2015 03:27 AM, digoal zhou wrote:

    PostgreSQL (<=9.4) trend to smooth buffer write smooth in a
checkpoint_completion_target (checkpoint_timeout or checkpoint_segments),
but when we use synchronous_commit=off, there is a little problem for
the checkpoint_segments
target, because xlog write fast(for full page write which the first page
write after checkpoint), so checkpointer cann't sleep and write buffer not
smooth.
...
I think we can add an condition to the IsCheckpointOnSchedule,
                 if (synchronous_commit != SYNCHRONOUS_COMMIT_OFF)
                 {
                     recptr = GetInsertRecPtr();
                     elapsed_xlogs = (((double) (recptr -
ckpt_start_recptr)) / XLogSegSize) / CheckPointSegments;


                     if (progress < elapsed_xlogs)
                     {
                         ckpt_cached_elapsed = elapsed_xlogs;
                         return false;
                     }
                  }

This has nothing to do with asynchronous_commit, except that settingasynchronous_commit=off makes your test case run faster, and hit theproblem harder.

I think the real problem here is that IsCheckpointOnSchedule assumesthat the rate of WAL generated is constant throughout the checkpointcycle, but in reality you generate a lot more WAL immediately after thecheckpoint begins, thanks to full_page_writes. For example, in thebeginning of the cycle, you quickly use up, say, 20% of the WAL space inthe first 10 seconds, and the scheduling thinks it's in a lot of hurryto finish the checkpoint because it extrapolates that the rest of theWAL will be used up in the next 40 seconds. But in reality, the WALconsumption levels off, and you have many minutes left untilCheckPointSegments.

Can you try the attached patch? It modifies the above calculation totake the full-page-write effect into account. I used X^1.5 as thecorrective function, which roughly reflects the typical WAL consumptionpattern. You can adjust the exponent, 1.5, to make the correction moreor less aggressive.


- Heikki

diff --git a/src/backend/postmaster/checkpointer.c b/src/backend/postmaster/checkpointer.c
index 0dce6a8..fb02f56 100644
--- a/src/backend/postmaster/checkpointer.c
+++ b/src/backend/postmaster/checkpointer.c
@@ -763,6 +763,19 @@ IsCheckpointOnSchedule(double progress)
 		recptr = GetInsertRecPtr();
 		elapsed_xlogs = (((double) (recptr - ckpt_start_recptr)) / XLogSegSize) / CheckPointSegments;
 
+		/*
+		 * Immediately after a checkpoint, a lot more WAL is generated when
+		 * full_page_write is enabled, because every WAL record has to include
+		 * a full image of the modified page. It levels off as time passes and
+		 * more updates fall on pages that have already been modified since
+		 * the last checkpoint.
+		 *
+		 * To correct for that effect, apply a corrective factor on the
+		 * amount of WAL consumed so far.
+		 */
+		if (fullPageWrites)
+			elapsed_xlogs = pow(elapsed_xlogs, 1.5);
+
 		if (progress < elapsed_xlogs)
 		{
 			ckpt_cached_elapsed = elapsed_xlogs;

-- 
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Let PostgreSQL's On Schedule checkpoint write buffer smooth spread cycle by tuning IsCheckpointOnSchedule?

Reply via email to