[HACKERS] CheckpointStartLock starvation

Heikki Linnakangas Mon, 02 Apr 2007 12:03:52 -0700

I'm seeing a problem on my benchmark machine: checkpoints stop happeningafter the ramp-up period.

It looks like the bgwriter gets starved waiting on theCheckpointStartLock. The CheckpointStartLock is held in shared mode overan XLogFlush when committing, which on an extremely busy system like abenchmark is always long enough to have a new transaction to acquire theCheckpointStartLock again.

I'm running another test with more logging to confirm that's what'shappening, but I'm pretty sure that's it...

As a proposed fix, instead of acquiring the CheckpointStartLock inRecordTransactionCommit, we set a flag in MyProc saying "commit inprogress". Checkpoint will scan through the procarray and make note ofany commit in progress transactions, after computing the new redo recordptr, and wait for all of them to finish before flushing clog.


Unless someone has a better idea, I'll write a patch to do the above.

--
  Heikki Linnakangas
  EnterpriseDB   http://www.enterprisedb.com

---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

              http://www.postgresql.org/docs/faq

[HACKERS] CheckpointStartLock starvation

Reply via email to