[HACKERS] Log levels for checkpoint/bgwriter monitoring

Greg Smith Mon, 19 Feb 2007 20:02:19 -0800

I have a WIP patch that adds the main detail I have found I need to properly tune checkpoint and background writer activity. I think it's almost ready to submit (you can see the current patch against 8.2 at http://www.westnet.com/~gsmith/content/postgresql/patch-checkpoint.txt ) after making it a bit more human-readable. But I've realized that along with that, I need some guidance in regards to what log level is appropriate for this information.


An example works better than explaining what the patch does:

2007-02-19 21:53:24.602 EST - DEBUG: checkpoint required (wrote checkpoint_segments)

2007-02-19 21:53:24.685 EST - DEBUG:  checkpoint starting
2007-02-19 21:53:24.705 EST - DEBUG:  checkpoint flushing buffer pool
2007-02-19 21:53:24.985 EST - DEBUG:  checkpoint database fsync starting
2007-02-19 21:53:42.725 EST - DEBUG:  checkpoint database fsync complete

2007-02-19 21:53:42.726 EST - DEBUG: checkpoint buffer flush dirty=8034 write=279956 us sync=17739974 us

Remember that "Load distributed checkpoint" discussion back in December? You can see exactly how bad the problem is on your system with this log style (this is from a pgbench run where it's positively awful--it really does take over 17 seconds for the fsync to execute, and there are clients that are hung the whole time waiting for it).
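
As a rough illustration of reading the "checkpoint buffer flush" summary line, here is a minimal Python sketch that pulls out the dirty buffer count and converts the write and sync times from microseconds to seconds. The regular expression and the function name parse_checkpoint_flush are assumptions based only on the single sample line in this message; the patch itself may format the line differently.

```python
import re

# Field layout inferred from the one sample line in this message; it is an
# assumption, not a documented PostgreSQL log format.
FLUSH_RE = re.compile(
    r"checkpoint buffer flush\s+dirty=(\d+)\s*write=(\d+)\s*us\s+sync=(\d+)\s*us"
)

def parse_checkpoint_flush(line):
    """Return the dirty buffer count plus write/sync times in seconds, or None."""
    m = FLUSH_RE.search(line)
    if m is None:
        return None
    dirty, write_us, sync_us = (int(g) for g in m.groups())
    return {
        "dirty": dirty,
        "write_s": write_us / 1_000_000,  # microseconds -> seconds
        "sync_s": sync_us / 1_000_000,
    }

sample = ("2007-02-19 21:53:42.726 EST - DEBUG: checkpoint buffer flush "
          "dirty=8034 write=279956 us sync=17739974 us")
stats = parse_checkpoint_flush(sample)
print(stats)
```

On the sample line the sync time comes out at a little over 17.7 seconds, which is the fsync stall described above.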


I also instrumented the background writer.  You get messages like this:

2007-02-19 21:58:54.328 EST - DEBUG: BGWriter Scan All - Written = 5/5 Unscanned = 23/54

This shows that we wrote (5) the maximum pages we were allowed to write (5) while failing to scan almost half (23) of the buffers we meant to look at (54). By taking a look at this output while the system is under load, I found I was able to do bgwriter optimization that used to take me days of frustrating testing in hours. I've been waiting for a good guide to bgwriter tuning since 8.1 came out. Once you have this, combined with knowing how many buffers were dirty at checkpoint time because the bgwriter didn't get to them in time (the number you want to minimize), the tuning guide practically writes itself.
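
To make the reading of the BGWriter line concrete, the following Python sketch extracts the four numbers and derives the two things discussed above: whether the write limit was hit, and what fraction of the intended scan was skipped. The pattern and the name parse_bgwriter are hypothetical and match only the example line in this message.

```python
import re

# Pattern inferred from the single "BGWriter Scan All" sample in this
# message; the real patch output may differ.
BGW_RE = re.compile(
    r"BGWriter Scan All - Written = (\d+)/(\d+)\s*Unscanned = (\d+)/(\d+)"
)

def parse_bgwriter(line):
    """Return write and scan counts with two derived indicators, or None."""
    m = BGW_RE.search(line)
    if m is None:
        return None
    written, max_write, unscanned, to_scan = (int(g) for g in m.groups())
    return {
        "written": written,
        "max_write": max_write,
        "unscanned": unscanned,
        "to_scan": to_scan,
        # True when the writer used its whole per-round write allowance.
        "hit_write_limit": max_write > 0 and written >= max_write,
        # Share of the buffers it meant to look at but never reached.
        "unscanned_fraction": unscanned / to_scan if to_scan else 0.0,
    }

sample = ("2007-02-19 21:58:54.328 EST - DEBUG: BGWriter Scan All - "
          "Written = 5/5 Unscanned = 23/54")
info = parse_bgwriter(sample)
print(info)
```

For the sample, hit_write_limit is true and unscanned_fraction is 23/54, the "almost half" mentioned above.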

So my question is...what log level should all this go at? Right now, I have the background writer stuff adjusting its level dynamically based on what happened; it logs at DEBUG2 if it hits the write limit (which should be a rare event once you're tuned properly), DEBUG3 for writes that scanned everything they were supposed to, and DEBUG4 if it scanned but didn't find anything to write. The source of checkpoint information logs at DEBUG1, the fsync/write info at DEBUG2.
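
The dynamic level choice described above can be sketched roughly as follows. This is a Python paraphrase of the prose, not the patch's actual C code: the function name bgwriter_log_level and its two parameters are hypothetical, and treating "scanned everything" as "wrote something but stayed under the limit" is an assumption.

```python
def bgwriter_log_level(written, max_write):
    """Pick a level for one bgwriter round, following the rules in the text.

    written   -- pages written this round
    max_write -- maximum pages the round was allowed to write
    """
    if max_write > 0 and written >= max_write:
        # Hit the write limit: should be rare on a well-tuned system.
        return "DEBUG2"
    if written > 0:
        # Wrote some pages and (by assumption) finished its scan.
        return "DEBUG3"
    # Scanned but found nothing to write.
    return "DEBUG4"

for written, max_write in [(5, 5), (3, 5), (0, 5)]:
    print(written, max_write, bgwriter_log_level(written, max_write))
```

The point of the design is that the least interesting rounds sit at the deepest level, so raising verbosity one step at a time surfaces the problem cases first.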

I'd like to move some of these up. On my system, I even have many of the checkpoint messages logged at INFO (the source of the checkpoint and the total write time line). It's a bit chatty, but when you get some weird system pause issue it makes it easy to figure out if checkpoints were to blame. Given how useful I feel some of these messages are to system tuning, and to explaining what currently appears as inexplicable pauses, I don't want them to be buried at DEBUG levels where people are unlikely to ever see them (I think some people may be concerned about turning on things labeled DEBUG at all). I am aware that I am too deep into this to have an unbiased opinion at this point though, which is why I ask for feedback on how to proceed here.


--
* Greg Smith [EMAIL PROTECTED] http://www.gregsmith.com Baltimore, MD

