Re: [HACKERS] PATCH: regular logging of checkpoint progress

Greg Smith Fri, 26 Aug 2011 00:35:56 -0700

On 08/25/2011 04:57 PM, Tomas Vondra wrote:

(b) sends bgwriter stats (so that the buffers_checkpoint is updated)

The idea behind only updating the stats in one chunk, at the end, isthat it makes one specific thing easier to do. Let's say you're runninga monitoring system that is grabbing snapshots of pg_stat_bgwriterperiodically. If you want to figure out how much work a checkpoint did,you only need two points of data to compute that right now. Wheneveryou see either of the checkpoint count numbers increase, you justsubtract off the previous sample; now you've got a delta for how manybuffers that checkpoint wrote out. You can derive the information aboutthe buffer counts involved that appears in the logs quite easily thisway. The intent was to make that possible to do, so that people canfigure this out without needing to parse the log data.

Spreading out the updates defeats that idea. It also makes it possibleto see the buffer writes more in real-time, as they happen. You canmake a case for both approaches having their use cases; the above isjust summarizing the logic behind why it's done the way it is rightnow. I don't think many people are actually doing things with this tothe level where their tool will care. The most popular consumer ofpg_stat_bgwriter data I see is Munin graphing changes, and I don't thinkit will care either way.

Giving people the option of doing it the other way is a reasonable idea,but I'm not sure there's enough use case there to justify adding a GUCjust for that. My next goal here is to eliminate checkpoint_segments,not to add yet another tunable extremely few users would ever touch.

As for throwing more log data out, I'm not sure what new analysis you'rethinking of that it allows. I/O gets increasingly spiky as you zoom inon it; averaging over a shorter period can easily end up providing lessinsight about trends. If anything, I spend more time summarizing thedata that's already there, rather than wanting to break them down. It'salready providing way too much detail for most people. Customers tellme they don't care to see checkpoint stats unless they're across a dayor more of sampling, so even the current "once every ~5 minutes" is waymore info than they want. I have all this log parsing code and thingsthat look at pg_stat_bgwriter to collect that data and produce higherlevel reports. And lots of it would break if any of this patch is addedand people turn it on. I imagine other log/stat parsing programs mightsuffer issues too. That's your other hurdle for change here: the newanalysis techniques have to be useful enough to justify that somedownstream tool disruption is inevitable.

If you have an idea for how to use this extra data for something useful,let's talk about what that is and see if it's possible to build it ininstead. This problem is harder than it looks, mainly because the waythe OS caches writes here makes trying to derive hard numbers from whatthe background writer is doing impossible. When the database writesthings out, and when they actually get written to disk, they are not thesame event. The actual write is often during the sync phase, and notbeing able to tracking that beast is where I see the most problems at.The write phase, the easier part to instrument in the database, that ispretty boring. That's why the last extra logging I added here focusedon adding visibility to the sync activity instead.


--
Greg Smith   2ndQuadrant US    [email protected]   Baltimore, MD
PostgreSQL Training, Services, and 24x7 Support  www.2ndQuadrant.us


--
Sent via pgsql-hackers mailing list ([email protected])
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] PATCH: regular logging of checkpoint progress

Reply via email to