On Thu, 4 Dec 2008, Gregory Stark wrote:

My point was more that you could have a data warehouse on a non-dedicated machine, you could have a web server on a non-dedicated machine, or you could have a mixed server on a non-dedicated machine.

I should just finish the documentation, where there will be a big disclaimer saying "THESE SETTINGS ASSUME A SERVER DEDICATED TO POSTGRESQL!" That's the context here. Why, after you follow my tuning instructions, you're lucky if the server will run anything but the database afterwards.

Josh's logic is impeccable -- for the specific use case he's describing of a
truly dedicated server with enough disk space for a major production database.
But not every install is going to have gigabytes of space reserved for it and
not every admin is going to realize that he really should set aside gigabytes
of space even though he only expects his database to be a few megabytes.

It's really quite simple. Josh and I don't care directly about disk space used by the WAL for people with trivial databases. At all. Whatsoever. Maybe once, long ago, when we were young and frugal and skinny[1]; not now, or probably ever again in the future. If that's your concern, maybe there can be some companion utility named pgmiser that lowers parameters back down again. Your mascot can be some sort of animal that efficiently lives off small scraps of food or something.[2]

The context here is pgtune, which is aiming to make a fat elephant of a server faster so that there's an answer to people who say "My benchmarks are all running really slow, is this because my system with 16PT of RAM is only using 32MB of it for the database? This sucks, I'm going back to Oracle which used all my RAM." If there are people who instead think, "hey, I'll run this tuning utility to make my database faster, then it will also be a lot smaller!", maybe we can find a class about space/time tradeoffs in algorithm design to send them to or something.[3]

There are exactly two important things here. The first is how large checkpoint_segments needs to be for the considerable overhead of checkpoints to be bearable. That drives the setting up. Our super-fat DW application gets set to at least 64 so that when you bulk-load another TB of data into it, that doesn't get bottlenecked dumping gigabytes of dirty buffers every few seconds. If the database crashes and recovery reads or writes a bunch of data, who cares about random writes, because your SAN has a 4GB write cache on it and dozens of drives slaving away.
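As a concrete sketch of that DW starting point (illustrative values drawn from the numbers above, not pgtune's actual output; checkpoint_completion_target and its 0.9 value are my assumption here), a dedicated warehouse's postgresql.conf might begin with:

```ini
# Hypothetical DW-profile starting values, not a definitive recommendation.
# checkpoint_segments applies to the 8.x-era releases under discussion.
checkpoint_segments = 64            # spread checkpoints out during bulk loads
checkpoint_completion_target = 0.9  # smooth checkpoint I/O across the interval
```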

Driving the setting down is knowing how much time you'll have to wait for recovery to happen, which is really a measure of what your tolerance for downtime is. We're thinking that someone who picks the Desktop tuning may have no tolerance for the database being sluggish coming back up after Windows crashed and they rebooted, so they get a tiny setting to make recovery super fast.

Everybody else in our sample profiles falls in the middle of those two extremes, which is why the values curve the way they do. Web app? Probably not a lot of write volume, probably trouble if it's down a long time; how about 8, on the low side, but it gives checkpoints more time to spread out their I/O so worst-case latency isn't as bad. That's the sort of analysis those numbers come from. Do performance tuning long enough, juggling these trade-offs for new people all the time, and you get a gut feel for the right ballpark an app should start at based on its type. The whole idea behind this tool is that we're taking some of that hard-won knowledge and trying to automate the distribution of it.
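The per-profile logic above can be sketched as a simple lookup. This is not pgtune's actual code; the DW, web, and desktop values come from the discussion above, while the OLTP and mixed values are assumptions filled in for illustration.

```python
# Sketch of pgtune-style profile selection for checkpoint_segments.
# Only "dw", "web", and "desktop" values are taken from the discussion;
# "oltp" and "mixed" are assumed middle-ground numbers.
CHECKPOINT_SEGMENTS = {
    "dw": 64,       # bulk loads: keep checkpoints from bottlenecking
    "oltp": 16,     # assumed: steady writes, moderate recovery tolerance
    "mixed": 16,    # assumed
    "web": 8,       # low write volume, but downtime hurts
    "desktop": 3,   # the old default: fastest possible crash recovery
}

def pick_checkpoint_segments(workload: str) -> int:
    """Return a starting checkpoint_segments value for a workload type."""
    try:
        return CHECKPOINT_SEGMENTS[workload]
    except KeyError:
        raise ValueError(f"unknown workload type: {workload!r}")

print(pick_checkpoint_segments("web"))
```

The point of the table shape is exactly the curve described above: the two extremes anchor the ends, and everything else interpolates between downtime tolerance and write volume.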

It's great that Postgres has such great documentation but whenever we have the
chance to replace something with an option which doesn't need any
documentation that would be even better. I'm just exploring whether that's an
option here.

I would be glad to have a post-CommitFest discussion of this very topic as it's quite a pain to me in its current form. Just not right now because it's too late to touch it.

Nobody's even tried to do this side of things before. They always got bogged down in trying to parse config files and such.

It's actually because most of them were working in Perl, which encourages deviant behavior where people delight in converting useful ideas into illegible punctuation rather than actually getting anything done. Except for that other Greg around here who's not involved in this discussion, his Perl is pretty good.

[1] Josh is being aggressively bulked up right now for his next sumo match.

[2] Like a rat, which would give you an excuse to add the long overdue PL/Ratfor.

[3] This wouldn't actually help them learn anything, but it would make their heads explode at which point all their problems are gone.

--
* Greg Smith [EMAIL PROTECTED] http://www.gregsmith.com Baltimore, MD

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers