Re: [HACKERS] Per table autovacuum vacuum cost limit behaviour strange

Gregory Smith Sun, 21 Sep 2014 22:01:54 -0700

On 8/28/14, 12:18 PM, Robert Haas wrote:

At least in situations that I've encountered, it's typical to be ableto determine the frequency with which a given table needs to bevacuumed to avoid runaway bloat, and from that you can work backwardsto figure out how fast you must process it in MB/s, and from there youcan work backwards to figure out what cost delay will achieve thateffect. But if the system tinkers with the cost delay under the hood,then you're vacuuming at a different (slower) rate and, of course, thetable bloats.

The last time I took a whack at this, I worked toward making all of theparameters operate in terms of target MB/s, for exactly this style ofthinking and goal. Those converted into the same old mechanism underthe hood and I got the math right to give the same behavior for thesimple cases, but that could have been simplified eventually. Iconsider that line of thinking to be the only useful one here.

The answer I like to these values that don't inherit as expected in theGUC tree is to nuke that style of interface altogether in favor ofsimplifer bandwidth measured one, then perhaps add multiple QoS levels.Certainly no interest in treating the overly complicated innards of costcomputation as a bug and fixing them with even more complicated behavior.

The part of this I was trying hard to find time to do myself by the nextCF was a better bloat measure tool needed to actually see the problembetter. With that in hand, and some nasty test cases, I wanted to comeback to simplified MB/s vacuum parameters with easier to understandsharing rules again. If other people are hot to go on that topic, Idon't care if I actually do the work; I just have a pretty clear view ofwhat I think people want.

The only plausible use case for setting a per-table rate that I cansee is when you actually want the system to use that exact rate forthat particular table. That's the main one, for these must run onschedule or else jobs.

Yes.

On 8/29/14, 9:45 AM, Alvaro Herrera wrote:

Anyway it seems to me maybe there is room for a new table storage
parameter, say autovacuum_do_balance which means to participate in the
balancing program or not.


If that eliminates some of the hairy edge cases, sure.

A useful concept to consider is having a soft limit that most thing workagainst, along with a total hard limit for the server. When one ofthese tight schedule queries with !autovacuum_do_balance starts, theymust run at their designed speed with no concern for anyone else. Whichmeans:

a) Their bandwidth gets pulled out of the regular, soft limit numbersuntil they're done. Last time I had one of these jobs, once the bigimportant boys were running, everyone else in the regular shared setwere capped at vacuum_cost_limit=5 worth of work. Just enough to keepup with system catalog things, and over the course of many hours processsmall tables.

b) If you try to submit multiple locked rate jobs at once, and the totalgoes over the hard limit, they have to just be aborted. If the rush ofusers comes back at 8AM, and you can clean the table up by then if yougive it 10MB/s, what you cannot do is let some other user decrease yourrate such that you're unfinished at 8AM. Then you'll have aggressive AVcompeting against the user load you were trying to prepare for. It'sbetter to just throw a serious error that forces someone to look at thehard limit budget and adjust the schedule instead. The systems withthis sort of problem are getting cleaned up every single day, almostcontinuously; missing a day is not bad as long as it's noted and fixedagain before the next cleanup window.



--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers

Re: [HACKERS] Per table autovacuum vacuum cost limit behaviour strange

Reply via email to