Re: [HACKERS] Cached Query Plans (was: global prepared statements)

PFC Sun, 13 Apr 2008 05:29:41 -0700

On Fri, Apr 11, 2008 at 12:34 PM, PFC <[EMAIL PROTECTED]> wrote:
        Well, I realized the idea of global prepared statements actually
sucked, so I set on another approach thanks to ideas from this list: this is
caching query plans.
Well, that's a blatantly bad realization. Perhaps you should do more research.

No, what I meant is that the "global prepared statements" as I tried to implement them before weren't that good...

I think simple caching based on the query text itself is preferable to having to name each of your queries, extract them from your programs and replace them with EXECUTEs, issue a "create statement" command for each of them, etc. Few people would actually use that feature because it would mean lots of modifications to the application, so all the applications that have to be compatible with other databases would not use the feature (*).

It could be useful for permissions and fine-grained access control, though, but views and stored procs already provide that functionality...

(*) Note that caching the plans based on the query text (with $ params) from a Parse message will not provide caching for old-school queries with params embedded in the text as escaped strings. This is good, because it means the safer solution (using $ params) will also be the faster solution. And in the application, only a very small part of the code needs to be changed: the DB abstraction layer.
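To illustrate the footnote, here is a minimal Python sketch, assuming the cache key is the exact query text. The helper `distinct_plans_needed` and the sample queries are hypothetical, made up for illustration only; this is not PostgreSQL code.

```python
def distinct_plans_needed(query_texts):
    """Number of cache entries needed when the key is the exact query text."""
    return len(set(query_texts))

user_ids = [1, 2, 3]

# Parameters sent separately from the text: the text never changes.
parameterized = ["SELECT * FROM users WHERE id = $1" for _ in user_ids]

# Values pasted into the text as escaped strings: one text per value.
inlined = [f"SELECT * FROM users WHERE id = '{uid}'" for uid in user_ids]

print(distinct_plans_needed(parameterized))  # 1
print(distinct_plans_needed(inlined))        # 3
```

With a text-keyed cache, the parameterized form would hit the same entry every time, while the inlined form would never hit.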

 Doesn't Oracle do this now transparently to clients?


Of course it does, and it has since the late 80's I believe.

 Oracle keeps a statement/plan cache in its shared memory segment (SGA)
 that greatly improves its performance at running queries that don't
 change very often.


        Can we have more details on how Oracle does it? For "inspiration"...

        Here is what I'm thinking about:

Don't flame me too much about implementation issues, this is just throwing ideas in the air to see where they'll fall ;)

* global plan cache in shared memory, implemented as a hashtable, the hash key being (search_path, query_string).

Doubt: can a plan be stored in shared memory? Will it have to be copied to local memory before being executed?


This stores:
- the plans (not for all keys, see below)
- the stats:
        - number of times this query has been executed,
        - total, min and max wallclock time and CPU time spent planning this query,
        - total, min and max wallclock time, CPU time and RAM spent executing this query,
        - total, min and max number of rows returned,
        - timestamp of the last execution of this query.
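As a rough picture of what one hashtable entry could hold, here is a Python sketch. All names (`MinMaxTotal`, `CacheEntry`, `record_execution`) are hypothetical, and for brevity it only tracks wallclock time and row counts, not the CPU time or RAM listed above.

```python
import time
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class MinMaxTotal:
    """Running total, minimum and maximum of one measured quantity."""
    total: float = 0.0
    minimum: float = float("inf")
    maximum: float = float("-inf")

    def add(self, value: float) -> None:
        self.total += value
        self.minimum = min(self.minimum, value)
        self.maximum = max(self.maximum, value)

@dataclass
class CacheEntry:
    plan: Optional[object] = None  # not every key has a stored plan
    executions: int = 0
    planning_time: MinMaxTotal = field(default_factory=MinMaxTotal)
    execution_time: MinMaxTotal = field(default_factory=MinMaxTotal)
    rows_returned: MinMaxTotal = field(default_factory=MinMaxTotal)
    last_executed: Optional[float] = None

    def record_execution(self, seconds: float, rows: int) -> None:
        self.executions += 1
        self.execution_time.add(seconds)
        self.rows_returned.add(rows)
        self.last_executed = time.time()

# The hashtable itself, keyed by (search_path, query_string).
cache = {}
key = ("public", "SELECT * FROM users WHERE id = $1")
entry = cache.setdefault(key, CacheEntry())
entry.record_execution(seconds=0.002, rows=1)
entry.record_execution(seconds=0.004, rows=1)
print(entry.executions)  # 2
```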

There should be separate GUCs to control this:
        - should the whole thing be activated?
        - should the cache be active, or just the stats? And which stats?

There should also be a way to query this to display the statistics (i.e. "what query is killing my server?"), and a way to purge old plans.


* every time a Parse message comes up:
- look if the (search_path, query_string) is in the cache
- if it is in the cache:
        - if there is a cached plan, make the unnamed statement point to it, and we're done.
        - if there is no cached plan, prepare the query, and put it in the unnamed statement.

Now, the query has been parsed, so we can decide if it is cacheable. Should this be done in Parse, in Bind, or somewhere else? I have no idea.

For instance, queries which contain VALUES() or IN( list of consts ) should not be cached: since the IN() list is likely to change all the time, it would just trash the cache. Using =ANY( $1 ) instead will work with cached plans.
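On the application side, the rewrite from IN( list of consts ) to =ANY( $1 ) could look like the Python sketch below. The table `items`, the helper names, and the convention of returning a (text, params) pair are assumptions for illustration; how the array parameter is actually sent depends on the driver.

```python
def in_list_query(ids):
    """Old style: constants pasted into the text, so the text varies per call."""
    return f"SELECT * FROM items WHERE id IN ({', '.join(str(i) for i in ids)})"

def any_query(ids):
    """Fixed text; the whole list travels as one array parameter."""
    return "SELECT * FROM items WHERE id = ANY($1)", [list(ids)]

batches = [[1, 2], [3, 4, 5], [6]]

in_texts = {in_list_query(batch) for batch in batches}
any_texts = {any_query(batch)[0] for batch in batches}

print(len(in_texts), len(any_texts))  # 3 1
```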

Also, will a plan to be cached have to be prepared with or without the parameters? That's also an interesting question...

Perhaps the user should also be able to specify whether to cache a plan or not, or whether to use the params or not, with hint flags in the query string?

(like MySQL, /* flags */ SELECT blah )

Now, if the query is cacheable, store it in the cache, and update the stats. If we decided to store the plan, do that too. For instance we might decide to store the plan only if this query has been executed a certain number of times, etc.

* In the Execute message, if a cached plan was used, execute it and update the stats (time spent, etc).
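The Parse/Execute flow above, with the "store the plan only after a certain number of executions" policy, can be sketched in Python as follows. Everything here is hypothetical (`PlanCache`, `Entry`, the `planner` callable, the threshold `PLAN_AFTER_EXECUTIONS`, which would presumably be a GUC), and the cacheability check is left out.

```python
from dataclasses import dataclass
from typing import Optional

PLAN_AFTER_EXECUTIONS = 3  # hypothetical threshold

@dataclass
class Entry:
    executions: int = 0
    plan: Optional[str] = None

class PlanCache:
    def __init__(self, planner):
        self.entries = {}       # (search_path, query_string) -> Entry
        self.planner = planner  # stand-in for the real planner
        self.plans_built = 0

    def parse(self, search_path, query_string):
        """Return a plan for the unnamed statement, reusing a cached one."""
        entry = self.entries.setdefault((search_path, query_string), Entry())
        if entry.plan is not None:
            return entry.plan  # cached plan: nothing to prepare
        plan = self.planner(query_string)
        self.plans_built += 1
        # Keep the plan only once the query has proven popular enough.
        if entry.executions >= PLAN_AFTER_EXECUTIONS:
            entry.plan = plan
        return plan

    def execute(self, search_path, query_string):
        """Update the stats after running the statement."""
        entry = self.entries.setdefault((search_path, query_string), Entry())
        entry.executions += 1

cache = PlanCache(planner=lambda text: f"plan for: {text}")
query = "SELECT * FROM users WHERE id = $1"
for _ in range(6):
    cache.parse("public", query)
    cache.execute("public", query)

print(cache.plans_built)  # 4
```

In this run the query is planned on each of the first four Parse messages; the fourth plan is kept, so the last two Parse messages reuse it.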

Now, about contention: since this is one shared hashtable for everyone, it will be fought for...

However, the lock on it is likely to be held for a very short time (much less than a microsecond), so would it be that bad?

Also, GUCs can be used to mitigate the contention: for instance, if the user is not interested in the stats, the thing becomes mostly read-only.

--
Sent via pgsql-hackers mailing list (pgsql-hackers@postgresql.org)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers
