[HACKERS] - Proposal for repreparing prepared statements

Stephen Marshall Wed, 13 Sep 2006 10:49:04 -0700

The following is a proposal for work I'd like to do to forcelong-running backend processes to reprepare their prepared statements.It would be used in cases where the user knows they have made a databasechange that will invalidate an existing prepared statement.

I look forward to comments from the community.
------------

I propose creating a new system administration function to forcerepreparation of prepared statements in all backends. The functionalitycould be extended to include re-initialization of other kinds of"per-backend" data.

This proposal addresses, to some degree, the prepare-alter-exec issuediscussed in various mailing list postings, and the following wish-listitem:

# Invalidate prepared queries, like INSERT, when the table definition isaltered

However, the solution would only be partial, as it would be theresponsibility of database clients to call the system administrationfunction when needed. Alternately, additional integration work could bedone to invoke this logic automatically whenever the columns of anytable are altered.


------
Here is what I propose:

We define a new system administration function calledpg_reload_per_backend_data. This function would work much likepg_reload_conf, i.e. it would require superuser privileges and wouldwork by sending a signal to the postmaster that would then be propagatedto all the child backends (but not the special ones, like thebgwriter). The signal handling logic for the backends would be modifiedto respond to the signal by reinitializing any data cached in thebackend's memory space, such as prepared statements. Each kind of datathat would be reinitialized would require special logic, as they wouldall be reinitialized in their own particular way.

Choosing an appropriate signal to send might be difficult, as the listof available signals is somewhat restricted. The "user-defined" signalswould be a natural choice, but it appears SIGUSR1 is used for "sinval"or catchup events, while SIGUSR2 is used for asynchronous notification.Use of the "real time" signals (signal numbers >= 32) might be possible,but could have portability problems. Another alternative would be tooverload SIGHUP, so that it causes both configuration reloads andreloading of per-backend data. This makes some sense, since mostconfiguration parameters are basically a special form of per-backenddata. However, changing the behavior of an existing signal might haveundesirable side effects. Overall, I'm very open to suggestionsregarding the appropriate signal to use.

To implement the repreparation logic, a new function calledRepreparePreparedStatements() could be added to source filesbackend/commands/prepare.[ch]. This function would be called by asignal handler installed the backends within backend/tcop/postgres.c.RepreparePreparedStatements would do the equivalent of iterating overthe prepared_queries hash table and executing DropPreparedStatement()and PrepareQuery on each. However, it is possible that some refactoringof the logic would be needed to improve performance and make the codemore robust.

The scope of pg_reload_per_backend_data could also be expanded toinclude reinitialization of other data that resides in the memory spaceof individual backend processes. An example of such cached entities arereusable modules associated with a particular procedural language, e.g.the TCL modules found in the table pltcl_modules. Once a such a moduleis used in a particular backend, it remains held in backend memory andchanges to the disk version are not noticed. There is also no way toundefine any global variables associated with such modules.

I have not given much consideration to the implementation for reloadingmodules, but doing the equivalent of the SQL command "LOAD '<libname>'for all dynamically loaded libraries should have the desired effect (atleast it does for the library that implements the PL/TCL language,pltcl.so). Perhaps the the general response should be to reload anylibraries that have been dynamically-loaded by the particular backend.


------
Here are few permutations of this plan that could be considered:

1. Bundle pg_reload_per_backend_data functionality with pg_reload_conf.

Pros: Avoids having to find an appropriate unused signal
     Logical consistancy with reloading config, which could be considered a
     special case of reloading per-backend data.
Cons: Changes behavior of an existing functionality, which has the risk of
     unintended side-effects.

Gives less fine-grained control over when per-backend data isreloaded.


2. Break pg_reload_per_backend_data functional into multiple functions.

Pros: Can assign more descriptive names to the functionality, e.g.
     pg_reload_ddl, pg_reprepare_statements, etc.
     Finer grained control over which kind of reloading is performed.
Cons: Require more use of the scarce list of available signals.




---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

              http://archives.postgresql.org

[HACKERS] - Proposal for repreparing prepared statements

Reply via email to