Re: Multiple Query IDs for a rewritten parse tree

Andrey V. Lepikhov Sun, 09 Jan 2022 20:11:33 -0800

On 1/9/22 5:13 PM, Julien Rouhaud wrote:

For now the queryid mixes two different things: fingerprinting and query text
normalization.  Should each calculation method be allowed to do a different
normalization too, and if yes where should be stored the state data needed for
that?  If not, we would need some kind of primary hash for that purpose.

Do You mean JumbleState?

I think, registering queryId generator we should store also a pointer(void **args) to an additional data entry, as usual.

Looking at Andrey's use case for wanting multiple hashes, I don't think that
adaptive optimization needs a normalized query string.  The only use would be
to output some statistics, but this could be achieved by storing a list of
"primary queryid" for each adaptive entry.  That's probably also true for
anything that's not monitoring intended.  Also, all monitoring consumers should
probably agree on the same queryid, both fingerprint and normalized string, as
otherwise it's impossible to cross-reference metric data.

I can add one more use case.

Our extension for freezing query plan uses query tree comparisontechnique to prove, that the plan can be applied (and we don't need toexecute planning procedure at all).The procedure of a tree equality checking is expensive and we usecheaper queryId comparison to identify possible candidates. So here, forthe better performance and queries coverage, we need to use query treenormalization - queryId should be stable to some modifications in aquery text which do not change semantics.As an example, query plan with external parameters can be used toexecute constant query if these constants correspond by place and typeto the parameters. So, queryId calculation technique returns alsopointers to all constants and parameters found during the calculation.


--
regards,
Andrey Lepikhov
Postgres Professional

Re: Multiple Query IDs for a rewritten parse tree

Reply via email to