On 26/7/2025 05:09, Richard Guo wrote:
> Here, I'd like to discuss whether it's worthwhile to also consider
> pushing down a subquery relation's ppi_clauses if the subquery is
> LATERAL.
> In my opinion, this direction makes sense. Moreover, I have seen
> sophisticated cases where SQL Server pushes parameterisation through
> GROUP BY down into a subquery, significantly speeding up execution.
> First, it's important to note that pushing down ppi_clauses doesn't
> always result in a better execution plan. While doing so can reduce
> the amount of data processed in each aggregation invocation within the
> subquery, it also means that the aggregation needs to be re-evaluated
> for every outer tuple. If t1 is very small and t2 is large, pushing
> down ppi_clauses can be a win. As t1 gets larger, this gets less
> attractive, and eventually it will have a higher cost than the current
> plan, where the aggregation is evaluated only once.
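To make the tradeoff concrete, here is a hypothetical query shape (table and column names are mine, just for illustration) where both plans are possible:

```sql
-- Hypothetical schema: t1 is the outer relation, t2 the aggregated one.
SELECT t1.id, s.cnt
FROM t1,
     LATERAL (SELECT count(*) AS cnt
              FROM t2
              WHERE t2.ref = t1.id   -- candidate for a pushed-down ppi_clause
             ) s;
```

With the clause pushed down, the aggregate scans only the t2 rows matching the current t1 row, but it runs once per outer tuple; without pushdown, t2 can be aggregated once per ref value and the result joined to t1.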
Heh, let me propose a way to mitigate this issue that I implemented in a
Postgres fork. Instead of implementing numerous 'subplan flattening'
transformations, I found that we can smooth the performance cliff by
inserting a Memoise node on top of the subplan. It reduces the number of
subplan evaluations when outer parameter values repeat.
The design is close to the subplan hashing feature but, of course,
logically distinct: it requires a top-down step after the bottom-up
planning. It has some limitations, but if you have the resources to
restructure the planning procedure slightly, it may be feasible in
Postgres core as well.
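As a toy sketch of the idea (all names are mine, not Postgres internals): cache subplan results keyed by the parameter values, so that duplicated outer values skip re-evaluating the aggregation entirely.

```python
# Toy model of a Memoise node sitting on top of a correlated subplan.
# Illustrative only; this is not PostgreSQL code.

class MemoizedSubplan:
    def __init__(self, subplan):
        self.subplan = subplan      # callable: param tuple -> result
        self.cache = {}             # param tuple -> cached result
        self.evaluations = 0        # how many times the subplan really ran

    def execute(self, params):
        key = tuple(params)
        if key not in self.cache:
            self.evaluations += 1
            self.cache[key] = self.subplan(params)
        return self.cache[key]

# The subplan: an aggregation over t2 rows matching the outer parameter.
t2 = [1, 1, 2, 2, 2, 3]
subplan = lambda params: sum(1 for v in t2 if v == params[0])

node = MemoizedSubplan(subplan)
outer = [1, 2, 1, 2, 3, 1]          # duplicated parameter values
results = [node.execute((x,)) for x in outer]
print(results, node.evaluations)    # six outer rows, only three evaluations
```

The cliff is smoothed because the worst case degrades toward one evaluation per *distinct* parameter value rather than one per outer tuple.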
> Therefore, if we decide to pursue this approach, we would need to
> generate two paths: one with the ppi_clauses pushed down, and one
> without, and then compare their costs. A potential concern is that
> this might require re-planning the subquery twice, which could
> increase planning overhead.
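The cost comparison described above can be sketched with a toy model (the numbers and formulas are illustrative, not PostgreSQL's actual cost model):

```python
# Toy cost comparison between the two candidate paths.

def cost_pushed_down(n_outer, cost_agg_per_param):
    # Aggregation is re-evaluated for every outer tuple, but each
    # invocation processes only the matching subset of t2.
    return n_outer * cost_agg_per_param

def cost_unparameterized(cost_agg_full, n_outer, cost_join_per_row):
    # Aggregation runs once over all of t2; the result is then joined.
    return cost_agg_full + n_outer * cost_join_per_row

# Small t1: pushdown wins.
small = (cost_pushed_down(10, 5.0), cost_unparameterized(1000.0, 10, 0.1))
# Large t1: the performance cliff -- pushdown loses badly.
large = (cost_pushed_down(100000, 5.0),
         cost_unparameterized(1000.0, 100000, 0.1))
print(small, large)
```

The crossover point depends on the outer cardinality estimate, which is exactly why both paths have to be built and costed.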
Here, we also designed an approach that may provide some insights for
further core development.
Correlated subquery pull-up techniques always have bad corner cases
(like the one you describe). We added an extension list field to
PlannerGlobal and PlannedStmt, enabling such features to report
themselves to the upper-level caller. The caller may then build a series
of plans with and without these contradictory features applied and
compare their costs.
I implemented this 'plan probing' technique in GetCachedPlan, which
obviously has the best chance of being profitable: a cached plan is
reused multiple times, and the plan cache already has the infrastructure
to track previous planning efforts. At a high architectural level, it is
close to the current plan cache 'auto' mode logic: try options, compare
costs, and remember the decision.
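A minimal sketch of that probing loop, assuming a hypothetical plan-cache shape (none of these names are PostgreSQL internals):

```python
# Toy sketch of 'plan probing' in a plan cache: on first use, plan the
# query with and without the contradictory feature, keep the cheaper
# option, and remember the decision for later executions.

def make_plan(query, pushdown_enabled):
    # Stand-in for the planner: returns (plan label, estimated cost).
    cost = query["pushdown_cost"] if pushdown_enabled else query["plain_cost"]
    return ("pushdown" if pushdown_enabled else "plain", cost)

class PlanCache:
    def __init__(self):
        self.decisions = {}   # query id -> remembered choice

    def get_cached_plan(self, qid, query):
        if qid not in self.decisions:
            # Probe once: build both plans and compare their costs.
            candidates = [make_plan(query, True), make_plan(query, False)]
            best = min(candidates, key=lambda p: p[1])
            self.decisions[qid] = best[0]
        return make_plan(query, self.decisions[qid] == "pushdown")

cache = PlanCache()
q = {"pushdown_cost": 120.0, "plain_cost": 80.0}
print(cache.get_cached_plan("q1", q))   # cheaper 'plain' plan is chosen
print(cache.get_cached_plan("q1", q))   # decision is remembered, no re-probe
```

The probing cost is paid once per cached query, amortized over its later executions.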
I'm not sure it provides any answers - just existing techniques to ponder.
--
regards, Andrei Lepikhov