Anton Haidai created CALCITE-2666: ------------------------------------- Summary: JoinPushThroughJoinRule can't reach an optimal plan in some 3+ joins cases Key: CALCITE-2666 URL: https://issues.apache.org/jira/browse/CALCITE-2666 Project: Calcite Issue Type: Bug Components: core Reporter: Anton Haidai Assignee: Julian Hyde Attachments: calcite.join.pushdown.issue.png
For example, the input query is the following: {code:java} SELECT * FROM X INNER JOIN A ON X.id = A.id INNER JOIN Y ON X.id = Y.id INNER JOIN Z ON X.id = Z.id {code} According to the cost model used, it would be beneficial to push the "A" scan to the right node of the top join (grouping X, Y, Z scans in two bottom joins in any order). But this state is never reached, "A" scan could be pushed only one join up, but never two joins up. h2. Cause According to my debugging, the cause of the issue is the following. As far as the optimal state could hypothetically be achieved only by JoinPushThroughJoinRule.RIGHT, lets review only the behavior of this rule (while JoinPushThroughJoinRule.LEFT is also affected by the issue described). After each transformation, JoinPushThroughJoinRule.RIGHT not only swaps right nodes of joins, but also adds an additional project node on top of transformed joins. The rule expects the following input structure: {code:java} operand(LogicalJoin.class, operand(LogicalJoin.class, any()), operand(RelNode.class, any()) ) {code} But after applying the rule to two bottom joins, there will be an additional project between these joins and the top join, so the middle join is no longer the left input of the top join and the rule can't match and produce the optimal result. See the attachment for a visual representation of this explanation: !calcite.join.pushdown.issue.png! h2. -- This message was sent by Atlassian JIRA (v7.6.3#76005)