I think Convention as a trait is something specific to Calcite (not a Volcano concept). Calcite uses it for federated queries over heterogeneous data sources. See Section 4 ("Traits") of the Calcite paper [1]; it explains the idea quite well.
[1] https://arxiv.org/pdf/1802.10233.pdf

> On Nov 1, 2019, at 1:42 AM, Stamatis Zampetakis <zabe...@gmail.com> wrote:
>
> @Vladimir: Given that you want to make the optimizer work in a bottom-up
> fashion, maybe it suffices to invert the comparator in the RuleQueue [1].
> For sure there are rules which would not be compatible with this change,
> but maybe for your project it makes sense. I never tried it myself, but I
> would be curious to see if it works.
>
> @Jinfeng: I guess that the reason we have the convention as a trait is
> that results from different systems need to be combined together.
> Given that these results arrive in different forms, we need a way to know
> how to transform one into the other. In some sense the convention trait
> can be thought of as similar to the "assembledness" property mentioned in
> the paper:
> "For query optimization in object-oriented systems, we plan on defining
> "assembledness" of complex objects in memory as a physical property and
> using the assembly operator described in ... as the enforcer for this
> property." [2]
> For sure I am not the best person to talk about this choice, so don't take
> anything that I say as 100% accurate. I am sure other people can provide a
> much better explanation.
>
> Best,
> Stamatis
>
> [1] https://github.com/apache/calcite/blob/65043f290a7be45a668cf941ab48ee3c30efbe4e/core/src/main/java/org/apache/calcite/plan/volcano/RuleQueue.java#L94
> [2] https://www.cse.iitb.ac.in/infolab/Data/Courses/CS632/Papers/Volcano-graefe.pdf
>
> On Fri, Nov 1, 2019 at 7:53 AM Jinfeng Ni <j...@apache.org> wrote:
>
>> I think "pull-up traits" is necessary, since the physical traits are
>> propagated upwards. However, I'm not fully convinced that "On Demand
>> Trait" is the best solution, as I feel it may restrict the choices the
>> planner may consider. Maybe after the proposed design doc is ready,
>> we can look into the details and reevaluate.
>>
>> One question that I have always had, ever since I started using Calcite a
>> couple of years ago, is about the concept of Convention being a trait,
>> and the introduction of AbstractConverter in Calcite's VolcanoPlanner.
>> Quickly looking through the original Volcano paper [1], or the newer one
>> [2], I did not see any mention of such concepts. The original paper has a
>> classification of "transformation rules", which operate on logical
>> relational expressions, and "implementation rules", which provide the
>> mapping to physical operators.
>>
>> 1. https://paperhub.s3.amazonaws.com/dace52a42c07f7f8348b08dc2b186061.pdf
>> 2. https://www.cse.iitb.ac.in/infolab/Data/Courses/CS632/Papers/volcano-prasan.pdf
>>
>> On Thu, Oct 31, 2019 at 2:40 PM Xiening Dai <xndai....@gmail.com> wrote:
>>>
>>> Actually we solved this problem in our setup using a mechanism called
>>> "Pull-Up Traits", which explores the possible trait sets of a child's
>>> input to decide the parent's physical properties. In order to determine
>>> a child's input trait, you have to look at the child's children, and so
>>> on all the way down to the leaf nodes or a barrier. A barrier is a rel
>>> node which cannot derive any traits regardless of its input. A good
>>> example would be a user-defined function which throws off any
>>> distribution or collation. Then we realized that just pulling up is not
>>> enough; sometimes we also need to look at the parent's requirements. So
>>> we are trying to solve this in a unified framework, which we call "On
>>> Demand Trait", and implement it as part of the framework so anyone can
>>> benefit. I hope Haisheng can share a design doc once we have more
>>> concrete ideas.
>>>
>>>
>>>> On Oct 31, 2019, at 11:37 AM, Jinfeng Ni <j...@apache.org> wrote:
>>>>
>>>> Hi Vladimir,
>>>>
>>>> The SubsetTransformer interface and the iteration over the RelNodes
>>>> within a RelSubset in Drill is implemented exactly to do this trait
>>>> propagation.
>>>> We also had to rely on AbstractConverter to fire the necessary rules
>>>> to avoid the CannotPlan issue. At some point, the Calcite community
>>>> chose to remove AbstractConverter and Drill had to add it back, which
>>>> is probably one of the main reasons for us to continue using a Calcite
>>>> fork. I still remember we constantly had to deal with the dilemma
>>>> between "CannotPlan" and long planning time due to the explored search
>>>> space.
>>>>
>>>> Glad to see more people are joining the effort to solve this long
>>>> overdue issue, something missing in Calcite's core optimizer framework
>>>> "since before Calcite was Calcite" (Jacques's words).
>>>>
>>>> Jinfeng
>>>>
>>>>
>>>> On Thu, Oct 31, 2019 at 3:38 AM Vladimir Ozerov <ppoze...@gmail.com> wrote:
>>>>>
>>>>> Hi Danny,
>>>>>
>>>>> Thank you very much for the links. What is described there is pretty
>>>>> much the same problem I describe, especially the discussion about
>>>>> trait propagation, as this is basically what I need: to explore the
>>>>> potential traits of children before optimizing parents. And this is
>>>>> basically what Drill already does with its SubsetTransformer:
>>>>> 1) There is a SubsetTransformer interface, which iterates over the
>>>>> physical relations of the given subset [1]
>>>>> 2) If you want to make a physical project, you iterate over the
>>>>> physical relations of the input subset and create possible physical
>>>>> projects [2]
>>>>> 3) But if you cannot find any physical input, then you trigger the
>>>>> creation of a "bad" physical project, which is very likely to have a
>>>>> poor cost because it cannot take advantage of the input's
>>>>> distribution information [3]
>>>>> So, step 2 is the trait-set propagation which is needed by many
>>>>> distributed engines. Step 3 is an attempt to work around the current
>>>>> VolcanoPlanner behavior, where a parent rule is fired only if the
>>>>> produced child node has a compatible trait set.
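The SubsetTransformer pattern described above can be sketched without any Calcite or Drill dependency. Everything below (the `Trait` enum, the `SubsetSketch` class) is a hypothetical stand-in for Drill's SubsetTransformer/RelSubset machinery, not its real API; it only models steps 1-3:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the contents of a RelSubset: each physical
// child is reduced here to the distribution trait a parent could reuse.
enum Trait { HASH_DISTRIBUTED, SINGLETON, ANY }

final class SubsetSketch {
  // Steps 1-2: iterate the physical rels of the input subset and emit one
  // candidate parent per child trait we can take advantage of.
  // Step 3: if the subset has no physical rels yet, fall back to a single
  // "bad" parent with the ANY trait (poor cost, no distribution info).
  static List<Trait> planParents(List<Trait> physicalChildren) {
    List<Trait> parents = new ArrayList<>();
    for (Trait child : physicalChildren) {
      parents.add(child); // candidate parent propagates the child's trait
    }
    if (parents.isEmpty()) {
      parents.add(Trait.ANY); // the "bad" fallback project of step 3
    }
    return parents;
  }
}
```

The point of the sketch is only the shape of the control flow: one candidate parent per known physical child, plus a pessimistic fallback when the subset is still purely logical.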
>>>>>
>>>>> I do not know Calcite's architecture that well, but at first glance
>>>>> the proposed ability to re-fire rules of a specific rel seems good
>>>>> enough. It doesn't expand the search space, because no new nodes are
>>>>> created, and it seems to be relatively simple to implement.
>>>>>
>>>>> [1] https://github.com/apache/drill/blob/1.16.0/exec/java-exec/src/main/java/org/apache/drill/exec/planner/physical/SubsetTransformer.java
>>>>> [2] https://github.com/apache/drill/blob/1.16.0/exec/java-exec/src/main/java/org/apache/drill/exec/planner/physical/ProjectPrule.java#L66
>>>>> [3] https://github.com/apache/drill/blob/1.16.0/exec/java-exec/src/main/java/org/apache/drill/exec/planner/physical/ProjectPrule.java#L69
>>>>>
>>>>> On Thu, Oct 31, 2019 at 12:21, Danny Chan <yuzhao....@gmail.com> wrote:
>>>>>
>>>>>> Thanks Vladimir for bringing up this discussion!
>>>>>>
>>>>>> Did you notice that there is a JIRA issue about this problem? [1]
>>>>>> There is also a discussion about how to propagate the traits [2].
>>>>>>
>>>>>> [1] https://issues.apache.org/jira/browse/CALCITE-2970
>>>>>> [2] https://ponymail-vm.apache.org/_GUI_/thread.html/79dac47ea50b5dfbd3f234e368ed61d247fb0eb989f87fe01aedaf25@%3Cdev.calcite.apache.org%3E
>>>>>>
>>>>>> Best,
>>>>>> Danny Chan
>>>>>> On Oct 31, 2019, 3:56 PM +0800, Vladimir Ozerov <ppoze...@gmail.com> wrote:
>>>>>>> Hi colleagues,
>>>>>>>
>>>>>>> I would like to discuss with the community the possibility of
>>>>>>> adding a new public method to VolcanoPlanner which will forcefully
>>>>>>> re-trigger the rules for a specific rel. This is a follow-up of a
>>>>>>> discussion started in the other thread [1].
>>>>>>>
>>>>>>> **Problem statement**
>>>>>>> When converting between conventions during optimization,
>>>>>>> VolcanoPlanner prefers the top-down approach, so that nodes are
>>>>>>> converted starting from the root.
>>>>>>> But in some cases, an intermediate node must be converted after its
>>>>>>> children. This is especially true for distributed SQL engines,
>>>>>>> which rely on distribution traits during the optimization process:
>>>>>>> it is not possible to efficiently choose a proper physical
>>>>>>> implementation of a parent node unless the physical representation
>>>>>>> of a child node is known.
>>>>>>>
>>>>>>> It seems that presently VolcanoPlanner cannot address such cases by
>>>>>>> default. Consider that we have two nodes and associated rules which
>>>>>>> convert them to physical counterparts:
>>>>>>> [LogicalParent <- LogicalChild]
>>>>>>> The parent should be converted after the child. When the child is
>>>>>>> converted, the physical node is created:
>>>>>>> [LogicalParent <- {LogicalChild, PhysicalChild}]
>>>>>>> In order to finish the optimization process, we need to convert the
>>>>>>> parent. But the parent's rules are not fired, because PhysicalChild
>>>>>>> has traits incompatible with LogicalParent.
>>>>>>>
>>>>>>> Presently the problem can be solved in two ways:
>>>>>>> 1) Always produce conversions when going top-down. In this case, I
>>>>>>> first create a physical parent, then a physical child. The problem
>>>>>>> is that the created parent is not optimal, because it didn't know
>>>>>>> the child's distribution at the time of creation. So the full flow
>>>>>>> would be: create a bad parent, create a good child, create a good
>>>>>>> parent.
>>>>>>> 1.1) [LogicalParent <- LogicalChild]
>>>>>>> 1.2) [{LogicalParent, PhysicalParentBad} <- LogicalChild]
>>>>>>> 1.3) [{LogicalParent, PhysicalParentBad} <- {LogicalChild, PhysicalChild}]
>>>>>>> 1.4) [{LogicalParent, PhysicalParentBad, PhysicalParentGood} <- {LogicalChild, PhysicalChild}]
>>>>>>> What is worse, the creation of a non-optimal parent will trigger
>>>>>>> rules on its parent, which in turn may create a non-optimal
>>>>>>> parent-parent node, etc.
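To make the "bad parent" concrete, here is a minimal, Calcite-free sketch of why the parent's cost depends on the child's distribution trait. All names and cost numbers are purely illustrative; a real engine would express this through trait sets and a cost model:

```java
// Illustrative distribution trait of a physical child node.
enum Distribution { HASHED_ON_JOIN_KEY, RANDOM }

final class CostSketch {
  // If the child is already hashed on the key the parent needs, the parent
  // can be a purely local operator; otherwise it must insert an exchange
  // (shuffle). A parent created before the child's distribution is known
  // has to assume the expensive case -- that is the "bad" parent.
  static int parentCost(Distribution childDistribution) {
    boolean needsExchange = childDistribution != Distribution.HASHED_ON_JOIN_KEY;
    return needsExchange ? 100 : 10; // arbitrary illustrative cost units
  }
}
```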
>>>>>>>
>>>>>>> 2) Make sure that your convention returns true for
>>>>>>> Convention.canConvertConvention. In this case, every time a new rel
>>>>>>> is added to a RelSet, its trait set will be compared to the trait
>>>>>>> sets of all other rels in the set. For every pair of trait sets we
>>>>>>> may ask the engine to create a relevant AbstractConverter. The net
>>>>>>> effect is that a "physical-to-logical" converter is created, which
>>>>>>> re-triggers the rule on the logical parent since their conventions
>>>>>>> are compatible:
>>>>>>> 2.1) [LogicalParent <- LogicalChild]
>>>>>>> 2.2) [LogicalParent <- {LogicalChild, PhysicalChild}]
>>>>>>> 2.3) [LogicalParent <- {LogicalChild, PhysicalChild, PhysicalToLogicalConverter}]
>>>>>>> 2.4) [{LogicalParent, PhysicalParent} <- {LogicalChild, PhysicalChild, PhysicalToLogicalConverter}]
>>>>>>>
>>>>>>> The problem with that approach is that it is too coarse-grained,
>>>>>>> since we operate on trait sets rather than rels. As a result,
>>>>>>> additional memory and CPU pressure is introduced, because usually
>>>>>>> too many converters are created, which triggers the rules over and
>>>>>>> over again.
>>>>>>>
>>>>>>> **Affected products**
>>>>>>> At the moment two distributed engines are being developed for
>>>>>>> Hazelcast and Apache Ignite. Both require bottom-up optimization
>>>>>>> and currently rely on the second workaround.
>>>>>>> Another example is Apache Drill. I do not know whether its
>>>>>>> community is concerned with the issue, but it also uses bottom-up
>>>>>>> optimization for many rules and employs both of the aforementioned
>>>>>>> workarounds. As a result, I guess that Drill's optimizer also
>>>>>>> creates too many rels during optimization and suffers from a huge
>>>>>>> search space. Please correct me if I am wrong.
>>>>>>>
>>>>>>> **Proposal**
>>>>>>> The key problem is that there is no way to re-fire rules on a
>>>>>>> specific node.
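As an aside on workaround 2 above: it amounts to answering "yes" to the planner's conversion hooks on your Convention. The sketch below uses local stub types so it stands alone; the two method names mirror hooks that exist on Calcite's org.apache.calcite.plan.Convention interface (canConvertConvention and useAbstractConvertersForConversion), but everything else here is a hypothetical stand-in:

```java
// Local stand-ins for Calcite's Convention/RelTraitSet; illustrative only.
interface TraitSet {}
interface Convention {
  boolean canConvertConvention(Convention toConvention);
  boolean useAbstractConvertersForConversion(TraitSet from, TraitSet to);
}

final class MyPhysicalConvention implements Convention {
  // Returning true asks the planner to consider converting between this
  // convention and others, so AbstractConverters get registered...
  @Override public boolean canConvertConvention(Convention toConvention) {
    return true;
  }

  // ...and this hook decides for which trait-set pairs the converters are
  // actually created. Returning true for every pair is exactly what makes
  // the workaround coarse-grained: many converters, many re-fired rules.
  @Override public boolean useAbstractConvertersForConversion(TraitSet from, TraitSet to) {
    return true;
  }
}
```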
>>>>>>> The original problem could have been solved if it were possible to
>>>>>>> re-fire rules on a LogicalParent without creating any additional
>>>>>>> rels. That would lead to a clear optimization flow:
>>>>>>> 2.1) [LogicalParent <- LogicalChild]
>>>>>>> 2.2) [LogicalParent <- {LogicalChild, PhysicalChild}]
>>>>>>> 2.3) [{LogicalParent, PhysicalParent} <- {LogicalChild, PhysicalChild}]
>>>>>>>
>>>>>>> We could add the following method to the VolcanoPlanner
>>>>>>> (RelOptPlanner?) interface:
>>>>>>> void fireRules(RelNode rel)
>>>>>>>
>>>>>>> This method would fire the rules on the passed node in a deferred
>>>>>>> mode, as if it were a new node just added to the optimizer. This
>>>>>>> would require slight changes to the RuleQueue.addMatch method, and
>>>>>>> possibly some other places.
>>>>>>>
>>>>>>> Usage example:
>>>>>>> class PhysicalChildRule extends RelOptRule {
>>>>>>>   void onMatch(RelOptRuleCall call) {
>>>>>>>     LogicalChild logicalRel = call.rel(0);
>>>>>>>     PhysicalChild physicalRel = ...;
>>>>>>>
>>>>>>>     call.transformTo(physicalRel);
>>>>>>>     optimizer.fireRules(logicalRel);
>>>>>>>   }
>>>>>>> }
>>>>>>>
>>>>>>> What does the community think about such a method? Does it make any
>>>>>>> sense to you? If not, are you aware of any other ways to organize
>>>>>>> bottom-up optimization with VolcanoPlanner without the creation of
>>>>>>> additional rels?
>>>>>>>
>>>>>>> If the community is OK with this in general, I can try to create a
>>>>>>> PR with a prototype.
>>>>>>>
>>>>>>> Would appreciate your feedback.
>>>>>>>
>>>>>>> Regards,
>>>>>>> Vladimir.
>>>>>>>
>>>>>>> [1] https://ponymail-vm.apache.org/_GUI_/thread.html/13e7ab2040bfa4902db6647992ec4203ceb0262cfcb28d38ef7e9e32@%3Cdev.calcite.apache.org%3E
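The proposed semantics ("fire in deferred mode, as if the node were new") can be modeled with a tiny queue. Nothing below is Calcite API; MiniPlanner is a made-up illustration showing that fireRules re-enqueues an existing node through the same deferred match queue a newly registered node would go through, without creating any new rels:

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

final class MiniPlanner {
  private final Deque<String> matchQueue = new ArrayDeque<>();
  private final List<String> fired = new ArrayList<>();

  // Registering a new node enqueues its rule matches (deferred firing).
  void register(String rel) { matchQueue.add(rel); }

  // Proposed addition: re-enqueue an EXISTING node the same way,
  // so its rules fire again without any new rels being created.
  void fireRules(String rel) { matchQueue.add(rel); }

  // Drain the queue, analogous to the planner draining its rule queue.
  List<String> run() {
    while (!matchQueue.isEmpty()) {
      fired.add("match(" + matchQueue.poll() + ")");
    }
    return fired;
  }
}
```

In this model, calling fireRules("LogicalParent") after the child's physical rel is produced gives the parent's rules a second chance at exactly the moment the child's traits are known, which is the clean bottom-up flow steps 2.1-2.3 above describe.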