[ 
https://issues.apache.org/jira/browse/TAJO-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14323790#comment-14323790
 ] 

Jihoon Son commented on TAJO-1310:
----------------------------------

Hi guys,
In this issue, I intended to improve the join operators so that they maintain 
their join filters themselves.
However, I found that this may cause a lot of bugs during join processing.
This problem is closely related to several bugs in FilterPushDown and 
JoinOrderAlgorithm.
IMHO, it would be better to take the time to fix these bugs completely.
So, I changed the fix version and the priority from 0.10 to 0.11 and from 
Blocker to Major, respectively.

I'll fix the bugs in FilterPushDown and JoinOrderAlgorithm in TAJO-1350 and 
TAJO-1352.

> Maintaining join filters in join operators
> ------------------------------------------
>
>                 Key: TAJO-1310
>                 URL: https://issues.apache.org/jira/browse/TAJO-1310
>             Project: Tajo
>          Issue Type: Improvement
>          Components: parser, physical operator, planner/optimizer
>            Reporter: Jihoon Son
>            Assignee: Jihoon Son
>             Fix For: 0.11
>
>
> *Introduction*
> A join statement can contain join predicates and join filters.
> Join predicates are evaluated while the join operation is performed, whereas 
> join filters are evaluated on the set of join results.
> Consider the following example join query:
> {noformat}
> default> select n_nationkey from nation left outer join region on n_nationkey = r_regionkey where r_regionkey is null;
> {noformat}
> In this query, the join predicates and filters are as follows:
> * Join predicates: n_nationkey = r_regionkey
> * Join filters: r_regionkey is null
>
> *Problem*
> Currently, in query plans, join filters are handled as selection operators, 
> while join predicates are maintained as member variables of join operators.
> This approach keeps the implementation simple, but makes it difficult to find 
> the selection operators that correspond to a given join operator, because 
> they are maintained separately.
> This problem is critical when the logical plan optimizer optimizes the join 
> order of a query statement that contains two or more joins, each of which has 
> join filters.
>
> *Solution*
> Join filters should be distinguished from selection filters, and maintained 
> in the corresponding join operators. To do this, we should add join filters to 
> the join expression, the logical join node, and several physical join 
> executors.
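
To make the distinction in the quoted description concrete, here is a minimal sketch of a left outer join operator that carries both its join predicate and its join filter. It is not Tajo code: the class name `LeftOuterJoin`, its fields, and the toy `nation`/`region` rows are all assumptions made up for illustration. It only shows the intended evaluation order, with the predicate used while joining and the filter applied to each joined row.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Iterator, List, Optional

Row = Dict[str, Optional[int]]


@dataclass
class LeftOuterJoin:
    """Hypothetical operator; the names are illustrative, not Tajo's API."""

    # Evaluated while joining: decides which (left, right) pairs match.
    predicate: Callable[[Row, Row], bool]
    # Evaluated on each joined row; kept here instead of in a separate selection.
    join_filter: Optional[Callable[[Row], bool]] = None

    def execute(
        self, left: Iterable[Row], right: List[Row], right_columns: List[str]
    ) -> Iterator[Row]:
        null_right: Row = {col: None for col in right_columns}
        for left_row in left:
            matches = [r for r in right if self.predicate(left_row, r)]
            # Outer join: an unmatched left row is kept, padded with NULLs.
            for right_row in matches or [null_right]:
                joined = {**left_row, **right_row}
                if self.join_filter is None or self.join_filter(joined):
                    yield joined


# Toy data (assumed for the example, not the real TPC-H tables).
nation = [{"n_nationkey": k} for k in range(8)]
region = [{"r_regionkey": k} for k in range(5)]

join = LeftOuterJoin(
    predicate=lambda l, r: l["n_nationkey"] == r["r_regionkey"],
    join_filter=lambda row: row["r_regionkey"] is None,
)
result = [row["n_nationkey"] for row in join.execute(nation, region, ["r_regionkey"])]
print(result)  # [5, 6, 7]
```

Because the filter lives in the same object as the predicate, anything that moves or reorders the join in this sketch would carry the filter along with it, which is the property the description above asks for.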



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
