[ 
https://issues.apache.org/jira/browse/HIVE-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16244259#comment-16244259
 ] 

Ashutosh Chauhan commented on HIVE-18008:
-----------------------------------------

{code}
joinInfo.rightSet().equals(ImmutableBitSet.range(rightAggregate.getGroupCount()));
{code}

A count check alone may not be sufficient; we should also verify that the join 
keys and the group-by keys are the same columns.
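
To illustrate the distinction, here is a minimal, self-contained sketch that uses java.util.BitSet in place of Calcite's ImmutableBitSet. The class and method names are hypothetical and are not taken from the patch. It assumes the aggregate emits its group-by keys at output positions 0 to groupCount - 1.

```java
import java.util.BitSet;

// Hypothetical illustration only; not part of HIVE-18008.1.patch.
public class SemiJoinKeyCheck {

    // True when the right-side join key positions are exactly the positions
    // [0, groupCount), i.e. (under the assumption above) the group-by columns.
    static boolean keysMatchGroupColumns(BitSet rightJoinKeys, int groupCount) {
        BitSet groupColumns = new BitSet();
        groupColumns.set(0, groupCount); // sets bits 0 .. groupCount - 1
        return rightJoinKeys.equals(groupColumns);
    }

    // Weaker check: compares only how many keys there are, not which ones.
    static boolean keyCountMatches(BitSet rightJoinKeys, int groupCount) {
        return rightJoinKeys.cardinality() == groupCount;
    }

    public static void main(String[] args) {
        BitSet keys = new BitSet();
        keys.set(0);
        keys.set(2); // join on output columns 0 and 2

        System.out.println(keyCountMatches(keys, 2));       // count-only check
        System.out.println(keysMatchGroupColumns(keys, 2)); // position check
    }
}
```

With join keys at positions 0 and 2 and two group-by keys, the count-only check passes while the position check fails, which is the case a count comparison alone would miss.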

> Add optimization rule to remove gby from right side of left semi-join
> ---------------------------------------------------------------------
>
>                 Key: HIVE-18008
>                 URL: https://issues.apache.org/jira/browse/HIVE-18008
>             Project: Hive
>          Issue Type: Improvement
>          Components: Query Planning
>            Reporter: Vineet Garg
>            Assignee: Vineet Garg
>         Attachments: HIVE-18008.1.patch
>
>
> A group by on the right side of a left semi join, on the same keys as the 
> semi join, is unnecessary and can be removed. This pattern appears in 
> subqueries with an explicit distinct keyword, e.g.
> {code:sql}
> explain select * from src b where b.key in (select distinct key from src a 
> where a.value = b.value)
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
