[ 
https://issues.apache.org/jira/browse/HIVE-2340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13556620#comment-13556620
 ] 

Yin Huai commented on HIVE-2340:
--------------------------------

The current implementation of the patch of YSmart covers scenarios when a join 
or aggregation operator share the same partition keys with its all parents 
(join or aggregation operators). 
For example, a single MR job will be generated if all operators in the 
following plan share the same partition keys.
{code}
JOIN----                    
       \                          
        ----JOIN----  
       /            \              
GBY----              \
                      ----JOIN
                     /
GBY--- -------------/
{code}


Also, it requires that the bottom join or aggregation operators which will be 
processed in the same MR job take input tables instead of intermediate tables. 
In future, it should be extended to cover scenarios that involve intermediate 
tables, that correlated operators share common partition keys (not exactly the 
same keys), and that a join or aggregation operator share common keys with some 
of its parents. 
                
> optimize orderby followed by a groupby
> --------------------------------------
>
>                 Key: HIVE-2340
>                 URL: https://issues.apache.org/jira/browse/HIVE-2340
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor
>            Reporter: Navis
>            Assignee: Navis
>            Priority: Minor
>              Labels: perfomance
>         Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.1.patch, 
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.2.patch, 
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.3.patch, 
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.4.patch, 
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.5.patch, HIVE-2340.1.patch.txt
>
>
> Before implementing optimizer for JOIN-GBY, try to implement RS-GBY 
> optimizer(cluster-by following group-by).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to