[jira] [Commented] (HIVE-6222) Make Vector Group By operator abandon grouping if too many distinct keys

Jitendra Nath Pandey (JIRA) Sat, 22 Mar 2014 09:36:07 -0700

    [ 
https://issues.apache.org/jira/browse/HIVE-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944142#comment-13944142
 ]


Jitendra Nath Pandey commented on HIVE-6222:
--------------------------------------------

[~rhbutani] For too many distinct keys, map side grouping can become too slow 
in vectorized mode under memory pressure. This affects hive-0.13 as well, 
therefore we should port it to branch-0.13 as well.

> Make Vector Group By operator abandon grouping if too many distinct keys
> ------------------------------------------------------------------------
>
>                 Key: HIVE-6222
>                 URL: https://issues.apache.org/jira/browse/HIVE-6222
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor
>    Affects Versions: 0.13.0
>            Reporter: Remus Rusanu
>            Assignee: Remus Rusanu
>            Priority: Minor
>              Labels: vectorization
>             Fix For: 0.14.0
>
>         Attachments: HIVE-6222.1.patch, HIVE-6222.2.patch, HIVE-6222.3.patch, 
> HIVE-6222.4.patch, HIVE-6222.5.patch
>
>
> Row mode GBY is becoming a pass-through if not enough aggregation occurs on 
> the map side, relying on the shuffle+reduce side to do the work. Have VGBY do 
> the same.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

[jira] [Commented] (HIVE-6222) Make Vector Group By operator abandon grouping if too many distinct keys

Reply via email to