[ https://issues.apache.org/jira/browse/HIVE-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13944142#comment-13944142 ]
Jitendra Nath Pandey commented on HIVE-6222: -------------------------------------------- [~rhbutani] For too many distinct keys, map side grouping can become too slow in vectorized mode under memory pressure. This affects hive-0.13 as well, therefore we should port it to branch-0.13 as well. > Make Vector Group By operator abandon grouping if too many distinct keys > ------------------------------------------------------------------------ > > Key: HIVE-6222 > URL: https://issues.apache.org/jira/browse/HIVE-6222 > Project: Hive > Issue Type: Sub-task > Components: Query Processor > Affects Versions: 0.13.0 > Reporter: Remus Rusanu > Assignee: Remus Rusanu > Priority: Minor > Labels: vectorization > Fix For: 0.14.0 > > Attachments: HIVE-6222.1.patch, HIVE-6222.2.patch, HIVE-6222.3.patch, > HIVE-6222.4.patch, HIVE-6222.5.patch > > > Row mode GBY is becoming a pass-through if not enough aggregation occurs on > the map side, relying on the shuffle+reduce side to do the work. Have VGBY do > the same. -- This message was sent by Atlassian JIRA (v6.2#6252)