[ https://issues.apache.org/jira/browse/HIVE-5157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yin Huai updated HIVE-5157:
---------------------------

    Description:
If hive.groupby.skewindata=true, we should generate two MR jobs. But, ReduceSinkDeDuplication will merge these two into a single MR job.
Example: groupby2_map_skew.q and groupby2.q

  was:
If hive.groupby.skewindata=true, we should generate two MR jobs. But, ReduceSinkDeDuplication will merge these two into a single MR job.
Example: groupby2_map_skew.q

> ReduceSinkDeDuplication ignores hive.groupby.skewindata=true
> -------------------------------------------------------------
>
>                 Key: HIVE-5157
>                 URL: https://issues.apache.org/jira/browse/HIVE-5157
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Yin Huai
>            Assignee: Yin Huai
>
> If hive.groupby.skewindata=true, we should generate two MR jobs. But, ReduceSinkDeDuplication will merge these two into a single MR job.
> Example: groupby2_map_skew.q and groupby2.q