[
https://issues.apache.org/jira/browse/HIVE-1802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Siying Dong updated HIVE-1802:
------------------------------
Attachment: HIVE-1802.2.patch
Refactored PlanUtils a little bit. I didn't come up a straight forward way to
refactor to be a factory and make it clear. I tried to break up PlanUtils to
several classes, by those return TableDesc and ReduceSyncDesc, as well as
others. Hope it can be better maintained.
No function change from previous patch.
> Encode MapReduce Shuffling Keys Differently for Single string/bigint Key
> -------------------------------------------------------------------------
>
> Key: HIVE-1802
> URL: https://issues.apache.org/jira/browse/HIVE-1802
> Project: Hive
> Issue Type: Improvement
> Reporter: Siying Dong
> Assignee: Siying Dong
> Attachments: HIVE-1802.1.patch, HIVE-1802.2.patch
>
>
> Delimiters are not needed if we only have one shuffling key, and in the same
> time escaping delimiters are not needed. We can save some CPU time on
> serializing and shuffle slightly less amount of data to save memory footprint
> and network traffic.
> Also there is a bug that for group-by, we by mistake add a -1 to the end of
> the key and pay one more unnecessary mem-copy. Can be easily fixed.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.