[ https://issues.apache.org/jira/browse/HIVE-1802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965021#action_12965021 ]
He Yongqiang commented on HIVE-1802: ------------------------------------ Yes. I am ok with the current approach. (moving forward, we still need to figure out a better way which can be more easy to maintain and extend. Like, we may want to try to separate serdes used for group-by and join. If we do that in the current approach, we need to have 4 serdes for reduce-sink.) > Encode MapReduce Shuffling Keys Differently for Single string/bigint Key > ------------------------------------------------------------------------- > > Key: HIVE-1802 > URL: https://issues.apache.org/jira/browse/HIVE-1802 > Project: Hive > Issue Type: Improvement > Reporter: Siying Dong > Assignee: Siying Dong > Attachments: HIVE-1802.1.patch, HIVE-1802.2.patch > > > Delimiters are not needed if we only have one shuffling key, and in the same > time escaping delimiters are not needed. We can save some CPU time on > serializing and shuffle slightly less amount of data to save memory footprint > and network traffic. > Also there is a bug that for group-by, we by mistake add a -1 to the end of > the key and pay one more unnecessary mem-copy. Can be easily fixed. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.