[
https://issues.apache.org/jira/browse/PIG-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12872753#action_12872753
]
Jeff Zhang commented on PIG-1426:
---------------------------------
Alan, but when writing we use VInt, so reading is not affected. One problem I
can think of is incompatibility with the previous Tuple implementation that
uses Int: the VInt version of Tuple cannot read data written using Int.
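
To make the incompatibility concrete, here is a minimal sketch. It assumes the
old format stores the size as 4 big-endian bytes and that the new reader uses
a zero-compressed layout in the style of Hadoop's VInt. The class and method
names are hypothetical and are not taken from the patch, and the reader only
handles non-negative sizes.

```java
// Illustrative sketch only: names are hypothetical, not Pig's or the patch's API.
class TupleSizeCompat {

    // Old-style writer: size as a fixed 4-byte big-endian int
    // (the same layout DataOutput.writeInt produces).
    static byte[] writeFixedInt(int n) {
        return new byte[] {
            (byte) (n >>> 24), (byte) (n >>> 16), (byte) (n >>> 8), (byte) n
        };
    }

    // New-style reader: a first byte in -112..127 is the value itself;
    // otherwise the first byte is a marker (-112 - payloadLength) followed
    // by payloadLength big-endian bytes. Non-negative ints only.
    static int readVInt(byte[] data) {
        byte first = data[0];
        if (first >= -112) {
            return first;
        }
        int payloadLength = -112 - first;
        if (payloadLength > 4) {
            throw new IllegalArgumentException(
                "negative or long values are outside this sketch");
        }
        int value = 0;
        for (int i = 1; i <= payloadLength; i++) {
            value = (value << 8) | (data[i] & 0xFF);
        }
        return value;
    }
}
```

Under these assumptions, `readVInt(writeFixedInt(3))` returns 0 rather than 3:
the reader consumes only the leading zero byte of the old 4-byte size, and the
remaining three bytes would then be misread as tuple data.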
> Change the size of Tuple from Int to VInt when Serialize Tuple
> --------------------------------------------------------------
>
> Key: PIG-1426
> URL: https://issues.apache.org/jira/browse/PIG-1426
> Project: Pig
> Issue Type: Improvement
> Components: data
> Affects Versions: 0.8.0
> Reporter: Jeff Zhang
> Assignee: Jeff Zhang
> Fix For: 0.8.0
>
> Attachments: PIG_1426.patch
>
>
> Most of the time, the size of a tuple is not very large, so one byte is
> enough to store it. I therefore suggest using VInt instead of Int for the
> size of the tuple during serialization. Because the key type of the map
> output is Tuple, this can reduce the amount of data transferred from mapper
> to reducer.
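
The size saving described above can be sketched as follows. This is an
illustration under stated assumptions, not the code in the attached patch:
the class and method names are hypothetical, and the variable-length layout
is modeled on Hadoop's zero-compressed VInt for non-negative values only.

```java
// Illustrative sketch only: names are hypothetical, not Pig's or the patch's API.
class TupleSizeEncoding {

    // Fixed-width encoding: always 4 bytes, big-endian.
    static byte[] encodeFixedInt(int n) {
        return new byte[] {
            (byte) (n >>> 24), (byte) (n >>> 16), (byte) (n >>> 8), (byte) n
        };
    }

    // Variable-length encoding for non-negative sizes:
    // 0..127 fit in one byte; larger values use a marker byte
    // (-112 - payloadLength) followed by the big-endian payload bytes.
    static byte[] encodeVInt(int n) {
        if (n < 0) {
            throw new IllegalArgumentException("tuple size must be non-negative");
        }
        if (n <= 127) {
            return new byte[] { (byte) n };
        }
        // Count how many bytes are needed to hold the value.
        int payloadLength = 0;
        for (int tmp = n; tmp != 0; tmp >>>= 8) {
            payloadLength++;
        }
        byte[] out = new byte[1 + payloadLength];
        out[0] = (byte) (-112 - payloadLength);
        for (int i = 0; i < payloadLength; i++) {
            int shift = (payloadLength - 1 - i) * 8;
            out[1 + i] = (byte) (n >>> shift);
        }
        return out;
    }
}
```

In this sketch a tuple with 3 fields takes 1 byte for its size instead of 4,
so each serialized tuple with at most 127 fields saves 3 bytes. A size of 128
or more costs at least 2 bytes, and a very large size costs 5, one more than
the fixed-width form.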