[ https://issues.apache.org/jira/browse/PIG-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13487186#comment-13487186 ]
Jonathan Coveney commented on PIG-3017:
---------------------------------------
Well, I don't know the absolute size, because the script I tested had a JobConf
that was failing at about 6.5MB. I'm not sure whether it fails as soon as it
crosses the threshold or only after serializing everything. That said, after
this patch the same JobConf was 600KB, so about a 10x reduction (note that I
also changed it to use Base64 encoding). As for serialization time, the data is
still only in the realm of ~5MB, so compression time is negligible. I did not do
extensive testing around the specifics, though.
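
For anyone following along, here is a minimal sketch of the general idea
(serialize, compress, then Base64-encode so the result fits in a String-valued
JobConf property). It is not the code from the attached patch: the class name
CompressedSerializer is hypothetical, and it assumes java.util.zip's Deflater
and java.util.Base64 (Java 8+), which may differ from what the patch uses.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.Base64;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.InflaterInputStream;

// Hypothetical helper, for illustration only; not Pig's actual serializer.
final class CompressedSerializer {

    private CompressedSerializer() {}

    // Java-serialize the object, deflate the bytes, and Base64-encode them.
    static String serialize(Serializable obj) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        // Closing the ObjectOutputStream also closes the DeflaterOutputStream,
        // which flushes the remaining compressed data into the byte buffer.
        try (ObjectOutputStream out =
                 new ObjectOutputStream(new DeflaterOutputStream(bytes))) {
            out.writeObject(obj);
        }
        return Base64.getEncoder().encodeToString(bytes.toByteArray());
    }

    // Reverse the steps: Base64-decode, inflate, then Java-deserialize.
    static Object deserialize(String encoded)
            throws IOException, ClassNotFoundException {
        byte[] compressed = Base64.getDecoder().decode(encoded);
        try (ObjectInputStream in = new ObjectInputStream(
                 new InflaterInputStream(new ByteArrayInputStream(compressed)))) {
            return in.readObject();
        }
    }
}
```

Base64 grows the compressed bytes by roughly a third, so the net saving depends
on how compressible the serialized objects are.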
> Pig's object serialization should use compression
> -------------------------------------------------
>
> Key: PIG-3017
> URL: https://issues.apache.org/jira/browse/PIG-3017
> Project: Pig
> Issue Type: Bug
> Reporter: Jonathan Coveney
> Assignee: Jonathan Coveney
> Fix For: 0.12
>
> Attachments: PIG-3017-0.patch
>
>
> We have run into cases of very large JobConf objects, and part of this is the
> fact that serialized objects are quite large. There is no reason not to use
> compression here, and ratios should be quite high.