[ 
https://issues.apache.org/jira/browse/PIG-3017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13487186#comment-13487186
 ] 

Jonathan Coveney commented on PIG-3017:
---------------------------------------

Well, I don't know the absolute size because I had a script where the JobConf 
was failing out at about 6.5MB...I'm not sure if it fails as soon as it crosses 
the thresh-hold, or if it fails after serializing everything. That said, after 
this patch, the same JobConf was 600KB, so about 10x (note that I also changed 
it to use Base64 encoding). Also, as far as serialization time, it's still in 
the realm  of ~5MB, so compression time is negligible. I did not do extensive 
testing around the specifics, though. 
                
> Pig's object serialization should use compression
> -------------------------------------------------
>
>                 Key: PIG-3017
>                 URL: https://issues.apache.org/jira/browse/PIG-3017
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Jonathan Coveney
>            Assignee: Jonathan Coveney
>             Fix For: 0.12
>
>         Attachments: PIG-3017-0.patch
>
>
> We have run into cases of very large JobConf objects, and part of this is the 
> fact that serialized objects are quite large. There is no reason not to use 
> compression here, and ratios should be quite high.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to