[ 
https://issues.apache.org/jira/browse/PIG-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159707#comment-13159707
 ] 

Dmitriy V. Ryaboy commented on PIG-2359:
----------------------------------------

That's a sexy idea, I like it -- especially since it will let us handle strings 
in addition to numbers.
We'll have to implement the same codegen on the deserialization side, or somehow 
serialize generated class names. That could get somewhat ugly. Doable, though. 
Could also try to serialize the codegenned classes using kryo.
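
To make the deserialization concern concrete, here is a minimal sketch in Java. 
It is not taken from the patch: IntLongTuple and TupleRoundTrip are hypothetical 
names, with IntLongTuple standing in for a class that codegen might emit for an 
(int, long) schema. It assumes the writer puts the class name ahead of the 
fields, and it does not use Pig's actual Tuple interface.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;

// Hypothetical stand-in for a generated class for the schema (int, long).
// Fields are primitives, so no Integer or Long objects are allocated.
final class IntLongTuple {
    int f0;
    long f1;

    IntLongTuple() {}

    IntLongTuple(int f0, long f1) {
        this.f0 = f0;
        this.f1 = f1;
    }

    // Writes the fields in schema order, with no per-field type tags.
    void write(DataOutput out) throws IOException {
        out.writeInt(f0);
        out.writeLong(f1);
    }

    // Reads the fields back in the same order they were written.
    void readFields(DataInput in) throws IOException {
        f0 = in.readInt();
        f1 = in.readLong();
    }
}

// Hypothetical helper showing the "serialize the class name" option.
final class TupleRoundTrip {
    static byte[] serialize(IntLongTuple t) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(bytes);
        // The reader needs this name to know which class to instantiate.
        out.writeUTF(t.getClass().getName());
        t.write(out);
        out.flush();
        return bytes.toByteArray();
    }

    // Succeeds only if a class with the written name is loadable here,
    // which is why the reading side needs the same codegen (or the class bytes).
    static IntLongTuple deserialize(byte[] data)
            throws IOException, ReflectiveOperationException {
        DataInputStream in = new DataInputStream(new ByteArrayInputStream(data));
        String className = in.readUTF();
        IntLongTuple t = (IntLongTuple) Class.forName(className)
                .getDeclaredConstructor()
                .newInstance();
        t.readFields(in);
        return t;
    }
}
```

In this sketch the round trip works because both sides share the class. With 
real generated classes, the reader would first have to regenerate or load the 
class named in the stream, which is the part that could get ugly.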

I'll finish up this patch and run some timing tests; unless you are ready to 
work on this right now, let's open a separate ticket for the codegen approach. 
                
> Support more efficient Tuples when schemas are known
> ----------------------------------------------------
>
>                 Key: PIG-2359
>                 URL: https://issues.apache.org/jira/browse/PIG-2359
>             Project: Pig
>          Issue Type: New Feature
>            Reporter: Dmitriy V. Ryaboy
>            Assignee: Dmitriy V. Ryaboy
>         Attachments: PIG-2359.1.patch
>
>
> Pig Tuples have significant overhead due to the fact that all the fields are 
> Objects.
> When a Tuple only contains primitive fields (ints, longs, etc), it's possible 
> to avoid this overhead, which would result in significant memory savings.


        
