[ https://issues.apache.org/jira/browse/PIG-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159707#comment-13159707 ]
Dmitriy V. Ryaboy commented on PIG-2359: ---------------------------------------- That's a sexy idea, I like it -- especially since it will let us handle strings in addition to numbers. We'll have to implement the same codegen on the deseralization side, or somehow serialize generated class names.. that could get somewhat ugly. Doable, though. Could also try to serialize the codegenned classes using kryo. I'll finish up this patch and run some timing tests; unless you are ready to work on this right now, let's open a separate ticket for the codegen approach. > Support more efficient Tuples when schemas are known > ---------------------------------------------------- > > Key: PIG-2359 > URL: https://issues.apache.org/jira/browse/PIG-2359 > Project: Pig > Issue Type: New Feature > Reporter: Dmitriy V. Ryaboy > Assignee: Dmitriy V. Ryaboy > Attachments: PIG-2359.1.patch > > > Pig Tuples have significant overhead due to the fact that all the fields are > Objects. > When a Tuple only contains primitive fields (ints, longs, etc), it's possible > to avoid this overhead, which would result in significant memory savings. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira