[ https://issues.apache.org/jira/browse/PIG-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149421#comment-13149421 ]
Daniel Dai commented on PIG-2359: --------------------------------- Great first step! Some initial observations: 1. We can pass the schema into JobConf and read it back in BinInterSedes.setConf() 2. Support string in PrimitiveTuple is super helpful, is that in the plan? 3. BinInterSedes should take advantage of it to serialize/deserialize the PrimitiveTuple in a more efficient way Worth to mention we need to put fastutil_small.jar(http://www.java2s.com/Code/Jar/f/Downloadfastutilsmalljar.htm) into lib in order to compile. > Support more efficient Tuples when schemas are known > ---------------------------------------------------- > > Key: PIG-2359 > URL: https://issues.apache.org/jira/browse/PIG-2359 > Project: Pig > Issue Type: New Feature > Reporter: Dmitriy V. Ryaboy > Assignee: Dmitriy V. Ryaboy > Attachments: PIG-2359.1.patch > > > Pig Tuples have significant overhead due to the fact that all the fields are > Objects. > When a Tuple only contains primitive fields (ints, longs, etc), it's possible > to avoid this overhead, which would result in significant memory savings. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira