[ https://issues.apache.org/jira/browse/PIG-2319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13190849#comment-13190849 ]
Dmitriy V. Ryaboy commented on PIG-2319:
----------------------------------------

Ok, so it looks like we'll automatically do the right thing for storage if we have output.compression.enabled and output.compression.codec set. We don't do the same for reading.

PIG-2143 had a sketch of how to make the whole thing a more flexible implementation (see the first comment). Should be straightforward enough to allow specifying "-compression=$foo" and have the codec for $foo looked up dynamically.

> Pig should support snappy as a value for pig.tmpfilecompression.codec
> ---------------------------------------------------------------------
>
>                 Key: PIG-2319
>                 URL: https://issues.apache.org/jira/browse/PIG-2319
>             Project: Pig
>          Issue Type: Improvement
>    Affects Versions: 0.8.1, 0.9.1
>            Reporter: Joe Crobak
>
> Utils.tmpFileCompressionCodec() hard-codes support for only "gz" and "lzo"
> compression. Since support for snappy was added in HADOOP-7206, it would be
> nice to allow this codec as well.
> A future-proof solution to this problem might let the user provide a full
> classname (like in the hadoop settings) or the short-hand, in case the
> short-hand doesn't exist for a given codec.
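
For illustration only, here is a rough sketch of the lookup the issue description asks for: accept either a short-hand or a full classname and resolve it to a codec class name. This is not the code in Utils.tmpFileCompressionCodec() and not the PIG-2143 sketch. The class name CodecResolver, the method resolve, and the "contains a dot means it is a classname" rule are all assumptions made for the example, and the short-hand table is my guess at a sensible mapping rather than anything Pig defines.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper, not part of Pig: maps a codec setting to a class name.
class CodecResolver {
    // Short-hand table (assumed mapping for this sketch).
    private static final Map<String, String> SHORT_NAMES = new HashMap<>();
    static {
        SHORT_NAMES.put("gz", "org.apache.hadoop.io.compress.GzipCodec");
        SHORT_NAMES.put("lzo", "com.hadoop.compression.lzo.LzoCodec");
        SHORT_NAMES.put("snappy", "org.apache.hadoop.io.compress.SnappyCodec");
    }

    // Returns the codec class name for a short-hand or a full classname.
    static String resolve(String value) {
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalArgumentException("No compression codec given");
        }
        String trimmed = value.trim();
        // Short-hands are matched case-insensitively.
        String known = SHORT_NAMES.get(trimmed.toLowerCase(Locale.ROOT));
        if (known != null) {
            return known;
        }
        // Assumption: anything containing a dot is a full classname and is
        // passed through unchanged, so new codecs need no code change here.
        if (trimmed.indexOf('.') >= 0) {
            return trimmed;
        }
        throw new IllegalArgumentException("Unknown compression codec: " + value);
    }

    public static void main(String[] args) {
        System.out.println(resolve("snappy"));
        System.out.println(resolve("org.example.MyCodec")); // made-up classname
    }
}
```

The returned name would still have to be loaded and checked by the caller, for example with Class.forName, which is where a missing codec jar would actually surface. Keeping the short-hand table separate from the pass-through is what makes this "future-proof" in the sense of the description: an unlisted codec only needs its full classname.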