[
https://issues.apache.org/jira/browse/HDFS-2115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108020#comment-13108020
]
Michael Schmitz commented on HDFS-2115:
---------------------------------------
An easier feature might be to automatically set up the proper codec when
reading the file extension as input to a job. Also, when using streaming with
compression you get the offset as the key, but not when you use an uncompressed
TSV. It would be nice if this behavior were uniform.
> Transparent compression in HDFS
> -------------------------------
>
> Key: HDFS-2115
> URL: https://issues.apache.org/jira/browse/HDFS-2115
> Project: Hadoop HDFS
> Issue Type: New Feature
> Components: data-node, hdfs client
> Reporter: Todd Lipcon
>
> In practice, we find that a lot of users store text data in HDFS without
> using any compression codec. Improving usability of compressible formats like
> Avro/RCFile helps with this, but we could also help many users by providing
> an option to transparently compress data as it is stored.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira