[ 
https://issues.apache.org/jira/browse/HDFS-2115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13108020#comment-13108020
 ] 

Michael Schmitz commented on HDFS-2115:
---------------------------------------

An easier feature might be to automatically select the proper codec from the 
file extension when a file is read as input to a job.  Also, when using 
streaming with compression you get the offset as the key, but not when you use 
an uncompressed TSV.  It would be nice if this behavior were uniform.
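
A rough illustration of the idea, not Hadoop code: the sketch below uses only 
the Python standard library, and the names CODECS_BY_EXTENSION and 
read_records are hypothetical. It picks a decompressor from the file 
extension and always yields (offset, line) pairs, so compressed and 
uncompressed inputs look the same to the caller. The offset here is assumed 
to be the byte position in the decompressed stream.

```python
import bz2
import gzip
import lzma
import os

# Hypothetical mapping from file extension to a stdlib opener.
# Anything not listed here is treated as uncompressed.
CODECS_BY_EXTENSION = {
    ".gz": gzip.open,
    ".bz2": bz2.open,
    ".xz": lzma.open,
}


def read_records(path):
    """Yield (offset, line) pairs for a text file, compressed or not.

    The codec is chosen from the file extension. The offset is the byte
    position of the line in the decompressed data (an assumption made for
    this sketch), so the key is the same whether or not the input is
    compressed.
    """
    _, ext = os.path.splitext(path)
    opener = CODECS_BY_EXTENSION.get(ext.lower(), open)
    offset = 0
    with opener(path, "rb") as stream:
        for raw in stream:
            yield offset, raw.rstrip(b"\n").decode("utf-8")
            offset += len(raw)
```

With this, a caller iterating over read_records("data.tsv") and 
read_records("data.tsv.gz") would see the same keys and values, provided both 
files hold the same data.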

> Transparent compression in HDFS
> -------------------------------
>
>                 Key: HDFS-2115
>                 URL: https://issues.apache.org/jira/browse/HDFS-2115
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>          Components: data-node, hdfs client
>            Reporter: Todd Lipcon
>
> In practice, we find that a lot of users store text data in HDFS without 
> using any compression codec. Improving usability of compressible formats like 
> Avro/RCFile helps with this, but we could also help many users by providing 
> an option to transparently compress data as it is stored.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
