Joey Echeverria created FLUME-1239:
--------------------------------------

             Summary: Flume uses codec file extensions for compressed sequence 
files
                 Key: FLUME-1239
                 URL: https://issues.apache.org/jira/browse/FLUME-1239
             Project: Flume
          Issue Type: Bug
    Affects Versions: v1.1.0
            Reporter: Joey Echeverria


The convention in hadoop is for a file to end in a codec's default extension if 
and only if it's a plain file that's fully compressed. When you configure the 
HDFSSink to output compressed sequence files, you end up with files that have 
that codec's default extension. This causes problems when using some Hadoop 
utilities (e.g. hadoop fs -text) as they assume that the file is not in a 
container format. It would be nice if all sequence files ended in a common 
extension (e.g. .seq).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to