BELUGA BEHR created HADOOP-14668:
------------------------------------

             Summary: Remove Configurable Default Sequence File Compression Type
                 Key: HADOOP-14668
                 URL: https://issues.apache.org/jira/browse/HADOOP-14668
             Project: Hadoop Common
          Issue Type: Improvement
          Components: io
    Affects Versions: 3.0.0-alpha3
            Reporter: BELUGA BEHR
            Priority: Trivial
             Fix For: 2.8.1


It is confusing to have two different ways to set the Sequence File compression 
type.

In a basic configuration, I can set 
_mapreduce.output.fileoutputformat.compress.type_ or 
_io.seqfile.compression.type_.  If I would like to set a default value, I 
should set it by setting the cluster environment's mapred-site.xml file setting 
for _mapreduce.output.fileoutputformat.compress.type_.

Please remove references to this magic string _io.seqfile.compression.type_, 
remove the {{setDefaultCompressionType}} method, and have 
{{getDefaultCompressionType}} return value hard-coded to 
{{CompressionType.RECORD}}.  This will make administration easier as I have to 
only interrogate one configuration.

{code:title=org.apache.hadoop.io.SequenceFile}
  /**
   * Get the compression type for the reduce outputs
   * @param job the job config to look in
   * @return the kind of compression to use
   */
  static public CompressionType getDefaultCompressionType(Configuration job) {
    String name = job.get("io.seqfile.compression.type");
    return name == null ? CompressionType.RECORD : 
      CompressionType.valueOf(name);
  }
  
  /**
   * Set the default compression type for sequence files.
   * @param job the configuration to modify
   * @param val the new compression type (none, block, record)
   */
  static public void setDefaultCompressionType(Configuration job, 
                                               CompressionType val) {
    job.set("io.seqfile.compression.type", val.toString());
  }
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to