Github user fjh100456 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19218#discussion_r158574986
  
    --- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetOptions.scala
 ---
    @@ -42,8 +43,15 @@ private[parquet] class ParquetOptions(
        * Acceptable values are defined in 
[[shortParquetCompressionCodecNames]].
        */
       val compressionCodecClassName: String = {
    -    val codecName = parameters.getOrElse("compression",
    -      sqlConf.parquetCompressionCodec).toLowerCase(Locale.ROOT)
    +    // `compression`, `parquet.compression`(i.e., 
ParquetOutputFormat.COMPRESSION), and
    +    // `spark.sql.parquet.compression.codec`
    +    // are in order of precedence from highest to lowest.
    +    val parquetCompressionConf = 
parameters.get(ParquetOutputFormat.COMPRESSION)
    +    val codecName = parameters
    +      .get("compression")
    +      .orElse(parquetCompressionConf)
    --- End diff --
    
    If so, parquet's table-level compression may be overwrited in this PR, and 
it may not be what we want.
    Shall I  fix it first in another PR?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to