[ 
https://issues.apache.org/jira/browse/SPARK-45855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Robertson updated SPARK-45855:
----------------------------------
    Description: 
Hi,

We've discovered code that worked in Spark 3.3.0 doesn't in 3.4.0. I can't find 
anything in the release notes to indicate why, so I wonder if this is a bug. 
Thank you for looking.

Here we're using our own custom codec, but we noticed we can't set gzip either.

{{  SparkConf conf = spark.sparkContext().conf();}}
{{  conf.set("hive.exec.compress.output", "true");}}
{{  conf.set("mapred.output.compression.codec", D2Codec.class.getName()); }}
{{  spark.sql("CREATE TABLE b AS SELECT id FROM a");}}

This will create the table, but it writes uncompressed files, where Spark 3.3.0 
would write compressed files. 

Any advice is appreciated and I can help run tests. We run Spark on K8S using 
the stackable.tech distribution.

  was:
Hi,

We've discovered code that worked in Spark 3.3.0 doesn't in 3.4.0. I can't find 
anything in the release notes to indicate why, so I wonder if this is a bug. 
Thank you for looking.

Here we're using our own custom codec, but we noticed we can't set gzip either.



{{  SparkConf conf = spark.sparkContext().conf();}}
{{  conf.set("hive.exec.compress.output", "true");}}
{{  conf.set("mapred.output.compression.codec", D2Codec.class.getName()); }}
{{  spark.sql("CREATE TABLE b AS SELECT id FROM a");}}
 

Any advice is appreciated and I can help run tests. We run Spark on K8S using 
the stackable.tech distribution.


> Unable to set codec for Hive CTAS
> ---------------------------------
>
>                 Key: SPARK-45855
>                 URL: https://issues.apache.org/jira/browse/SPARK-45855
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 3.4.0
>         Environment: Spark 3.4.0 
> Stackable.tech release 23.7.0 which runs spark on K8s.
>            Reporter: Tim Robertson
>            Priority: Major
>
> Hi,
> We've discovered code that worked in Spark 3.3.0 doesn't in 3.4.0. I can't 
> find anything in the release notes to indicate why, so I wonder if this is a 
> bug. Thank you for looking.
> Here we're using our own custom codec, but we noticed we can't set gzip 
> either.
> {{  SparkConf conf = spark.sparkContext().conf();}}
> {{  conf.set("hive.exec.compress.output", "true");}}
> {{  conf.set("mapred.output.compression.codec", D2Codec.class.getName()); }}
> {{  spark.sql("CREATE TABLE b AS SELECT id FROM a");}}
> This will create the table, but it writes uncompressed files, where Spark 
> 3.3.0 would write compressed files. 
> Any advice is appreciated and I can help run tests. We run Spark on K8S using 
> the stackable.tech distribution.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to