[ 
https://issues.apache.org/jira/browse/SPARK-47493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated SPARK-47493:
-----------------------------------
    Labels: pull-request-available  (was: )

> Disable spark.sql.parquet.inferTimestampNTZ.enabled by default
> --------------------------------------------------------------
>
>                 Key: SPARK-47493
>                 URL: https://issues.apache.org/jira/browse/SPARK-47493
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 4.0.0
>            Reporter: Gengliang Wang
>            Assignee: Gengliang Wang
>            Priority: Major
>              Labels: pull-request-available
>
> Since Spark 4.0, the SQL config `spark.sql.parquet.inferTimestampNTZ.enabled` 
> is turned off by default. Consequently, when reading Parquet files that were 
> not produced by Spark, the Parquet reader will no longer automatically 
> recognize data as the TIMESTAMP_NTZ data type. This change ensures backward 
> compatibility with releases of Spark version 3.2 and earlier. It also aligns 
> the behavior of schema inference for Parquet files with that of other data 
> sources such as CSV, JSON, ORC, and JDBC, enhancing consistency across the 
> data sources. To revert to the previous behavior where TIMESTAMP_NTZ types 
> were inferred, set `spark.sql.parquet.inferTimestampNTZ.enabled` to true.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to