Ziya Mukhtarov created SPARK-54220:
--------------------------------------

             Summary: NullType/VOID/UNKNOWN Type Support in Parquet
                 Key: SPARK-54220
                 URL: https://issues.apache.org/jira/browse/SPARK-54220
             Project: Spark
          Issue Type: New Feature
          Components: SQL
    Affects Versions: 4.0.1
            Reporter: Ziya Mukhtarov


Currently, Spark does not support NullType columns in Parquet files. This is 
blocked at the ParquetFileFormat level, which throws an exception if there's an 
attempt to read or write NullType column. Parquet has [UNKNOWN logical type 
annotation|https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#unknown-always-null],
 which can be used for columns whose values are always null. This can be used 
to write NullType columns into Parquet files, and when reading from Parquet 
files that already has this annotation.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to