[jira] [Created] (SPARK-38536) Spark 3 can not read mixed format partitions

Huicheng Song (Jira) Sat, 12 Mar 2022 12:18:07 -0800

Huicheng Song created SPARK-38536:
-------------------------------------

             Summary: Spark 3 can not read mixed format partitions
                 Key: SPARK-38536
                 URL: https://issues.apache.org/jira/browse/SPARK-38536
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.2.1, 3.0.0
            Reporter: Huicheng Song



Spark 3.x reads partitions with table's input format, which fails when the 
partition has a different input format than the table.

This is a regression introduced by SPARK-26630. Before that fix, Spark will use 
Partition InputFormat when creating HadoopRDD. With that fix, Spark uses only 
Table InputFormat when creating HadoopRDD, causing failures

Reading mixed format partitions is an import scenario, especially for format 
migration. It is also well supported in query engines like Hive and Presto.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Created] (SPARK-38536) Spark 3 can not read mixed format partitions

Reply via email to