Ye Li created SPARK-37749: ----------------------------- Summary: Built-in ORC reader cannot read data file in sub-directories created by Hive Tez Key: SPARK-37749 URL: https://issues.apache.org/jira/browse/SPARK-37749 Project: Spark Issue Type: Bug Components: Input/Output, SQL Affects Versions: 3.2.0, 3.1.2, 3.0.3 Environment: HDP 3.1.4 Reporter: Ye Li
A Partitioned Hive Table is created and load data in HDP 3.1.4. The Hive engine is Tez, and the storage format is ORC. The data direcotry is like: table1/statt_dt=2021-12-08/-ext-10000/000000_0 The result of SparkSQL which is "select * from table1" does not include the data of partition 2021-12-08. -- This message was sent by Atlassian Jira (v8.20.1#820001) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org