Cheng Su created SPARK-32709:
--------------------------------
Summary: Write Hive ORC/Parquet bucketed table with hivehash (for
Hive 1,2)
Key: SPARK-32709
URL: https://issues.apache.org/jira/browse/SPARK-32709
Project: Spark
Issue Type: Sub-task
Components: SQL
Affects Versions: 3.0.0
Reporter: Cheng Su
The Hive ORC/Parquet write code path is the same as the data source v1 code
path (FileFormatWriter). This JIRA adds support for writing Hive ORC/Parquet
bucketed tables with hivehash. The change is to customize `bucketIdExpression`
to use hivehash when the table is a Hive bucketed table and the Hive version is
1.x.y or 2.x.y.
This will allow us to write Hive/Presto-compatible bucketed tables for Hive 1 and 2.
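
For illustration, here is a minimal Python sketch of how a Hive-style bucket id could be derived from the bucket column values. It is not the Spark implementation. It assumes that the hash of a 32-bit int is the value itself, that a string folds its UTF-8 bytes with multiplier 31, that column hashes are combined as 31 * h + column_hash, and that the bucket id is (hash & Int.MaxValue) % numBuckets. The names `to_int32`, `hive_hash_value`, and `bucket_id` are hypothetical helpers, and only int and string columns are covered.

```python
INT_MAX = 0x7FFFFFFF  # Java Integer.MAX_VALUE


def to_int32(x):
    """Wrap a Python int to a signed 32-bit value, mimicking Java int overflow."""
    x &= 0xFFFFFFFF
    return x - 0x100000000 if x >= 0x80000000 else x


def hive_hash_value(value):
    """Hash one column value (assumed rules; only NULL, 32-bit int, and str)."""
    if value is None:
        return 0
    if isinstance(value, int) and not isinstance(value, bool):
        if not -2**31 <= value < 2**31:
            raise TypeError("only 32-bit ints are covered by this sketch")
        return value
    if isinstance(value, str):
        h = 0
        for b in value.encode("utf-8"):
            signed = b - 256 if b >= 128 else b  # Java bytes are signed
            h = to_int32(h * 31 + signed)
        return h
    raise TypeError(f"unsupported type in this sketch: {type(value).__name__}")


def bucket_id(bucket_column_values, num_buckets):
    """Combine the column hashes, clear the sign bit, then take the modulus."""
    h = 0
    for v in bucket_column_values:
        h = to_int32(31 * h + hive_hash_value(v))
    return (h & INT_MAX) % num_buckets


if __name__ == "__main__":
    print(bucket_id([5], 4))
    print(bucket_id(["ab"], 8))
```

Masking with Int.MaxValue before the modulus keeps the bucket id non-negative even when the combined hash is negative, which is why the sketch does not rely on a plain `%` of the signed hash.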