Kokila N created HIVE-29458:
-------------------------------
Summary: Z-order and custom sort expressions in WRITE LOCALLY
ORDERED BY cause data scatter across reducers due to hash distribution
Key: HIVE-29458
URL: https://issues.apache.org/jira/browse/HIVE-29458
Project: Hive
Issue Type: Bug
Reporter: Kokila N
Assignee: Kokila N
*Problem*
When using
{code:java}
`WRITE LOCALLY ORDERED BY zorder()`{code}
or other custom sort expressions on Iceberg tables, the sort expressions are
added to the ReduceSink partition columns (distribution keys). This causes data
to be scattered across reducers based on `hash(partition_col, sort_expr) %
num_reducers`.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)