Shohei Okumiya created HIVE-28410:
-------------------------------------

             Summary: Partition-Aware Optimization for Iceberg or OTF
                 Key: HIVE-28410
                 URL: https://issues.apache.org/jira/browse/HIVE-28410
             Project: Hive
          Issue Type: New Feature
          Components: Iceberg integration, StorageHandler
    Affects Versions: 4.0.0
            Reporter: Shohei Okumiya
            Assignee: Shohei Okumiya

This project aims to let users of Apache Iceberg and other non-native table formats take advantage of Hive's advanced optimizations.

Apache Hive provides several optimizations that depend on the storage layout of Hive native tables, such as Bucket Map Join, Sort Merge Bucket Join, and GroupByOptimizer. These optimizations are not available to non-native tables because they rely on hardcoded logic. For example, the hashing algorithms are implemented outside StorageHandlers, so enabling Bucket Map Join on Iceberg tables with Bucket Transforms is currently unrealistic.

There is earlier discussion in HIVE-27734; [this|https://docs.google.com/document/d/1srEK3atO2T3Apa-FsF6bW__ECY-nFrev_1RZ8EN4UF8/edit#heading=h.jzie7kdemx93] is the first design doc.
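
To make the hashing mismatch concrete, here is a minimal, self-contained sketch. It is not Hive or Iceberg code, and every class and method name in it is hypothetical. `icebergBucket` follows the bucket transform described in the Iceberg table spec (Murmur3 x86 32-bit, seed 0, over the 8 little-endian bytes of the value, then `(hash & Integer.MAX_VALUE) % N`). `legacyHiveStyleBucket` is a simplified model, assumed here for illustration, of Hive's legacy bucketing of an int key, where the hash code is the value itself. The same key lands in different buckets, which is why a planner that hardcodes one scheme cannot reason about files laid out by the other.

```java
// Illustrative sketch only; none of these names are Hive or Iceberg APIs.
class BucketMismatchSketch {

    // Murmur3 x86 32-bit, seed 0, over the 8 little-endian bytes of a long.
    // The Iceberg spec hashes int and long values this way.
    static int icebergHash(long value) {
        int h = 0;                                  // seed
        h = mixH1(h, mixK1((int) value));           // low 4 bytes
        h = mixH1(h, mixK1((int) (value >>> 32)));  // high 4 bytes
        return fmix(h ^ 8);                         // 8 = input length in bytes
    }

    private static int mixK1(int k1) {
        k1 *= 0xcc9e2d51;
        k1 = Integer.rotateLeft(k1, 15);
        k1 *= 0x1b873593;
        return k1;
    }

    private static int mixH1(int h1, int k1) {
        h1 ^= k1;
        h1 = Integer.rotateLeft(h1, 13);
        return h1 * 5 + 0xe6546b64;
    }

    // Final avalanche step of Murmur3.
    private static int fmix(int h) {
        h ^= h >>> 16;
        h *= 0x85ebca6b;
        h ^= h >>> 13;
        h *= 0xc2b2ae35;
        h ^= h >>> 16;
        return h;
    }

    // Bucket transform as defined in the Iceberg spec.
    static int icebergBucket(long value, int numBuckets) {
        return (icebergHash(value) & Integer.MAX_VALUE) % numBuckets;
    }

    // ASSUMPTION: simplified model of Hive's legacy int bucketing,
    // where the hash code of an int key is the value itself.
    static int legacyHiveStyleBucket(int value, int numBuckets) {
        return (value & Integer.MAX_VALUE) % numBuckets;
    }

    public static void main(String[] args) {
        int key = 34;
        int numBuckets = 16;
        System.out.println("Iceberg bucket:    " + icebergBucket(key, numBuckets));
        System.out.println("Hive-style bucket: " + legacyHiveStyleBucket(key, numBuckets));
    }
}
```

With key 34 and 16 buckets, the Iceberg-style function returns bucket 3 and the Hive-style model returns bucket 2. A partition-aware design would presumably let the StorageHandler supply the bucketing function instead of the optimizer assuming one.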