Shohei Okumiya created HIVE-28410:
-------------------------------------

             Summary: Partition-Aware Optimization for Iceberg or OTF
                 Key: HIVE-28410
                 URL: https://issues.apache.org/jira/browse/HIVE-28410
             Project: Hive
          Issue Type: New Feature
          Components: Iceberg integration, StorageHandler
    Affects Versions: 4.0.0
            Reporter: Shohei Okumiya
            Assignee: Shohei Okumiya


This project aims to allow users using Apache Iceberg or other non-native table 
formats to take advantage of Hive's advanced optimizations.

 

Apache Hive provides some optimizations depending on the storage layout of Hive 
native tables. Bucket Map Join, Sort Merge Bucket Join, or GroupByOptimizer are 
some of them. Those optimizations are not open to non-native tables because 
they rely on a piece of hardcoded logic. For example, hashing algorithms are 
implemented outside StorageHandlers, so enabling Bucket Map Join on Iceberg 
tables with Bucket Transforms is unrealistic.

 

We have some discussions in HIVE-27734; 
[this|https://docs.google.com/document/d/1srEK3atO2T3Apa-FsF6bW__ECY-nFrev_1RZ8EN4UF8/edit#heading=h.jzie7kdemx93]
 is the first design doc.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to