[
https://issues.apache.org/jira/browse/HIVE-29525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18073782#comment-18073782
]
Thomas Rebele commented on HIVE-29525:
--------------------------------------
In HIVE-29432 I also encountered the issue that partitioned Iceberg tables do
not provide info on the partitioning. At that time I just excluded Iceberg
tables. In case HIVE-29525 gets implemented, it might be possible to improve
HIVE-29432 as well.
> Iceberg partitioned columns are not reflected by RelOptHiveTable
> ----------------------------------------------------------------
>
> Key: HIVE-29525
> URL: https://issues.apache.org/jira/browse/HIVE-29525
> Project: Hive
> Issue Type: Bug
> Components: CBO
> Reporter: Stamatis Zampetakis
> Priority: Major
>
> Iceberg partitioned columns are not reflected by RelOptHiveTable and the
> respective data-structures (hivePartitionCols and hivePartitionColsMap) are
> always empty. RelOptHiveTable is used in various places during the CBO
> planning phase so any logic that depends on partitioning information is not
> kicking in and all Iceberg tables are treated as unpartitioned.
> Depending on the context where the partitioning information is used different
> issues may arise. For instance, selectivity estimation is computed
> differently if the predicate is over a partitioning column (see
> [FilterSelectivityEstimator|https://github.com/apache/hive/blob/931d4bb62b26de699240c816df439e00644e3dcb/ql/src/java/org/apache/hadoop/hive/ql/optimizer/calcite/stats/FilterSelectivityEstimator.java#L105])
> so the absence of partitioning information can have a direct impact on
> cardinality (row count) estimations and affect join order.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)