[
https://issues.apache.org/jira/browse/HIVE-29203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18020604#comment-18020604
]
Zhihua Deng commented on HIVE-29203:
------------------------------------
If we remove the batch processing from
[https://github.com/apache/hive/blob/4bb08099d91acbefee73a449a36abb1ecd2b5925/standalone-metastore/metastore-server/src/main/java/org/apache/hadoop/hive/metastore/MetaStoreDirectSql.java#L1882-L1891,]
perhaps we need to consider the memory usage when enableBitVector ||
enableKll, and theĀ
restrictions on IN list size in aggrStatsUseDB.
> get_aggr_stats_for doesn't aggregate stats when direct sql batch retrieve is
> enabled
> ------------------------------------------------------------------------------------
>
> Key: HIVE-29203
> URL: https://issues.apache.org/jira/browse/HIVE-29203
> Project: Hive
> Issue Type: Bug
> Components: Standalone Metastore
> Reporter: Zhihua Deng
> Priority: Major
>
> In case of metastore.direct.sql.batch.size > 0, and number of partition names
> or columns in get_aggr_stats_for is bigger than the
> metastore.direct.sql.batch.size, then the
> AggrStats from the call get_aggr_stats_for might have un-merged stats for the
> same column, so the aggregated stats is not correct, which may make CBO
> generate an outdated execution plan.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)