[ 
https://issues.apache.org/jira/browse/HIVE-27005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17955390#comment-17955390
 ] 

Denys Kuzmenko commented on HIVE-27005:
---------------------------------------

hi [~simhadri-g], that should be addressed, right? We provide incremental col 
stats support for inserts.

> Iceberg: Col stats are not used in queries
> ------------------------------------------
>
>                 Key: HIVE-27005
>                 URL: https://issues.apache.org/jira/browse/HIVE-27005
>             Project: Hive
>          Issue Type: Improvement
>          Components: Iceberg integration
>            Reporter: Rajesh Balamohan
>            Assignee: Simhadri Govindappa
>            Priority: Major
>              Labels: iceberg, performance
>         Attachments: col_stats.txt
>
>
> 1. Though, insert-queries compute colstats during runtime, they are not 
> persisted in HMS during final call. 
> 2. Due to #1, col stats are not available during runtime for hive queries. 
> This includes col stats, NDV etc. So unless users explicitly run "analyse 
> table" statements, queries can be have suboptimal plans.
> E.g 
> [col_stats.txt{^}!https://jira.cloudera.com/images/icons/link_attachment_7.gif|width=7,height=7!{^}|https://jira.cloudera.com/secure/attachment/658390/658390_col_stats.txt](note
>  that there is no col stats being used)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to