[ 
https://issues.apache.org/jira/browse/HIVE-23959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17477841#comment-17477841
 ] 

Stamatis Zampetakis commented on HIVE-23959:
--------------------------------------------

Before this change a DDL statement updating a column in a partitioned table 
would remove the statistics for the updated column from every partition but 
would leave the stats for other columns intact.

After this change, if the appropriate configuration property is set, updating a 
column removes *all* partition statistics (for all columns of the table).

[~kgyrtkirk]  is my understanding correct or did I miss something?

> Provide an option to wipe out column stats for partitioned tables in case of 
> column removal
> -------------------------------------------------------------------------------------------
>
>                 Key: HIVE-23959
>                 URL: https://issues.apache.org/jira/browse/HIVE-23959
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Zoltan Haindrich
>            Assignee: Zoltan Haindrich
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 4.0.0
>
>          Time Spent: 2h
>  Remaining Estimate: 0h
>
> in case of column removal / replacement - an update for each partition is 
> neccessary; which could take a while.
> goal here is to provide an option to switch to the bulk removal of column 
> statistics instead of working hard to retain as much as possible from the old 
> stats.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to