[ 
https://issues.apache.org/jira/browse/HIVE-20109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eugene Koifman updated HIVE-20109:
----------------------------------
    Component/s: Statistics

> get rid of COLUMN_STATS_ACCURATE
> --------------------------------
>
>                 Key: HIVE-20109
>                 URL: https://issues.apache.org/jira/browse/HIVE-20109
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>            Reporter: Sergey Shelukhin
>            Priority: Major
>
> I don't know why anyone would come up with an idea of storing a set of 
> booleans in a database using JSON. This has caused various problems in the 
> past (text field limitations, perf issues when parsing a giant string; also 
> bugs because the way it is set is brittle).
> However, now that we are implementing transactional stats, it becomes 
> especially problematic and error prone because the code in Hive sets C_S_A in 
> random places with reckless abandon, whereas we want to change the state of 
> the stats in well defined places where txn semantics can be verified.
> Currently in HIVE-19416, we are handling random things that touch it (from 
> metastore itself to output committers, various stats tasks, commands like 
> truncate, etc.) via a pile of hacks, but the best solution would be to remove 
> it completely and replace with a DB table/columns in stats tables that would 
> need to be set explicitly, not via generic alter_table.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to