[ https://issues.apache.org/jira/browse/HIVE-24515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dmitriy Fingerman reassigned HIVE-24515: ---------------------------------------- Assignee: Dmitriy Fingerman (was: mahesh kumar behera) > Analyze table job can be skipped when stats populated are already accurate > -------------------------------------------------------------------------- > > Key: HIVE-24515 > URL: https://issues.apache.org/jira/browse/HIVE-24515 > Project: Hive > Issue Type: Improvement > Reporter: Rajesh Balamohan > Assignee: Dmitriy Fingerman > Priority: Major > Labels: pull-request-available > Time Spent: 3h 40m > Remaining Estimate: 0h > > For non-partitioned tables, stats detail should be present in table level, > e.g > {noformat} > COLUMN_STATS_ACCURATE={"BASIC_STATS":"true","COLUMN_STATS":{"d_current_day":"true"... > }} > {noformat} > For partitioned tables, stats detail should be present in partition level, > {noformat} > store_sales(ss_sold_date_sk=2451819) > {totalSize=0, numRows=0, rawDataSize=0, > COLUMN_STATS_ACCURATE={"BASIC_STATS":"true","COLUMN_STATS":{"ss_addr_sk":"true"....}} > > {noformat} > When stats populated are already accurate, {{analyze table tn compute > statistics for columns}} should skip launching the job. > > For ACID tables, stats are auto computed and it can skip computing stats > again when stats are accurate. > > -- This message was sent by Atlassian Jira (v8.20.10#820010)