László Pintér created HIVE-24928: ------------------------------------ Summary: In case of non-native tables use basic statistics from HiveStorageHandler Key: HIVE-24928 URL: https://issues.apache.org/jira/browse/HIVE-24928 Project: Hive Issue Type: Bug Components: Hive Affects Versions: 4.0.0 Reporter: László Pintér Assignee: László Pintér Fix For: 4.0.0
When we are running `ANALYZE TABLE ... COMPUTE STATISTICS` or `ANALYZE TABLE ... COMPUTE STATISTICS FOR COLUMNS` all the basic statistics are collected by the BasicStatsTask class. This class tries to estimate the statistics by scanning the directory of the table. In the case of non-native tables (iceberg, hbase), the table directory might contain metadata files as well, which would be counted by the BasicStatsTask when calculating basic stats. Instead of having this logic, the HiveStorageHandler implementation should provide basic statistics. -- This message was sent by Atlassian Jira (v8.3.4#803005)