László Pintér created HIVE-24928:
------------------------------------

             Summary: In case of non-native tables use basic statistics from 
HiveStorageHandler
                 Key: HIVE-24928
                 URL: https://issues.apache.org/jira/browse/HIVE-24928
             Project: Hive
          Issue Type: Bug
          Components: Hive
    Affects Versions: 4.0.0
            Reporter: László Pintér
            Assignee: László Pintér
             Fix For: 4.0.0


When we are running `ANALYZE TABLE ... COMPUTE STATISTICS` or `ANALYZE TABLE 
... COMPUTE STATISTICS FOR COLUMNS` all the basic statistics are collected by 
the BasicStatsTask class. This class tries to estimate the statistics by 
scanning the directory of the table. 

In the case of non-native tables (iceberg, hbase), the table directory might 
contain metadata files as well, which would be counted by the BasicStatsTask 
when calculating basic stats. 

Instead of having this logic, the HiveStorageHandler implementation should 
provide basic statistics.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to