Rui Li created HIVE-10261:
-----------------------------

             Summary: Data size can be underestimated when computed with partial column stats
                 Key: HIVE-10261
                 URL: https://issues.apache.org/jira/browse/HIVE-10261
             Project: Hive
          Issue Type: Bug
            Reporter: Rui Li


With {{hive.stats.fetch.column.stats=true}}, we estimate data size from 
column stats when annotating operators with statistics. However, when column 
stats are partial, we're likely to underestimate the data size, which may hurt 
performance, e.g. by picking an inappropriate small table for a map join.
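
To illustrate the concern, here is a minimal sketch in Python. It is not Hive's actual estimation code. It assumes a simplified model in which a column without stats contributes nothing to the estimated row size. The function names, the column names, the default size of 100 bytes and the 25,000,000-byte threshold are all hypothetical and chosen only for illustration.

```python
# Hypothetical model of data-size estimation; NOT Hive's real implementation.

def estimate_data_size(num_rows, avg_col_sizes, all_columns):
    """Naive estimate: columns with no stats contribute 0 bytes (assumption)."""
    row_size = sum(avg_col_sizes.get(col, 0) for col in all_columns)
    return num_rows * row_size


def estimate_with_fallback(num_rows, avg_col_sizes, all_columns, default_size):
    """Guarded estimate: columns with no stats fall back to a default size."""
    row_size = sum(avg_col_sizes.get(col, default_size) for col in all_columns)
    return num_rows * row_size


# Illustrative table: three columns, but stats exist for only one of them.
columns = ["id", "name", "payload"]
partial_stats = {"id": 8}          # average size in bytes, per column
num_rows = 1_000_000

naive = estimate_data_size(num_rows, partial_stats, columns)
guarded = estimate_with_fallback(num_rows, partial_stats, columns, default_size=100)

# Illustrative small-table threshold for a map join decision.
threshold = 25_000_000
picked_as_small_table_naive = naive <= threshold
picked_as_small_table_guarded = guarded <= threshold
```

Under these assumptions the naive estimate falls below the threshold, so the table would be treated as small enough for a map join, while the guarded estimate does not.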



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
