[ 
https://issues.apache.org/jira/browse/HIVE-10261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14485439#comment-14485439
 ] 

Mostafa Mokhtar commented on HIVE-10261:
----------------------------------------

[~lirui]

Can you please attach the explain plan, the query, and the actual number of 
rows for the operator whose data size is underestimated?

> Data size can be underestimated when computed with partial column stats
> -----------------------------------------------------------------------
>
>                 Key: HIVE-10261
>                 URL: https://issues.apache.org/jira/browse/HIVE-10261
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Rui Li
>
> With {{hive.stats.fetch.column.stats=true}}, we estimate data size from 
> column stats when annotating operators with statistics. However, when column 
> stats are partial, we're likely to underestimate data size, which may hurt 
> performance, e.g. by picking an inappropriate small table for map join.
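
To make the arithmetic behind the description above concrete, here is a rough sketch of how summing only the columns that have stats can shrink the estimate. It is a simplified model, not Hive's actual code: the function name, the table, the average column lengths, and the 10 MB small-table threshold are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Optional


def estimate_data_size(num_rows: int,
                       avg_col_len: Dict[str, Optional[float]]) -> float:
    """Hypothetical estimator: rows * sum of average column lengths.

    Columns whose stats are missing (None) contribute nothing, which is
    the assumed source of the underestimation.
    """
    known = sum(v for v in avg_col_len.values() if v is not None)
    return num_rows * known


# Hypothetical table with 1,000,000 rows and three columns.
NUM_ROWS = 1_000_000
full_stats = {"id": 8.0, "name": 40.0, "payload": 200.0}
partial_stats = {"id": 8.0, "name": None, "payload": None}  # only `id` analyzed

full_size = estimate_data_size(NUM_ROWS, full_stats)
partial_size = estimate_data_size(NUM_ROWS, partial_stats)

# Hypothetical small-table threshold for map join, in bytes.
SMALL_TABLE_THRESHOLD = 10_000_000

print(f"estimate with full stats:    {full_size:,.0f} bytes")
print(f"estimate with partial stats: {partial_size:,.0f} bytes")
print("map join candidate (full):   ", full_size <= SMALL_TABLE_THRESHOLD)
print("map join candidate (partial):", partial_size <= SMALL_TABLE_THRESHOLD)
```

Under these assumptions, the same table drops below the threshold once two of its three columns lack stats, which is the kind of inappropriate small-table choice the description mentions.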



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
