[ 
https://issues.apache.org/jira/browse/HIVE-20573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16617297#comment-16617297
 ] 

Zoltan Haindrich commented on HIVE-20573:
-----------------------------------------

There was also some fluctuation in the plans (I think there are 2 Spark drivers 
which check against the same set of q.out files).
I've ended up disabling {{hive.stats.fetch.column.stats}} in the site XML files: 
{{data/conf/spark/yarn-cluster/hive-site.xml}}
I think HIVE-18139 should be fixed before this ticket can be addressed.
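
For reference, disabling the setting in a site file would look roughly like the fragment below. This is only a sketch, assuming the usual Hadoop-style {{<property>}} layout of hive-site.xml; the exact change made to the test configs may differ.
{code:xml}
<!-- Assumed form of the override: turn off fetching of column stats -->
<property>
  <name>hive.stats.fetch.column.stats</name>
  <value>false</value>
</property>
{code}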

> Spark: incorrect results when column stats are fetched
> ------------------------------------------------------
>
>                 Key: HIVE-20573
>                 URL: https://issues.apache.org/jira/browse/HIVE-20573
>             Project: Hive
>          Issue Type: Bug
>          Components: Spark
>            Reporter: Zoltan Haindrich
>            Priority: Major
>
> There are some result set differences relative to the LLAP outputs when 
> column stats fetch is enabled. Examples:
> {code}
> +++ ql/src/test/results/clientpositive/spark/union_remove_12.q.out
> -       totalSize               194                 
> +       totalSize               192                 
> +       18
> +       18
> +       28
> +       28
> -8      18
> -8      18
> -8      28
> -8      28
> +++ ql/src/test/results/clientpositive/spark/vectorized_nested_mapjoin.q.out
> -6.06519093248863E11
> +5.744447909695194E9
> {code}
> ql/src/test/results/clientpositive/spark/union_remove_12.q.out
> ql/src/test/results/clientpositive/spark/union_remove_13.q.out
> ql/src/test/results/clientpositive/spark/union_remove_14.q.out
> ql/src/test/results/clientpositive/spark/bucket_map_join_tez1.q.out
> ql/src/test/results/clientpositive/spark/join32.q.out
> ql/src/test/results/clientpositive/spark/join33.q.out
> ql/src/test/results/clientpositive/spark/vectorized_nested_mapjoin.q.out



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
