[ 
https://issues.apache.org/jira/browse/HIVE-15848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15887340#comment-15887340
 ] 

Ashutosh Chauhan commented on HIVE-15848:
-----------------------------------------

Ideally, these optimizations should be migrated to the Calcite tree by writing a 
Calcite rule, where implementing them is more straightforward. 
In the meantime, it's all right to turn off these optimizations in this particular 
case. 
+1

> count or sum distinct incorrect when hive.optimize.reducededuplication set to 
> true
> ----------------------------------------------------------------------------------
>
>                 Key: HIVE-15848
>                 URL: https://issues.apache.org/jira/browse/HIVE-15848
>             Project: Hive
>          Issue Type: Bug
>    Affects Versions: 0.13.0
>            Reporter: Biao Wu
>            Assignee: Zoltan Haindrich
>            Priority: Critical
>         Attachments: HIVE-15848.1.patch, HIVE-15848.2.patch
>
>
> Test Table:
> {code:sql}
> create table count_distinct_test(id int,key int,name int);
> {code}
> Data:
> ||id||key||name||
> |1|1|2|
> |1|2|3|
> |1|3|2|
> |1|4|2|
> |1|5|3|
> Test SQL1:
> {code:sql}
> select id,count(Distinct key),count(Distinct name)
> from (select id,key,name from count_distinct_test group by id,key,name)m
> group by id;
> {code}
> result:
> |1|5|4|
> expect:
> |1|5|2|
> Test SQL2:
> {code:sql}
> select id,count(Distinct name),count(Distinct key)
> from (select id,key,name from count_distinct_test group by id,name,key)m
> group by id;
> {code}
> result:
> |1|2|5|
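> Possible session-level workaround (a sketch inferred from the issue title, not part of the attached patches; it assumes the incorrect result only appears while the property is true):
> {code:sql}
> -- Disable reduce-sink deduplication for this session only, then rerun Test SQL1.
> set hive.optimize.reducededuplication=false;
> select id,count(Distinct key),count(Distinct name)
> from (select id,key,name from count_distinct_test group by id,key,name)m
> group by id;
> {code}
> With the sample data above, the distinct counts for id 1 should then be 5 and 2, matching the expected row.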



