Børge Svingen created HIVE-5237:
-----------------------------------

             Summary: Incorrect group-by aggregation in 0.11.0
                 Key: HIVE-5237
                 URL: https://issues.apache.org/jira/browse/HIVE-5237
             Project: Hive
          Issue Type: Bug
    Affects Versions: 0.11.0
            Reporter: Børge Svingen
            Priority: Critical



group by with sub queries does not correctly aggregate results in Hive 0.11.0.

To reproduce:

Put the file

{code}
1,b
2,c
2,b
3,a
3,c
4,a
{code}

in HDFS, and run

{code}
create external table abc (x int, y string) row format delimited fields 
terminated by ',' location '/data/';
{code}

The query

{code}
select
        x,
        count(*)
from
(select
        x,
        y
from
        abc
group by
      x,
      y
) a
group by
        x;
{code}

will then give the result

{code}
2       1
3       1
2       1
4       1
3       1
1       1
{code}

instead of the correct

{code}
1       1
2       2
3       2
4       1
{code}

In 0.9.0 and 0.10.0 this is all working correctly.


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to