[ https://issues.apache.org/jira/browse/HIVE-17921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Eugene Koifman updated HIVE-17921: ---------------------------------- Description: Consider {noformat} select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having count(*) > 1; {noformat} in acid_vectorization_original.q (available since HIVE-17458) when run using TestMiniLlapCliDriver produces "NULL, N" where N varies from run to run. The right answer is empty results set as can be seen by running {noformat} select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null {noformat} in the same test. This is with {noformat} set hive.vectorized.execution.enabled=true; set hive.vectorized.row.identifier.enabled=true; {noformat} It fails with TestMiniLlapCliDriver but not TestMiniTezCliDriver. See acid_vectorization_original_tez.q which has identical query. was: Consider {noformat} select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having count(*) > 1; {noformat} in acid_vectorization_original.q (available since HIVE-17458) when run using TestMiniLlapCliDriver produces "NULL, N" where N varies from run to run. The right answer is empty results set as can be seen by running {noformat} select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null {noformat} in the same test. > Aggregation with struct in LLAP produces wrong result > ----------------------------------------------------- > > Key: HIVE-17921 > URL: https://issues.apache.org/jira/browse/HIVE-17921 > Project: Hive > Issue Type: Sub-task > Components: llap, Transactions > Affects Versions: 3.0.0 > Reporter: Eugene Koifman > > Consider > {noformat} > select ROW__ID, count(*) from over10k_orc_bucketed group by ROW__ID having > count(*) > 1; > {noformat} > in acid_vectorization_original.q (available since HIVE-17458) > when run using TestMiniLlapCliDriver produces "NULL, N" where N varies from > run to run. > The right answer is empty results set as can be seen by running > {noformat} > select ROW__ID, * from over10k_orc_bucketed where ROW__ID is null > {noformat} > in the same test. > This is with > {noformat} > set hive.vectorized.execution.enabled=true; > set hive.vectorized.row.identifier.enabled=true; > {noformat} > It fails with TestMiniLlapCliDriver but not TestMiniTezCliDriver. See > acid_vectorization_original_tez.q which has identical query. -- This message was sent by Atlassian JIRA (v6.4.14#64029)