Nishant Bangarwa created HIVE-19762:
---------------------------------------
Summary: Druid Queries containing Joins gives wrong results.
Key: HIVE-19762
URL: https://issues.apache.org/jira/browse/HIVE-19762
Project: Hive
Issue Type: Bug
Reporter: Nishant Bangarwa
Assignee: Nishant Bangarwa
Druid queries that have joins against self table gives wrong results.
e.g.
{code}
SELECT
username AS `username`,
SUM(double1) AS `sum_double1`
FROM
druid_table_with_nulls `tbl1`
JOIN (
SELECT
username AS `username`,
SUM(double1) AS `sum_double2`
FROM druid_table_with_nulls
GROUP BY `username`
ORDER BY `sum_double2`
DESC LIMIT 10
)
`tbl2`
ON (`tbl1`.`username` = `tbl2`.`username`)
GROUP BY `tbl1`.`username`;
{code}
In this case one of the queries is a druid scan query and other is groupBy
query.
During planning, the properties of these queries are set to the tableDesc and
serdeInfo, while setting the map work, we overwrite the properties from the
properties present in serdeInfo, this causes the scan query results to be
deserialized using wrong column names and results in Null values.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)