chandra sekhar gunturi created HIVE-4564:
--------------------------------------------
Summary: Distinct along with order by is not working when table
name is part of column name in order by clause
Key: HIVE-4564
URL: https://issues.apache.org/jira/browse/HIVE-4564
Project: Hive
Issue Type: Bug
Components: Query Processor
Affects Versions: 0.9.0
Reporter: chandra sekhar gunturi
Fix For: 0.9.1
I have following table named 'region'.
hive> desc region;
r_regionkey int
r_name string
r_comment string
When we use <distinct, orderby> clause combination in table_name.column_name
format, the query throws SemanticException.
For example, the following query throws error.
hive> select distinct region.r_name from region order by region.r_name;
FAILED: SemanticException [Error 10004]: Line 1:51 Invalid table alias or
column reference 'region': (possible column names are: _col0)
The same query works fine if the same query is used without table name in order
by clause.
The following query works fine for region table.
hive> select distinct region.r_name from region order by r_name;
This is a common scenario in actual real world scenarios.
For example, I want to find out what are all the cities my employees are from.
>> SELECT DISTINCT CITY.NAME FROM EMPLOYEE, CITY WHERE EMPLOYEE.CID=CITY.CID
>> ORDER BY CITY.NAME
Here we are forced to use CITY.NAME as it may conflict with EMPLOYEE.NAME.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira