-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/18254/
-----------------------------------------------------------

(Updated Feb. 19, 2014, 9:56 p.m.)


Review request for hive.


Changes
-------

Incorporated review feedback.

Updated more test cases results of explain CTAS.

It seems that the test table srcbucket, as a bucketed (multi-file) table, will 
give random results from select query, so first insert to a staging table using 
sort by.


Bugs: HIVE-6375
    https://issues.apache.org/jira/browse/HIVE-6375


Repository: hive-git


Description
-------

There is a Hive bug in SemanticAnalyzer that chooses different names for 
columns in the CreateTable task and the FileSink task.  
columnInfo.getInternalName() was used in one place, and fieldSchema still used 
columnInfo.getAlias() if it is available.  This change makes both consistent, 
favoring columnInfo.getAlias if it is available.

This is not revealed before because other file-formats like RcFile seem to use 
column-ordinal position, and Avro file stores the schema separately altogether.


Diffs (updated)
-----

  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 77388dd 
  ql/src/test/queries/clientpositive/parquet_ctas.q PRE-CREATION 
  ql/src/test/results/clientpositive/ctas.q.out 9668855 
  ql/src/test/results/clientpositive/ctas_hadoop20.q.out 0ec0af5 
  ql/src/test/results/clientpositive/merge3.q.out 3df75b7 
  ql/src/test/results/clientpositive/parquet_ctas.q.out PRE-CREATION 

Diff: https://reviews.apache.org/r/18254/diff/


Testing
-------

Added parquet_ctas.q.  Covers cases where column name is gotten directly from 
input table (implied alias), where name is auto-generated, where name is 
specified as alias, and a mix of the three.


Thanks,

Szehon Ho

Reply via email to