[ 
https://issues.apache.org/jira/browse/HIVE-8950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14238177#comment-14238177
 ] 

Ashish Kumar Singh commented on HIVE-8950:
------------------------------------------

[~brocknoland], [~rdblue], need suggestion on dealing with test failures.

The patch now addresses all the review comments and adds required 
functionality. However, I am fighting with qtest failures to avoid changing 
existing .q.out files. Below are my findings. Kindly suggest what you guys 
think should be done here.

Possible ways to pass down table location to SerDe.
1. Add table path to table.sd.location.
This led to 38 failures. HbaseStorageHandler checks if table.sd.location is 
null or not. If not, throws an MetaException.

2. Add table path to table.sd.serdeinfo.location
158 failures, as serdeinfo has location set. Bunch of qfiles have serdeinfo 
printed.

3. Add table path to table.properties
56 failures, as table properties has location set. Bunch of qfiles have table 
properties printed.

Possible solutions I can think of.
a. Modify .q.out's for failing tests, Will only add "#### A masked pattern was 
here ####" at a few places in existing .q.out.

b. Guard setting of path in (1) by checking for SerDe class name. Will require 
importing ParquetHiveSerDe to a SerDe agnostic class, Hive.

c. Add method, needTablePathInInitialize(), to SerDe. Hive class can then get 
this method using reflection and use it to determine if table path needs to be 
added to table or not. This will solve the problem in (b) of importing specific 
SerDe's in Hive class. However, reflections are costly.

> Add support in ParquetHiveSerde to create table schema from a parquet file
> --------------------------------------------------------------------------
>
>                 Key: HIVE-8950
>                 URL: https://issues.apache.org/jira/browse/HIVE-8950
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Ashish Kumar Singh
>            Assignee: Ashish Kumar Singh
>         Attachments: HIVE-8950.1.patch, HIVE-8950.2.patch, HIVE-8950.3.patch, 
> HIVE-8950.patch
>
>
> PARQUET-76 and PARQUET-47 ask for creating parquet backed tables without 
> having to specify the column names and types. As, parquet files store schema 
> in their footer, it is possible to generate hive schema from parquet file's 
> metadata. This will improve usability of parquet backed tables.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to