Lu Li created HIVE-20225:
----------------------------
Summary: SerDe to support Teradata Binary Format
Key: HIVE-20225
URL: https://issues.apache.org/jira/browse/HIVE-20225
Project: Hive
Issue Type: New Feature
Components: Serializers/Deserializers
Reporter: Lu Li
Assignee: Lu Li
When using TPT/BTEQ to export Data from Teradata, Teradata will export binary
files based on the schema.
A Customized SerDe is needed in order to directly read these files from Hive.
{code:java}
CREATE EXTERNAL TABLE `TABLE1`(
...)
PARTITIONED BY (
...)
ROW FORMAT SERDE
'org.apache.hadoop.hive.contrib.serde2.TeradataBinarySerde'
STORED AS INPUTFORMAT
'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileInputFormat'
OUTPUTFORMAT
'org.apache.hadoop.hive.contrib.fileformat.teradata.TeradataBinaryFileOutputFormat'
LOCATION ...;
SELECT * FROM `TABLE1`;{code}
Problem Statement:
Right now the fast way to export data from Teradata is using TPT. However, the
Hive could not directly utilize these exported binary format because it doesn't
have a SerDe for these files.
Result:
Provided with the SerDe, Hive can operate upon the exported Teradata Binary
Format file transparently.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)