[
https://issues.apache.org/jira/browse/HIVE-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
akshay updated HIVE-7542:
-------------------------
Description:
We plan to use RCFiles to create a data store as it can help store data in
compressed format and the columnar format enables better querying for selective
columns.
Problem: When we import data from text files (comma/tab delimited) into tables
with RCFile storage format, we get an error as stated below:
"Failed with exception Wrong file format. Please check the file's format.
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.MoveTask"
Workaround:
I know we can create an intermediate table.
Load data from text file to that table.
Then use insert into table rc_table select * from temp_text_file_table
But, we do not want to create intermediate tables as we have thousands of TB of
data.
Summary: Cannot import text data to Hive tables with RCFile storage (DO
NOT want to use intermediate tables) (was: Cannot import text data to Hive
tables with RCFile storage)
> Cannot import text data to Hive tables with RCFile storage (DO NOT want to
> use intermediate tables)
> ---------------------------------------------------------------------------------------------------
>
> Key: HIVE-7542
> URL: https://issues.apache.org/jira/browse/HIVE-7542
> Project: Hive
> Issue Type: Bug
> Components: Compression, File Formats, HiveServer2
> Reporter: akshay
> Priority: Critical
>
> We plan to use RCFiles to create a data store as it can help store data in
> compressed format and the columnar format enables better querying for
> selective columns.
> Problem: When we import data from text files (comma/tab delimited) into
> tables with RCFile storage format, we get an error as stated below:
> "Failed with exception Wrong file format. Please check the file's format.
> FAILED: Execution Error, return code 1 from
> org.apache.hadoop.hive.ql.exec.MoveTask"
> Workaround:
> I know we can create an intermediate table.
> Load data from text file to that table.
> Then use insert into table rc_table select * from temp_text_file_table
> But, we do not want to create intermediate tables as we have thousands of TB
> of data.
--
This message was sent by Atlassian JIRA
(v6.2#6252)