Hadoop does not support opening zip files out of the box see.
https://issues.apache.org/jira/browse/MAPREDUCE-210
Maybe someone out there has written a custom input format.
On Sun, Mar 4, 2012 at 11:03 PM, Lu, Wei wrote:
> So I cannot directly load files end in .zip to hive, right?
> I tried to
So I cannot directly load files end in .zip to hive, right?
I tried to put a .zip file to HDFS and load it to a Hive table, but the data in
the table turn out to be messy
Wei
-Original Message-
From: Edward Capriolo [mailto:edlinuxg...@gmail.com]
Sent: Monday, March 05, 2012 10:54
If the file ends in .bz2 .gz or .deflate there is nothing special you
need to. TextInputFormat (the default) will automatically unzip and
read these. However these types are not split-table so if the file is
large it can not be processed in parallel.
On Sun, Mar 4, 2012 at 9:26 PM, Lu, Wei wrote
Hi,
I need to load data directly from a ctl A delimiter zipped file from the Linux
box directly.
Do I need to 1) un-zip the files and then load them to Hive tables, or 2) is
there a direct command that can load zipped data to Hive table directly?
Thanks,
Wei
The MyJSQLView project has been considering adding support for Hadoop through
the Hive JDBC for sometime now. I'm sure we could put together a few systems
to configure Hadoop and Hive, but was wondering if a temporary development
standalone Hive server might be available out there?
Normally with c