Hive has built-in support for gzipped text files so you don't need to do
anything special with them (meaning you just load them like regular files).
However, there is one important limitation with this approach these files
are not splittable. Depending on the size of your log files and how many of
them you have perday that may not be a problem. The other, slightly more
complicated approach is to use sequence files, which support splittable
compression. You can find more details here:
https://cwiki.apache.org/confluence/display/Hive/CompressedStorage



On Mon, Nov 11, 2013 at 5:09 PM, Raj Hadoop <hadoop...@yahoo.com> wrote:

> Hi,
>
> I have a web log files (text format). I want to load these files to a Hive
> table in compressed format. How do I do it ?
>
> Should I compress the text file (using any Linux utilities) and then
> create the Hive table?
>
> Can any one provide me the Hive syntax for loading the compressed file?
>
> Thanks,
> Raj
>

Reply via email to