Re: How to read LZO compressed files?

2012-01-02 Thread edward choi
Harsh, your comment just saved me from several wasteful hours of aimless labor. I added LzoCodec in core-site.xml. But I forgot to add LzopCodec. Now it works all good. Thanks for the reply!!! Regards, Ed 2012/1/2 Harsh J > Hello Edward, > > On Mon, Jan 2, 2012 at 11:04 AM, edward choi wrote:

Re: How to read LZO compressed files?

2012-01-01 Thread Harsh J
Hello Edward, On Mon, Jan 2, 2012 at 11:04 AM, edward choi wrote: > Hi, > > I'm having trouble trying to handle lzo compressed files. > The input files are compressed by LzopCodec provided by hadoop-lzo package. > And I am using Cloudera 3 update 2 version Hadoop. > > I don't need to split the in

Re: How to read LZO compressed files?

2012-01-01 Thread edward choi
Hi, The first solution is my final plan. There are so many lzo files, that manual decompression would take quite a while As you suggested, I have used LzoTextInputFormat but I get the following error 2012-01-02 16:15:16,668 INFO org.apache.hadoop.util.NativeCodeLoader: Loaded the native-hadoop l

Re: How to read LZO compressed files?

2012-01-01 Thread Shi Yu
You could decompress the LZO file manually into plain text then using TextInputFormat. Alternatively, you don't need to index the LZO compressed file, just using LZOInputFormat on non-indexed files, then the LZO file will not be split anymore.

How to read LZO compressed files?

2012-01-01 Thread edward choi
Hi, I'm having trouble trying to handle lzo compressed files. The input files are compressed by LzopCodec provided by hadoop-lzo package. And I am using Cloudera 3 update 2 version Hadoop. I don't need to split the input file, so there is no need telling me to index the input file and to use LzoT