Harsh, your comment just saved me several hours of wasted, aimless
labor.
I had added LzoCodec in core-site.xml, but I forgot to add LzopCodec.
Now it all works. Thanks for the reply!
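
For anyone who hits the same problem, the fix amounts to something like
the fragment below. It is only a sketch: it assumes the codec class names
shipped with the hadoop-lzo package (com.hadoop.compression.lzo.*), and the
other codecs in the list are the stock Hadoop ones, which may differ on
your cluster.

```xml
<!-- core-site.xml (fragment). Illustrative only; adjust the codec list
     to whatever your cluster already registers. -->
<property>
  <name>io.compression.codecs</name>
  <!-- Both LzoCodec and LzopCodec must be listed. .lzo files written by
       lzop or LzopCodec are read through LzopCodec. -->
  <value>org.apache.hadoop.io.compress.GzipCodec,org.apache.hadoop.io.compress.DefaultCodec,org.apache.hadoop.io.compress.BZip2Codec,com.hadoop.compression.lzo.LzoCodec,com.hadoop.compression.lzo.LzopCodec</value>
</property>
<property>
  <name>io.compression.codec.lzo.class</name>
  <value>com.hadoop.compression.lzo.LzoCodec</value>
</property>
```

The change has to be visible to the tasks as well as the client, so the
file needs to be updated on every node, and the daemons may need a restart
to pick it up.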
Regards,
Ed
2012/1/2 Harsh J
> Hello Edward,
>
> On Mon, Jan 2, 2012 at 11:04 AM, edward choi wrote:
Hello Edward,
On Mon, Jan 2, 2012 at 11:04 AM, edward choi wrote:
> Hi,
>
> I'm having trouble handling LZO-compressed files.
> The input files are compressed with the LzopCodec provided by the hadoop-lzo package,
> and I am using Hadoop from Cloudera 3 update 2.
>
> I don't need to split the in
Hi,
The first solution is my last resort. There are so many lzo files that
manual decompression would take quite a while.
As you suggested, I have used LzoTextInputFormat, but I get the following
error:
2012-01-02 16:15:16,668 INFO org.apache.hadoop.util.NativeCodeLoader:
Loaded the native-hadoop l
You could decompress the LZO file manually into plain text and then
use TextInputFormat.
Alternatively, you don't need to index the LZO-compressed file at all:
just use LZOInputFormat on the non-indexed files, and the LZO
file will not be split.
Hi,
I'm having trouble handling LZO-compressed files.
The input files are compressed with the LzopCodec provided by the hadoop-lzo package,
and I am using Hadoop from Cloudera 3 update 2.
I don't need to split the input file, so there is no need to tell me to
index the input file and to use LzoT