The hadoop dfs -cp or -mv command seems like the perfect candidate for an
uncompress option.
 
-Ayon



________________________________
From: Harsh J <ha...@cloudera.com>
To: Ayon Sinha <ayonsi...@yahoo.com>
Cc: "hdfs-user@hadoop.apache.org" <hdfs-user@hadoop.apache.org>
Sent: Friday, June 17, 2011 1:42 AM
Subject: Re: unzip gz file in HDFS ?

Ayon,

We could write a utility for that, but the issue is that there's no
"server-side" for processing files on HDFS alone. The utility will
have to run an MR job either way, to avoid incurring network transfers
to and from the invocation machine.
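
For contrast, the client-side route that such a job avoids can be sketched
as a shell pipeline. This is a minimal sketch, not an existing Hadoop
command: hdfs_gunzip is a hypothetical helper name, and it assumes the
hadoop client and gzip are on the PATH. Every byte travels from HDFS to the
invoking machine and back, which is the network cost described above,
although nothing is written to local disk.

```shell
# Hypothetical helper (not part of Hadoop): decompress a gzip file that
# lives on HDFS by streaming it through the machine running this function.
# Usage: hdfs_gunzip <hdfs-source.gz> <hdfs-destination>
hdfs_gunzip() {
    src="$1"
    dst="$2"
    # "fs -cat" writes the raw (still compressed) bytes to stdout,
    # gzip -dc decompresses the stream, and "fs -put -" reads stdin
    # and writes it to the destination path on HDFS.
    hadoop fs -cat "$src" | gzip -dc | hadoop fs -put - "$dst"
}
```

This is reasonable for small files; for large ones the round trip through
the client is the bottleneck, which is why an MR job is suggested instead.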

Perhaps it could be added to examples, or to a set of general tools MR
provides (not aware of one)?

On Fri, Jun 17, 2011 at 2:07 PM, Ayon Sinha <ayonsi...@yahoo.com> wrote:
> Yup, thought about that. That sounds like the only way. I was hoping
> someone had already written a hadoop shell command equivalent like:
> hadoop dfs -unzip
>
> -Ayon
>
> ________________________________
> From: Harsh J <ha...@cloudera.com>
> To: hdfs-user@hadoop.apache.org; Ayon Sinha <ayonsi...@yahoo.com>
> Sent: Friday, June 17, 2011 1:00 AM
> Subject: Re: unzip gz file in HDFS ?
>
> Ayon,
>
> You can run an identity map job with no output compression set to it.
>
> On Fri, Jun 17, 2011 at 12:59 PM, Ayon Sinha <ayonsi...@yahoo.com> wrote:
>> Is there a way to unzip a gzip file within HDFS where source & target both
>> live on HDFS? I don't want to pull a large file to local and put it back.
>>
>> -Ayon
>>
>
>
>
> --
> Harsh J
>
>
>



-- 
Harsh J
