Yup, thought about that. That sounds like then only way. I was hoping someone already wrote a hadoop shell command equivalent like: hadoop dfs -unzip -Ayon See My Photos on Flickr Also check out my Blog for answers to commonly asked questions.
________________________________ From: Harsh J <ha...@cloudera.com> To: hdfs-user@hadoop.apache.org; Ayon Sinha <ayonsi...@yahoo.com> Sent: Friday, June 17, 2011 1:00 AM Subject: Re: unzip gz file in HDFS ? Ayon, You can run an identity map job with no output compression set to it. On Fri, Jun 17, 2011 at 12:59 PM, Ayon Sinha <ayonsi...@yahoo.com> wrote: > Is there a way to unzip a gzip file within HDFS where source & target both > live on HDFS? I don't want to pull a large file to local and put it back. > > -Ayon > See My Photos on Flickr > Also check out my Blog for answers to commonly asked questions. > -- Harsh J