Thanks Harshit. That approach doesn't look good as it will write uncompressed data to HDFS resulting into job side effects. -R PDate: Thu, 24 Sep 2015 09:55:49 +0530 Subject: Re: CombineFileInputFormat with Gzip files From: mathursh...@gmail.com To: user@hadoop.apache.org CC: mapreduce-u...@hadoop.apache.org
Hi R P, Follow this link, http://www.ibm.com/developerworks/library/bd-hadoopcombine/ Regards, Harshit On Thu, Sep 24, 2015 at 4:46 AM, R P <hadoo...@outlook.com> wrote: Hello All, What is the best way to process small Gzip files with CombineFileInputFormat ? If possible please provide link to the documentation.Appreciate your help. Thanks, *Adding mapreduce-dev to the mailing list. From: hadoo...@outlook.com To: user@hadoop.apache.org Subject: CombineFileInputFormat with Gzip files Date: Tue, 22 Sep 2015 18:29:05 -0700 Hello All, What is the best way to use CombineFileInputFormat with Gzip files as input? Thanks, -- Harshit Mathur