On Sat, 2009-07-11 at 10:22 -0700, jason hadoop wrote:
> There is already support for tar.gz, but it is buried.
> FileUtil provides a static unTar method.
> This is only used currently by the DistributedCache for unpacking archives.

That's a great start, but I need something that presents itself the same
way GzipCodec does and lets Hadoop natively handle .tar.gz on input.

It doesn't look like anything like this exists?

bye
andraz

> On Fri, Jul 10, 2009 at 2:58 AM, Andraz Tori <and...@zemanta.com> wrote:
>
> > Has anyone written a TarGzipCodec decompressor for .tar.gz files?
> >
> > I have a situation where not pure .gz files are available, but files
> > inside .tar.gz archive should be concatenated and treated as
> > line-oriented input.
> >
> > I know I have to write TarGzipCodec to achieve this, but it would be so
> > much nicer if anyone else already had this issue and solved it? :)

--
Andraz Tori, CTO
Zemanta Ltd, New York, London, Ljubljana
www.zemanta.com
mail: and...@zemanta.com
tel: +386 41 515 767
twitter: andraz, skype: minmax_test
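
For illustration only, here is a minimal sketch of the behaviour asked for in this thread: reading a .tar.gz as a forward-only stream and treating the contents of its files as one line-oriented input. It is written in Python with the standard tarfile module, not as a Hadoop codec. The names iter_tar_gz_lines and build_sample_archive are hypothetical, and UTF-8 text is an assumption. A real TarGzipCodec would still have to be written in Java against Hadoop's compression codec interfaces.

```python
import io
import tarfile


def iter_tar_gz_lines(fileobj):
    """Yield text lines from every regular file in a .tar.gz stream, in archive order."""
    # Mode "r|gz" reads the archive as a forward-only stream, so the
    # underlying input does not have to be seekable.
    with tarfile.open(fileobj=fileobj, mode="r|gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories, links, and other non-file entries
            data = archive.extractfile(member)
            # Lines are read per member, so a file that lacks a trailing
            # newline does not merge with the first line of the next file.
            for raw_line in data:
                # Assumption: the archived files are UTF-8 text.
                yield raw_line.decode("utf-8").rstrip("\n")


def build_sample_archive():
    """Build a small in-memory .tar.gz with two text files (demo data only)."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        for name, text in [("a.txt", "one\ntwo\n"), ("b.txt", "three\n")]:
            payload = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            archive.addfile(info, io.BytesIO(payload))
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    for line in iter_tar_gz_lines(build_sample_archive()):
        print(line)
```

The generator hides file boundaries from the caller, which is roughly what a codec-style reader would have to do so that a line-oriented input format sees a single stream.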