.tar.gz as input files

2009-07-10 Thread Andraz Tori
Has anyone written a TarGzipCodec decompressor for .tar.gz files? I have a situation where not pure .gz files are available, but files inside .tar.gz archive should be concatenated and treated as line-oriented input. I know I have to write TarGzipCodec to achieve this, but it would be so much nic

Re: .tar.gz as input files

2009-07-11 Thread jason hadoop
There is already support for tar.gz, but it is buried. FileUtil provides a static unTar method. This is only used currently by the DistributedCache for unpacking archives. On Fri, Jul 10, 2009 at 2:58 AM, Andraz Tori wrote: > Has anyone written a TarGzipCodec decompressor for .tar.gz files? > >

Re: .tar.gz as input files

2009-07-12 Thread Andraz Tori
On Sat, 2009-07-11 at 10:22 -0700, jason hadoop wrote: > There is already support for tar.gz, but it is buried. > FileUtil provides a static unTar method. > This is only used currently by the DistributedCache for unpacking archives. That's great start, I need something that represents itself in t