Re: part-00000.deflate as output

2009-11-27 Thread Mark Kerzner
Thank you, guys, for your very useful answers Mark On Fri, Nov 27, 2009 at 12:44 PM, Aaron Kimball wrote: > You are always free to run with compression disabled. But in many > production > situations, space or performance concerns dictate that all data sets are > stored compressed, so I think T

Re: part-00000.deflate as output

2009-11-27 Thread Patrick Angeles
You can always do hadoop fs -text This will 'cat' the file for you, and decompress it if necessary. On Thu, Nov 26, 2009 at 7:59 PM, Mark Kerzner wrote: > It worked! > > But why is it "for testing?" I only have one job, so I need by related as > text, can I use this fix all the time? > > Than

Re: part-00000.deflate as output

2009-11-27 Thread Aaron Kimball
You are always free to run with compression disabled. But in many production situations, space or performance concerns dictate that all data sets are stored compressed, so I think Tim was assuming that you might be operating in such an environment -- in which case, you'd only need things to appear

Re: part-00000.deflate as output

2009-11-26 Thread Mark Kerzner
It worked! But why is it "for testing?" I only have one job, so I need by related as text, can I use this fix all the time? Thank you, Mark On Thu, Nov 26, 2009 at 1:10 AM, Tim Kiefer wrote: > For testing purposes you can also try to disable the compression: > > conf.setBoolean("mapred.output.

Re: part-00000.deflate as output

2009-11-25 Thread Tim Kiefer
For testing purposes you can also try to disable the compression: conf.setBoolean("mapred.output.compress", false); Then you can look at the output. - tim Amogh Vasekar wrote: Hi, ".deflate" is the default compression codec used when parameter to generate compressed output is true ( mapred.o

Re: part-00000.deflate as output

2009-11-25 Thread Amogh Vasekar
Hi, ".deflate" is the default compression codec used when parameter to generate compressed output is true ( mapred.output.compress ). You may set the codec to be used via mapred.output.compression.codec, some commonly used are available in hadoop.io.compress package... Amogh On 11/26/09 11:03

part-00000.deflate as output

2009-11-25 Thread Mark Kerzner
Hi, I get this part-0.deflate instead of part-0. How do I get rid of the deflate option? Thank you, Mark