Re: how to unzip a .tar.bz2 file in hadoop/hdfs

2014-03-17 Thread Anthony Mattas
You want to use the BZip2Codec to un BZip the file and then use FileUtil to untar it. Anthony Mattas anth...@mattas.net On Mon, Mar 17, 2014 at 10:06 PM, Stanley Shi s...@gopivotal.com wrote: download it, unzip and put it back? Regards, *Stanley Shi,* On Fri, Mar 14, 2014 at 5:44 PM

Re: Benchmarking Hive Changes

2014-03-05 Thread Anthony Mattas
a reasonable response for a user wishing to do interactive querying. Specifically using Hive, I know you can get really good performance out of Impala, but am not yet interested in going that route. Anthony Mattas anth...@mattas.net On Wed, Mar 5, 2014 at 8:47 AM, java8964 java8...@hotmail.com wrote

Benchmarking Hive Changes

2014-03-04 Thread Anthony Mattas
I’ve been trying to benchmark some of the Hive enhancements in Hadoop 2.0 using the HDP Sandbox. I took one of their example queries and executed it with the tables stored as TEXTFILE, RCFILE, and ORC. I also tried enabling enabling vectorized execution, and predicate pushdown. SELECT