RE: Benchmarking Hive Changes

2014-03-05 Thread java8964
Are you doing on standalone one box? How large are your test files and how long of the jobs of each type took? Yong From: anth...@mattas.net Subject: Benchmarking Hive Changes Date: Tue, 4 Mar 2014 21:31:42 -0500 To: user@hadoop.apache.org I’ve been trying to benchmark some of the Hive

Re: Benchmarking Hive Changes

2014-03-05 Thread Anthony Mattas
Yes, I'm using the HortonWorks Data Platform 2.0 Sandbox which is a standalone box. But shame on me it looks like the files are both very tiny (46K), I'm seeing about 23 seconds per query, which appears mostly to be starting up MR. So I'm going to find a new data set and try again, is there any

Re: Benchmarking Hive Changes

2014-03-05 Thread Olivier Renault
, it will be really hard to archive an interactive result. MapReduce is a batch mode, period. You do want to consider Impala/spark or Apache stinger, if you really are looking for interactive. Yong -- Date: Wed, 5 Mar 2014 09:02:32 -0500 Subject: Re: Benchmarking