Re: Run multiple queries simultaneously

2015-08-25 Thread Raajay
The back-end execution engine is Tez, and I use YARN for resource management. I completely agree with your deduction that the impact on the run time will be dependent on the nature of the queries. I would like to conduct some experiments (for a given workload, cluster configuration) to quantify th

RE: Run multiple queries simultaneously

2015-08-25 Thread Ryan Harris
You need to be a bit more clear with your environment and objective here What is your back-end execution engine? MapReduce, Spark, or Tez? What are you using for resource management? YARN or MapReduce? The running time of one query in the presence of other queries will entirely depend on the

Re: Run multiple queries simultaneously

2015-08-25 Thread Sergey Shelukhin
You can start HiveServer2, then submit queries to it using JDBC. If you open multiple sessions using multiple threads, you will be able to submit queries in parallel, although the compilation is still currently serialized. From: Raajay mailto:raaja...@gmail.com>> Reply-To: "user@hive.apache.org<

Re: Run multiple queries simultaneously

2015-08-25 Thread Raajay
Noam, I am concerned with cases where the network is a bottleneck. Will i be able control it in YARN ? Ideally, I would like to run multiple queries simultaneously. Raajay On Tue, Aug 25, 2015 at 9:31 AM, Noam Hasson wrote: > I would just limit the resources given to the user on YARN. > > On

Re: Run multiple queries simultaneously

2015-08-25 Thread Noam Hasson
I would just limit the resources given to the user on YARN. On Tue, Aug 25, 2015 at 4:21 PM, Raajay wrote: > Hello, > > I want to compare the running time of an query when run alone against the > run time in presence of other queries. > > What is the ideal setup required to run this experiment ?