The back-end execution engine is Tez, and I use YARN for resource
management.
I completely agree with your deduction that the impact on the run time will
be dependent on the nature of the queries. I would like to conduct some
experiments (for a given workload, cluster configuration) to quantify th
You need to be a bit more clear with your environment and objective here
What is your back-end execution engine? MapReduce, Spark, or Tez?
What are you using for resource management? YARN or MapReduce?
The running time of one query in the presence of other queries will entirely
depend on the
You can start HiveServer2, then submit queries to it using JDBC. If you open
multiple sessions using multiple threads, you will be able to submit queries in
parallel, although the compilation is still currently serialized.
From: Raajay mailto:raaja...@gmail.com>>
Reply-To: "user@hive.apache.org<
Noam,
I am concerned with cases where the network is a bottleneck. Will i be able
control it in YARN ? Ideally, I would like to run multiple queries
simultaneously.
Raajay
On Tue, Aug 25, 2015 at 9:31 AM, Noam Hasson
wrote:
> I would just limit the resources given to the user on YARN.
>
> On
I would just limit the resources given to the user on YARN.
On Tue, Aug 25, 2015 at 4:21 PM, Raajay wrote:
> Hello,
>
> I want to compare the running time of an query when run alone against the
> run time in presence of other queries.
>
> What is the ideal setup required to run this experiment ?