Re: Enabling Tez sessions on HiveServer2

2014-12-19 Thread Pala M Muthaia
32 PM, Gopal V wrote: > > On 12/3/14, 3:34 PM, Pala M Muthaia wrote: > >> I didn't know doAs needs to be turned off. But I don't think that is >> something to give up - users create tables, manage data, query etc, and we >> need the queries/jobs to run as the user

Re: Enabling Tez sessions on HiveServer2

2014-12-03 Thread Pala M Muthaia
2 settings need to be in place at the time > of starting the server. > > Regards > Vikram. > > On Tue, Dec 2, 2014 at 2:42 PM, Hitesh Shah wrote: > > BCC’ed user@tez. > > > > This question belongs to either the hive user list or the Hortonworks > user forums.

Re: Enabling Tez sessions on HiveServer2

2014-12-02 Thread Pala M Muthaia
: > BCC’ed user@tez. > > This question belongs to either the hive user list or the Hortonworks user > forums. > > thanks > — Hitesh > > On Dec 2, 2014, at 1:28 PM, Pala M Muthaia > wrote: > > > Hi, > > > > I am trying to get Tez sessions enabled wi

Enabling Tez sessions on HiveServer2

2014-12-02 Thread Pala M Muthaia
Hi, I am trying to get Tez sessions enabled with HS2. I start the HiveServer2 instance with the flag "-hiveconf hive.execution.engine=tez" and then try to submit multiple queries one after another, as the same user, to the HS2 instance. When I check the YARN UI, I find that each query of mine is

Re: Why does SMB join generate hash table locally, even if input tables are large?

2014-07-30 Thread Pala M Muthaia
+hive-users On Tue, Jul 29, 2014 at 1:56 PM, Pala M Muthaia wrote: > Hi, > > I am testing SMB join for 2 large tables. The tables are bucketed and > sorted on the join column. I notice that even though the table is large, > Hive attempts to generate hash table for the '

Re: Hive on Tez: Diagnosing query execution issues

2014-05-23 Thread Pala M Muthaia
Adding the right hive users alias. On Fri, May 23, 2014 at 5:52 PM, Pala M Muthaia wrote: > Hi, > > I am trying to run a relatively heavy Hive query that joins 3 tables. The > query succeeds on MR after increasing the mapper and reducer container > memory: > > set mapredu

Re: DISTRIBUTE BY works incorrectly in Hive 0.11 in some cases

2013-08-26 Thread Pala M Muthaia
> >>> optimization to work the group bys partitioning should be on the column > >>> 1 only. > >>> > >>> It won't affect the correctness of group by, can make it slow but in > >>> this case will speed up the overall query performance. >

Re: DISTRIBUTE BY works incorrectly in Hive 0.11 in some cases

2013-08-23 Thread Pala M Muthaia
I have attached the Hive 0.10 and 0.11 query plans, for the sample query below, for illustration. On Fri, Aug 23, 2013 at 5:35 PM, Pala M Muthaia wrote: > Hi, > > We are using DISTRIBUTE BY with custom reducer scripts in our query > workload. > > After upgrade to Hive 0.11, que

DISTRIBUTE BY works incorrectly in Hive 0.11 in some cases

2013-08-23 Thread Pala M Muthaia
Hi, We are using DISTRIBUTE BY with custom reducer scripts in our query workload. After upgrading to Hive 0.11, queries with GROUP BY/DISTRIBUTE BY/SORT BY and custom reducer scripts produced incorrect results. In particular, rows with the same value in the DISTRIBUTE BY column end up in multiple reducers