Re: mapjoin with left join

2015-09-22 Thread Steve Howard
Hi Gopal/All,

Yep, I absolutely understand the limitation of what we are trying to do. We will try the settings you suggested.

Thanks,
Steve

On Tue, Sep 22, 2015 at 1:44 PM, Gopal Vijayaraghavan wrote:
> > select small.* from small s left join large l on s.id = l.id

Re: mapjoin with left join

2015-09-22 Thread Gopal Vijayaraghavan
> select small.* from small s left join large l on s.id = l.id where l.id is null;
...
> We simply want to load the 81K rows into RAM, then for each row in large, check the small hash table and if the row in small is not in large, then add it to
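
For readers following the thread, the approach quoted above (hash the small table, stream the large one, keep the small rows that never match) can be sketched in plain Python. This is only a single-process illustration and not Hive's implementation: the function name `small_rows_missing_from_large`, the dict-shaped rows, and the `id` key are assumptions made up for the example. In a distributed map-side join each task sees only part of large, so a row that is unmatched in one split may still match in another.

```python
from collections import defaultdict


def small_rows_missing_from_large(small_rows, large_ids):
    """Return the rows of small whose id never appears in large.

    Hypothetical helper: small_rows is an iterable of dicts with an "id"
    key, large_ids is an iterable of ids streamed from the large table.
    """
    # Build the in-memory hash table from the small side, keeping
    # duplicate ids so that no small row is lost.
    unmatched = defaultdict(list)
    for row in small_rows:
        unmatched[row["id"]].append(row)

    # Stream the large side once; it is never held in memory.
    for large_id in large_ids:
        # In SQL a NULL id never satisfies s.id = l.id, so skip None.
        if large_id is not None:
            unmatched.pop(large_id, None)
        # Nothing left to report, so stop reading large early.
        if not unmatched:
            break

    # Whatever is still in the table had no partner in large.
    return [row for rows in unmatched.values() for row in rows]
```

Memory use in this sketch is bounded by the size of small, which is the property the original question is after; the result is only correct because one process sees every row of large.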

Re: mapjoin with left join

2015-09-20 Thread Noam Hasson
: Steve Howard
> Reply-To: "user@hive.apache.org"
> Date: Friday, September 11, 2015 at 09:48
> To: "user@hive.apache.org"
> Subject: mapjoin with left join
>
> We would like to utilize mapjoin for the following SQL construct:
>
> select small.* from sma

Re: mapjoin with left join

2015-09-11 Thread Sergey Shelukhin
"user@hive.apache.org" <user@hive.apache.org>
Date: Friday, September 11, 2015 at 09:48
To: "user@hive.apache.org" <user@hive.apache.org>
Subject: mapjoin with left join

We would like to utiliz

mapjoin with left join

2015-09-11 Thread Steve Howard
We would like to utilize mapjoin for the following SQL construct:

select small.* from small s left join large l on s.id = l.id where l.id is null;

We can easily fit small into RAM, but large is over 1TB according to optimizer stats. Unless we set hive.auto.convert.join.noconditionaltask.size = to