It works, but it takes a very long time because the subquery in NOT IN
returns 400 million rows (the message table in the example), while the feed
table contains 3 million rows.
SELECT uuid FROM feed f WHERE f.uuid NOT IN (SELECT uuid FROM message);
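
For reference, the left outer join form discussed in the quoted thread below would look roughly like this. It is only a sketch: it reuses the feed and message tables and the uuid column from the query above, and the aliases f and m are placeholders.

```sql
-- Anti-join sketch: keep only the feed rows that have no matching message row.
-- Rows from feed with no match get NULL in every column of m,
-- so filtering on m.uuid IS NULL leaves exactly the unmatched uuids.
SELECT f.uuid
FROM feed f
LEFT OUTER JOIN message m ON (f.uuid = m.uuid)
WHERE m.uuid IS NULL;
```

One difference to keep in mind: if message.uuid can contain NULL, standard NOT IN semantics return no rows at all, whereas the join form simply never matches those NULLs. The two queries agree only when message.uuid has no NULL values.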
> Date: Sun, 5 May 2013 20:25:15 -0700
> From: michaelma...@yahoo.com
> Subject: Re: Hive QL - NOT IN, NOT EXIST
> To: user@hive.apache.org
> 
> 
> --- On Sun, 5/5/13, Peter Chu <pete....@outlook.com> wrote:
> 
> > I am wondering if there is any way to do this without resorting to
> > using left outer join and finding nulls.
> 
> I have found this to be an acceptable substitute.  Is it not working for you?
> 
                                          
