It works but it takes a very long time because the subqueries in NOT IN
contains 400 million rows (the message table in the example) and the feed table
contains 3 million rows.
SELECT uuid from feed f WHERE f.uuid NOT IN (SELECT uuid FROM message);
> Date: Sun, 5 May 2013 20:25:15 -0700
> From: michaelma...@yahoo.com
> Subject: Re: Hive QL - NOT IN, NOT EXIST
> To: user@hive.apache.org
>
>
> --- On Sun, 5/5/13, Peter Chu <pete....@outlook.com> wrote:
>
> > I am wondering if there is any way to do this without resorting to
> > using left outer join and finding nulls.
>
> I have found this to be an acceptable substitute. Is it not working for you?
>