I would do the following (obviously this is a bit shorthand):

a = load 'data1';
b = load 'data2';
c = cogroup a by $0, b by $0;
d = filter c by IsEmpty(b);

d would be a relation with only the keys and their corresponding rows which
exist in a

2012/1/24 Chan, Tim <[email protected]>

> I would like to generate a set of data that represents the items not found
> in another set.
> How would I do this using Pig?
>
> I'm thinking I would do an outer join and then filter off the items that
> were matched.
>
>

Reply via email to