I would do the following (obviously this is a bit shorthand): a = load 'data1'; b = load 'data2'; c = cogroup a by $0, b by $0; d = filter c by IsEmpty(b);
d would be a relation with only the keys and their corresponding rows which exist in a 2012/1/24 Chan, Tim <[email protected]> > I would like to generate a set of data that represents the items not found > in another set. > How would I do this using Pig? > > I'm thinking I would do an outer join and then filter off the items that > were matched. > >
