Hi Gagan, This link may help you https://bluewatersql.wordpress.com/2013/10/04/3-little-piggys-advanced-pig-join-scenarios/
On 30 July 2015 at 22:04, Alan Gates <alanfga...@gmail.com> wrote: > Here's the original design doc: > https://wiki.apache.org/pig/PigSkewedJoinSpec > > Alan. > > Gagan Juneja <gagandeepjun...@gmail.com> > July 29, 2015 at 21:30 > Any help? > > Regards, > Gagan > > > Gagan Juneja <gagandeepjun...@gmail.com> > July 14, 2015 at 4:56 > Hi Team, > > We are using Pig intensively in our various projects. We are doing > optimizations for that we wanted to know how join works. Though we have > moved to Skewed joins for some of our use cases. > > At many places in the documentation this is mentioned that in join data is > streamed for second table. But I was identify how this can fit in map > reduce paradigm. > > 1. Can anyone please clarify how join happens in pig. > 2. What is the meaning of Streaming here? Are we loading the files directly > in the reducres? > > > Regards, > Gagan > >