Re: replicated join vs regular ?

2012-01-30 Thread Alan Gates
Yeah, maybe I should have said "right or outer join". What I wanted to make clear is that if you want to identify non-matches in the large (fragment, or left side) you can still use fragment-replicate join. If you want to identify non-matches in the small (replicate, or right side) you cannot.

How Logical Plan Generator works?

2012-01-30 Thread Prasanth J
Hello Everyone I am a newbie to pig. I was going through https://cwiki.apache.org/PIG/guide-for-new-contributors.html, specifically the grammar files to start off with. I could not understand how LogicalPlanGenerator.g works by looking into the grammar file. Also there isn't much documentation av

Re: explode operation

2012-01-30 Thread Stan Rosenberg
On Mon, Jan 30, 2012 at 2:25 AM, Aniket Mokashi wrote: > Isnt FLATTEN similar to explode? Not quite. EXPLODE would take a record with n fields and generate n records.

Re: replicated join vs regular ?

2012-01-30 Thread Vincent Barat
I understand you point and it makes sense. The graph in Alan's book says that if you "outer join on the small input" you should not use replicated join. Maybe this sentence is not clear enough :) Le 28/01/12 00:21, Alex Rovner a écrit : From what I understand replicated should not be used