Shahab Thanks My doubt is why are we taking the bag b and not bag c as the arguement in the COUNT(b) function. The bag c contains the groups and not hte bag b. TThanks.
On Mon, Jul 21, 2014 at 6:21 PM, Shahab Yunus <[email protected]> wrote: > Have you seen this documentation and blog? > http://squarecog.wordpress.com/2010/05/11/group-operator-in-apache-pig/ > http://pig.apache.org/docs/r0.9.2/func.html#count > > They explain this in detail. > > Regards, > Shahab > > > On Mon, Jul 21, 2014 at 8:44 AM, Ashish Dobhal <[email protected]> > wrote: > > > a = load '/user/hue/word_count_text.txt'; > > b = foreach a generate flatten(TOKENIZE((chararray)$0)) as word; > > c = group b by word; > > d = foreach c generate COUNT(b), group; > > > > I want to know what would be the input to the udf COUNT in this > > case.Also what is the meaning of b being passed as an arguement. > > > > Also I am still not clear acout how count operates. > > > > Thanks > > > > Ashish > > >
