b = foreach a generate flatten(TOKENIZE((chararray)$0)) as word;
--result
--aa
--bb
--cc
--cc
--cc

c = group b by word;
--each record is (group key, bag of the matching tuples from b)
--(aa,{(aa)})
--(bb,{(bb)})
--(cc,{(cc),(cc),(cc)})

d = foreach c generate COUNT(b), group;
--1, aa
--1, bb
--3, cc
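
--To check what COUNT receives, you can inspect the schema of c.
--A minimal sketch, assuming a, b and c are defined as in your script;
--the exact formatting of the output may differ between Pig versions.
describe c;
--should print something like:
--c: {group: chararray,b: {(word: chararray)}}
--
--So each record of c has two fields: group (the word) and b, a bag that
--holds every tuple of relation b with that word. In COUNT(b), b refers to
--that bag field, not to the whole relation b. COUNT is called once per
--record of c and returns the number of tuples in the bag it is given.
--Note that COUNT skips tuples whose first field is null; COUNT_STAR
--counts those as well.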



2014-07-21 16:44 GMT+04:00 Ashish Dobhal <dobhalashish...@gmail.com>:

> a = load '/user/hue/word_count_text.txt';
> b = foreach a generate flatten(TOKENIZE((chararray)$0)) as word;
> c = group b by word;
> d = foreach c generate COUNT(b), group;
>
> I want to know what the input to the UDF COUNT would be in this
> case. Also, what is the meaning of b being passed as an argument?
>
> Also, I am still not clear about how COUNT operates.
>
> Thanks
>
> Ashish
>
