b = foreach a generate flatten(TOKENIZE((chararray)$0)) as word; --result --aa --bb --cc --cc --cc
c = group b by word; --aa{aa} --bb{bb} --cc{cc,cc,cc} d = foreach c generate COUNT(b), group; --1, aa --1, bb --3, cc 2014-07-21 16:44 GMT+04:00 Ashish Dobhal <dobhalashish...@gmail.com>: > a = load '/user/hue/word_count_text.txt'; > b = foreach a generate flatten(TOKENIZE((chararray)$0)) as word; > c = group b by word; > d = foreach c generate COUNT(b), group; > > I want to know what would be the input to the udf COUNT in this > case.Also what is the meaning of b being passed as an arguement. > > Also I am still not clear acout how count operates. > > Thanks > > Ashish >