That's actually the documented behavior:
https://pig.apache.org/docs/r0.10.0/func.html#count
There was some discussion about changing this:
https://issues.apache.org/jira/browse/PIG-1014
Patches gratefully accepted..
D
On Sat, Sep 14, 2013 at 12:01 AM, centerqi hu wrote:
> The sample.txt fil
The sample.txt file content:
android,u1,taobao1
android,u1,taobao1
,u2,taobao2
RR = LOAD '/user/www/udc/output/bugfind/sample.txt' USING PigStorage(',')
as (platform, machineID, productID);
RB = GROUP RR BY (productID);
RES = FOREACH RB{
ITEMUV = DISTINCT RR.machineID;