[ 
https://issues.apache.org/jira/browse/PIG-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12765241#action_12765241
 ] 

Pradeep Kamath commented on PIG-1014:
-------------------------------------

Here are the options we have in my mind:
1) status Quo which is confusing - so not quite an option
2) We disallow anything other than counting a single column with COUNT - So 
COUNT(A), COUNT(A.*), COUNT(A.($0,$1)) all would throw an error saying COUNT 
does not support this - I this case would COUNT_STAR be the suggestion to 
users? It would seem so for COUNT(A) and COUNT(A.*) not so sure for 
COUNT(A.($0,$1))
3) Same as 2) but instead of throwing an error, the parser replaces COUNT with 
COUNT_STAR - here again we need to decide what we should do with 
COUNT(A.($0,$1))
4) Change COUNT to look at all fields in the bag and only if all fields are 
null not count that towards the result - In this case, COUNT(A) and 
COUNT_STAR(A) will not give same results but we could document this.


> Pig should convert COUNT(relation) to COUNT_STAR(relation) so that all 
> records are counted without considering nullness of the fields in the records
> ----------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: PIG-1014
>                 URL: https://issues.apache.org/jira/browse/PIG-1014
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.4.0
>            Reporter: Pradeep Kamath
>


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to