[
https://issues.apache.org/jira/browse/PIG-2536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13539214#comment-13539214
]
Haitao Yao commented on PIG-2536:
---------------------------------
But I do think this feature is very useful. Without this feature, I have to
write like this:
A = load 'test_data' as (a,b,c,d);
B1 = foreach A generate a,b,c;
D1 = distinct B1;
-- or like this:
D = distinct A.(a,b,c);
less code , less error. I really hope this can be merged into trunk.
thanks.
> Extend pig to support DISTINCT x.(project)
> ------------------------------------------
>
> Key: PIG-2536
> URL: https://issues.apache.org/jira/browse/PIG-2536
> Project: Pig
> Issue Type: Improvement
> Reporter: Jonathan Coveney
> Assignee: Jonathan Coveney
> Priority: Minor
> Fix For: 0.11
>
> Attachments: PIG-2436-0.patch
>
>
> Currently, pig does not allow this syntax:
> {code}
> A = load 'thing' (x:int, y:int, z:int);
> B = distinct A.x;
> C = distinct A.(y,z)
> D = distinct C.$0;
> {code}
> and so on. With this patch, it does. I should probably add more tests, though
> it's a simple patch... it just turns distinct rel.proj into syntactic sugar
> for distinct (foreach rel generate proj)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira