Re: SIZE() of relation

2010-06-11 Thread Russell Jurney
This would be great. Save us from GROUP ALL/FOREACH, which is awkward. On Fri, Jun 11, 2010 at 7:14 PM, Dmitriy Ryaboy wrote: > It would be cool to just treat relations as bags in the general case. They > kind of are, and kind of are not. Causes lots of user confusion. > There are obvious users

Re: SIZE() of relation

2010-06-11 Thread Dmitriy Ryaboy
It would be cool to just treat relations as bags in the general case. They kind of are, and kind of are not. Causes lots of user confusion. There are obvious users-doing-dumb-stuff scenarios that arise though. I guess the Pig philosophy is that the user is the optimizer, though.. so maybe it's ok.

SIZE() of relation

2010-06-11 Thread Russell Jurney
Would it be possible, and not a ton of work to make the builtin SIZE() work on a relation? Reason being, I frequently do this: B = GROUP A ALL; C = FOREACH B GENERATE SIZE(A) AS total; DUMP C; And I would rather do this: DUMP SIZE(A); Russ

[jira] Commented: (PIG-1295) Binary comparator for secondary sort

2010-06-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12878130#action_12878130 ] Daniel Dai commented on PIG-1295: - Can you also attach performance test code? I want to take

[jira] Commented: (PIG-1295) Binary comparator for secondary sort

2010-06-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12878127#action_12878127 ] Daniel Dai commented on PIG-1295: - "the new comparator takes between 8 and 10 seconds. The ol

[jira] Commented: (PIG-1295) Binary comparator for secondary sort

2010-06-11 Thread Gianmarco De Francisci Morales (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12878120#action_12878120 ] Gianmarco De Francisci Morales commented on PIG-1295: - I did some more de

[jira] Created: (PIG-1447) Tune memory usage of InternalCachedBag

2010-06-11 Thread Daniel Dai (JIRA)
Tune memory usage of InternalCachedBag -- Key: PIG-1447 URL: https://issues.apache.org/jira/browse/PIG-1447 Project: Pig Issue Type: Improvement Components: impl Affects Versions: 0.7.0

[jira] Commented: (PIG-1405) Need to move many standard functions from piggybank into Pig

2010-06-11 Thread Alan Gates (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12877923#action_12877923 ] Alan Gates commented on PIG-1405: - I'm fine with moving COR and COV. The goal of this is to

[jira] Commented: (PIG-1405) Need to move many standard functions from piggybank into Pig

2010-06-11 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12877918#action_12877918 ] Aniket Mokashi commented on PIG-1405: - Do we need to add a function variance? or we need

[jira] Commented: (PIG-1405) Need to move many standard functions from piggybank into Pig

2010-06-11 Thread Aniket Mokashi (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12877916#action_12877916 ] Aniket Mokashi commented on PIG-1405: - As per the comments above, the existing classes wi

[jira] Commented: (PIG-1405) Need to move many standard functions from piggybank into Pig

2010-06-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12877903#action_12877903 ] Daniel Dai commented on PIG-1405: - Aniket, in the patch we also need to remove these UDFs fro

[jira] Commented: (PIG-972) Make describe work with nested foreach

2010-06-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12877901#action_12877901 ] Daniel Dai commented on PIG-972: Seems the second trunk of PigServer.java in the patch is irre

[jira] Updated: (PIG-1443) DefaultTuple underestimate the memory footprint for string

2010-06-11 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai updated PIG-1443: Status: Resolved (was: Patch Available) Hadoop Flags: [Reviewed] Resolution: Fixed Patch com

[jira] Updated: (PIG-1446) OOME in a query having a bincond in the inner plan of a Foreach.

2010-06-11 Thread Ashutosh Chauhan (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashutosh Chauhan updated PIG-1446: -- Status: Resolved (was: Patch Available) Fix Version/s: 0.8.0 0.7.0

[jira] Updated: (PIG-1444) [Zebra] Zebra build should have a test-smoke target

2010-06-11 Thread Yan Zhou (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Zhou updated PIG-1444: -- Status: Resolved (was: Patch Available) Assignee: Gaurav Jain Fix Version/s: 0.7.0