pig cogroup by null

2015-09-09 Thread 李运田
table1 ,aa 14, 12,bb table2 ,aa 14, 12,bb k = cogroup table1 by (a,b),table2 by (a,b); k: ((,aa),{(,aa)},{}) ((,aa),{},{(,aa)}) ((14,),{(14,)},{(14,)}) ((12,bb),{(12,bb)},{(12,bb)}) when I use cogroup,(,aa) null is in front,(14,) null is not in front. is there a parameter I can get result

[jira] Subscription: PIG patch available

2015-09-09 Thread jira
Issue Subscription Filter: PIG patch available (30 issues) Subscriber: pigdaily Key Summary PIG-4670Embedded Python scripts still parse line by line https://issues.apache.org/jira/browse/PIG-4670 PIG-4663HBaseStorage should allow the MaxResultsPerColumnFamily limit to

[jira] [Updated] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-09 Thread Murali Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Murali Rao updated PIG-4673: Description: Lets say we have a string = 'A1B2C3D4'. Our objective is to replace A with 1, B with 2, C with

[jira] [Updated] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-09 Thread Murali Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Murali Rao updated PIG-4673: Description: Lets say we have a string = 'A1B2C3D4'. Our objective is to replace A with 1, B with 2, C with

[jira] [Created] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-09 Thread Murali Rao (JIRA)
Murali Rao created PIG-4673: --- Summary: Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values. Key: PIG-4673 URL:

[jira] [Updated] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-09 Thread Murali Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Murali Rao updated PIG-4673: Description: Lets say we have a string = 'A1B2C3D4'. Our objective is to replace A with 1, B with 2, C with

[jira] [Updated] (PIG-4673) Built In UDF - REPLACE_MULTI : For a given string, search and replace all occurrences of search keys with replacement values.

2015-09-09 Thread Murali Rao (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Murali Rao updated PIG-4673: Description: Lets say we have a string = 'A1B2C3D4'. Our objective is to replace A with 1, B with 2, C with

Re: pig cogroup by null

2015-09-09 Thread Daniel Dai
There is no setting, however you can rewrite your query as the following: table1x = foreach table1 generate (a is null?'':a) as a, (b is null?'':b) as b; table2x = foreach table2 generate (a is null?'':a) as a, (b is null?'':b) as b; k = cogroup table1x by (a,b),table2x by (a,b); Daniel On

[jira] [Commented] (PIG-4667) Enable Pig on Spark to run on Yarn Client/Cluster mode

2015-09-09 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14737405#comment-14737405 ] Marcelo Vanzin commented on PIG-4667: - Unfortunately, Spark currently assumes that there is an assembly

[jira] [Commented] (PIG-4667) Enable Pig on Spark to run on Yarn Client/Cluster mode

2015-09-09 Thread Xuefu Zhang (JIRA)
[ https://issues.apache.org/jira/browse/PIG-4667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14737394#comment-14737394 ] Xuefu Zhang commented on PIG-4667: -- [~sriksun], jar problem is indeed very complicated. I'm a little

Re: Hive UDF's vs. "native" Pig UDF's

2015-09-09 Thread Daniel Dai
There are some moderate overhead for Hive UDF. My test shows around 10%-20% slow down than Pig native UDF. I will create a document Jira. Thanks, Daniel On 9/9/15, 9:27 AM, "Rohini Palaniswamy" wrote: >Daniel, > Not sure you saw this. We will have to document the

[jira] [Created] (PIG-4672) Document performance implication for Hive UDF

2015-09-09 Thread Daniel Dai (JIRA)
Daniel Dai created PIG-4672: --- Summary: Document performance implication for Hive UDF Key: PIG-4672 URL: https://issues.apache.org/jira/browse/PIG-4672 Project: Pig Issue Type: Task