table1
,aa
14,
12,bb
table2
,aa
14,
12,bb
k = cogroup table1 by (a,b),table2 by (a,b);
k:
((,aa),{(,aa)},{})
((,aa),{},{(,aa)})
((14,),{(14,)},{(14,)})
((12,bb),{(12,bb)},{(12,bb)})
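When I use cogroup, the key (,aa) has its null in the first field and its rows end up in two separate groups, while the key (14,) has its null in the second field and its rows are grouped together. Is there
a parameter I can set to get the result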
Issue Subscription
Filter: PIG patch available (30 issues)
Subscriber: pigdaily
Key Summary
PIG-4670 Embedded Python scripts still parse line by line
https://issues.apache.org/jira/browse/PIG-4670
PIG-4663 HBaseStorage should allow the MaxResultsPerColumnFamily limit to
[
https://issues.apache.org/jira/browse/PIG-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Murali Rao updated PIG-4673:
Description:
Lets say we have a string = 'A1B2C3D4'. Our objective is to replace A with 1, B
with 2, C with
Murali Rao created PIG-4673:
---
Summary: Built In UDF - REPLACE_MULTI : For a given string, search
and replace all occurrences of search keys with replacement values.
Key: PIG-4673
URL:
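The behaviour proposed in the PIG-4673 summary can be sketched in Python as below. This is only an illustration of "search and replace all occurrences of search keys with replacement values"; it is not the UDF from the issue. The function name `replace_multi`, its signature, and the single-pass, longest-key-first matching are assumptions, and the mapping uses only the two pairs the description spells out before it is cut off.

```python
import re


def replace_multi(text, replacements):
    """Replace every occurrence of each search key in one pass over the text.

    Hypothetical helper: the name, signature, and matching rules are
    assumptions, not the UDF proposed in PIG-4673.
    """
    if not replacements:
        return text
    # Try longer keys first so a key that is a prefix of another does not win.
    keys = sorted(replacements, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(key) for key in keys))
    # A single pass means replaced text is never searched again.
    return pattern.sub(lambda match: replacements[match.group(0)], text)


result = replace_multi("A1B2C3D4", {"A": "1", "B": "2"})
print(result)
```

Doing the work in one pass avoids the chaining effect of nested single replacements, where an earlier replacement's output can be rewritten by a later one.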
There is no setting for this; however, you can rewrite your query as follows:
table1x = foreach table1 generate (a is null ? '' : a) as a, (b is null ? '' : b) as b;
table2x = foreach table2 generate (a is null ? '' : a) as a, (b is null ? '' : b) as b;
k = cogroup table1x by (a,b), table2x by (a,b);
Daniel
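The effect of that rewrite can be modelled with a short Python sketch. It is not Pig code and does not reproduce Pig's null handling; it only shows that once every null is replaced by an empty string, the keys are ordinary values and each key gets one bag per input table. The helper names `normalize` and `cogroup` are hypothetical, and the columns are assumed to be strings because the rewrite substitutes `''`.

```python
from collections import defaultdict


def normalize(rows):
    """Mirror `(a is null ? '' : a)`: replace None with the empty string."""
    return [tuple("" if value is None else value for value in row) for row in rows]


def cogroup(left, right):
    """Group two lists of tuples by the whole tuple, keeping one bag per input."""
    groups = defaultdict(lambda: ([], []))
    for row in left:
        groups[row][0].append(row)
    for row in right:
        groups[row][1].append(row)
    return dict(groups)


# The sample data from the question, with None standing in for Pig nulls.
table1 = [(None, "aa"), ("14", None), ("12", "bb")]
table2 = [(None, "aa"), ("14", None), ("12", "bb")]

k = cogroup(normalize(table1), normalize(table2))
for key, (bag1, bag2) in k.items():
    print(key, bag1, bag2)
```

One consequence of this approach is that a null and a genuine empty string become indistinguishable after the rewrite, so it only fits data where that distinction does not matter.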
On
[
https://issues.apache.org/jira/browse/PIG-4667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14737405#comment-14737405
]
Marcelo Vanzin commented on PIG-4667:
-
Unfortunately, Spark currently assumes that there is an assembly
[
https://issues.apache.org/jira/browse/PIG-4667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14737394#comment-14737394
]
Xuefu Zhang commented on PIG-4667:
--
[~sriksun], jar problem is indeed very complicated. I'm a little
There is some moderate overhead for Hive UDFs. My tests show them running around
10%-20% slower than Pig native UDFs. I will create a documentation Jira.
Thanks,
Daniel
On 9/9/15, 9:27 AM, "Rohini Palaniswamy" wrote:
>Daniel,
> Not sure you saw this. We will have to document the
Daniel Dai created PIG-4672:
---
Summary: Document performance implication for Hive UDF
Key: PIG-4672
URL: https://issues.apache.org/jira/browse/PIG-4672
Project: Pig
Issue Type: Task