[
https://issues.apache.org/jira/browse/DATAFU-45?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13985743#comment-13985743
]
Matthew Hayes commented on DATAFU-45:
-------------------------------------
I would try experimenting with what you are generating in my_stage2 to zero in
on what is causing the problem.
For example what happens if you try:
{code}
my_stage2 = foreach my_stage1 {
keywords = cross keywords, TOBAG(weight);
keywords2 = foreach keywords GENERATE keyword;
generate id, keywords2;
};
{code}
If this works then there is something weird going on with the weight. Maybe
Pig is confused by the {{null::weight}} name.
> RFE: CartesianProduct
> ---------------------
>
> Key: DATAFU-45
> URL: https://issues.apache.org/jira/browse/DATAFU-45
> Project: DataFu
> Issue Type: New Feature
> Reporter: Sam Steingold
>
> Given two bags, produce their [Cartesian
> product|http://en.wikipedia.org/wiki/Cartesian_product]:
> {code}
> B1: bag{T1}
> B2: bag{T2}
> CartesianProduct(B1,B2): bag{(T1,T2)}
> {code}
> Use case:
> {code}
> toks = TOKENIZE((charray)$0,',');
> kwds = CartesianProduct(toks, {1.0/(double)SIZE(toks)});
> {code}
--
This message was sent by Atlassian JIRA
(v6.2#6252)