[ 
https://issues.apache.org/jira/browse/PIG-2163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhijie Shen updated PIG-2163:
-----------------------------

    Attachment: PIG-2163.patch

Attached is the patch for this issue. Assume there are n bags as input. Now 
POCross only create n - 1 temporal bags.

The general logic is iterating the tuple of the first bag (the left-most one in 
the bag list) and merging it with all the combinations with the tuples stored 
in the n - 1 temporal bags. Choosing the first bag to iterate separately is to 
keep the order of the cross product in this method the same as that with n 
temporal bags.



> Improve nested cross to stream one relation
> -------------------------------------------
>
>                 Key: PIG-2163
>                 URL: https://issues.apache.org/jira/browse/PIG-2163
>             Project: Pig
>          Issue Type: Improvement
>          Components: impl
>    Affects Versions: 0.10
>            Reporter: Daniel Dai
>            Assignee: Zhijie Shen
>             Fix For: 0.10
>
>         Attachments: PIG-2163.patch
>
>
> PIG-1916 added nested cross support for PIG. One optimization is instead of 
> materialize all bags before producing result, we can stream one of the input 
> to save on memory.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to