[ 
https://issues.apache.org/jira/browse/PIG-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12831585#action_12831585
 ] 

Ashutosh Chauhan commented on PIG-1230:
---------------------------------------

This patch switches POJoinPackage to use NonSpillableDataBag for last bag 
instead of currently used InternalCachedBag. Both of these bag implementations 
are already covered by existing unit tests and thus this patch needs no new 
tests. 

> Streaming input in POJoinPackage should use nonspillable bag to collect tuples
> ------------------------------------------------------------------------------
>
>                 Key: PIG-1230
>                 URL: https://issues.apache.org/jira/browse/PIG-1230
>             Project: Pig
>          Issue Type: Bug
>          Components: impl
>    Affects Versions: 0.6.0
>            Reporter: Ashutosh Chauhan
>            Assignee: Ashutosh Chauhan
>             Fix For: 0.7.0
>
>         Attachments: pig-1230.patch, pig-1230_1.patch
>
>
> Last table of join statement is streamed through instead of collecting all 
> its tuple in a bag. As a further optimization of that, tuples of that 
> relation are collected in chunks in a bag. Since we don't want to spill the 
> tuples from this bag, NonSpillableBag should be used to hold tuples for this 
> relation. Initially, DefaultDataBag was used, which was later changed to 
> InternalCachedBag as a part of PIG-1209.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to