[ 
https://issues.apache.org/jira/browse/PIG-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106873#comment-13106873
 ] 

Ryan Hoegg commented on PIG-2290:
---------------------------------

This is probably true, but the current implementation behaves inconsistently:
the output does not match the declared schema.  It also makes it impossible to
use TOBAG to generate a bag containing tuples with more than one element.

Is 0.9 -> 0.10 considered a "major version upgrade"?

> TOBAG wraps tuple parameters in another tuple
> ---------------------------------------------
>
>                 Key: PIG-2290
>                 URL: https://issues.apache.org/jira/browse/PIG-2290
>             Project: Pig
>          Issue Type: Bug
>          Components: internal-udfs
>    Affects Versions: 0.9.0
>            Reporter: Ryan Hoegg
>            Assignee: Dmitriy V. Ryaboy
>         Attachments: pig-2290.patch
>
>
> The TOBAG function indiscriminately wraps every parameter in a tuple.  When I 
> pass a list of tuples to the function, I would expect it to return a bag 
> containing those tuples.  Instead, it returns a bag of single-element 
> tuples, each of which contains one of the tuples passed in.
> Example:
> {code:title=tuples.txt}
> (mike,608)
> (ryan,11624)
> (justin,2317)
> {code}
> {code:title=Demonstration using pig 0.9.0}
> grunt> TUPLE_DATA = LOAD 'tuples.txt' AS (T:tuple(name:chararray,street_number:int));
> grunt> BAGGED = FOREACH TUPLE_DATA GENERATE TOBAG(T);
> grunt> DESCRIBE BAGGED;
> BAGGED: {{(name: chararray,street_number: int)}}
> grunt> DUMP BAGGED;
> ({((mike,608))})
> ({((ryan,11624))})
> ({((justin,2317))})
> {code}
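
To make the mismatch concrete, here is a minimal Python sketch that models the two behaviors side by side. It is not Pig's implementation: bags are modeled as lists, Pig tuples as Python tuples, and the function names `tobag_wrapping` and `tobag_expected` are hypothetical, chosen only for this illustration. Whether the attached patch takes this exact approach is not stated in the issue.

```python
def tobag_wrapping(*args):
    """Model of the reported 0.9.0 behavior (assumption: every argument,
    tuple or not, is wrapped in a new one-field tuple)."""
    return [(arg,) for arg in args]


def tobag_expected(*args):
    """Model of the behavior the reporter expects (assumption: tuple
    arguments go into the bag as-is, other values are wrapped)."""
    return [arg if isinstance(arg, tuple) else (arg,) for arg in args]


# Rows from tuples.txt in the issue description.
rows = [("mike", 608), ("ryan", 11624), ("justin", 2317)]

for row in rows:
    # First value mirrors the DUMP output above (a tuple nested in a tuple);
    # second value matches the DESCRIBE schema (a bag of two-field tuples).
    print(tobag_wrapping(row), tobag_expected(row))
```

In this model, only the second function yields bags whose tuples have the two fields (name, street_number) that DESCRIBE reports; for non-tuple arguments both functions behave the same.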

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
