[
https://issues.apache.org/jira/browse/PIG-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106873#comment-13106873
]
Ryan Hoegg commented on PIG-2290:
---------------------------------
This is probably true, but the current implementation behaves inconsistently
(output does not match schema). Plus, it makes it impossible to generate a bag
containing tuples with more than one element.
Is 0.9 -> 0.10 considered a "major version upgrade"?
> TOBAG wraps tuple parameters in another tuple
> ---------------------------------------------
>
> Key: PIG-2290
> URL: https://issues.apache.org/jira/browse/PIG-2290
> Project: Pig
> Issue Type: Bug
> Components: internal-udfs
> Affects Versions: 0.9.0
> Reporter: Ryan Hoegg
> Assignee: Dmitriy V. Ryaboy
> Attachments: pig-2290.patch
>
>
> The TOBAG function indiscriminately wraps all parameters in a tuple. When I
> pass a list of tuples to the function, I would expect it to return a bag
> containing those tuples. Instead, it returns a bag containing single element
> tuples, where each tuple contains one of the tuples passed in.
> Example:
> {code:title=tuples.txt}
> (mike,608)
> (ryan,11624)
> (justin,2317)
> {code}
> {code:title=Demonstration using pig 0.9.0}
> grunt> TUPLE_DATA = LOAD 'tuples.txt' AS
> (T:tuple(name:chararray,street_number:int));
> grunt> BAGGED = FOREACH TUPLE_DATA GENERATE TOBAG(T);
> grunt> DESCRIBE BAGGED;
> BAGGED: {{(name: chararray,street_number: int)}}
> grunt> DUMP BAGGED;
> ({((mike,608))})
> ({((ryan,11624))})
> ({((justin,2317))})
> {code}
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira