[ https://issues.apache.org/jira/browse/PIG-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13106873#comment-13106873 ]
Ryan Hoegg commented on PIG-2290:
---------------------------------

This is probably true, but the current implementation behaves inconsistently (output does not match schema). Plus, it makes it impossible to generate a bag containing tuples with more than one element.

Is 0.9 -> 0.10 considered a "major version upgrade"?

> TOBAG wraps tuple parameters in another tuple
> ---------------------------------------------
>
>                 Key: PIG-2290
>                 URL: https://issues.apache.org/jira/browse/PIG-2290
>             Project: Pig
>          Issue Type: Bug
>          Components: internal-udfs
>    Affects Versions: 0.9.0
>            Reporter: Ryan Hoegg
>            Assignee: Dmitriy V. Ryaboy
>         Attachments: pig-2290.patch
>
>
> The TOBAG function indiscriminately wraps all parameters in a tuple. When I
> pass a list of tuples to the function, I would expect it to return a bag
> containing those tuples. Instead, it returns a bag containing single element
> tuples, where each tuple contains one of the tuples passed in.
> Example:
> {code:title=tuples.txt}
> (mike,608)
> (ryan,11624)
> (justin,2317)
> {code}
> {code:title=Demonstration using pig 0.9.0}
> grunt> TUPLE_DATA = LOAD 'tuples.txt' AS
> (T:tuple(name:chararray,street_number:int));
> grunt> BAGGED = FOREACH TUPLE_DATA GENERATE TOBAG(T);
> grunt> DESCRIBE BAGGED;
> BAGGED: {{(name: chararray,street_number: int)}}
> grunt> DUMP BAGGED;
> ({((mike,608))})
> ({((ryan,11624))})
> ({((justin,2317))})
> {code}
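
For readers without a Pig install handy, the difference between the reported and the expected result can be sketched as a toy model in Python. This is only an illustration under stated assumptions: bags are modeled as Python lists, Pig tuples as Python tuples, and the names `tobag_observed` and `tobag_expected` are hypothetical. It is not Pig's TOBAG implementation, nor the attached patch.

```python
# Toy model only: a bag is a Python list, a Pig tuple is a Python tuple.
# Both function names are hypothetical and exist purely for illustration.

def tobag_observed(*args):
    """Mimic the behavior reported for 0.9.0: wrap every argument in a new 1-tuple."""
    return [(arg,) for arg in args]


def tobag_expected(*args):
    """Mimic the behavior the reporter expects: tuples pass through, other values are wrapped."""
    return [arg if isinstance(arg, tuple) else (arg,) for arg in args]


row = ("mike", 608)
print(tobag_observed(row))  # [(('mike', 608),)]  extra level of nesting, as in the DUMP output
print(tobag_expected(row))  # [('mike', 608)]     matches the schema shown by DESCRIBE
```

In this model a non-tuple argument is still wrapped, so a call such as `tobag_expected("mike")` yields a bag of single-element tuples; only arguments that are already tuples are left alone.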