[ https://issues.apache.org/jira/browse/PIG-1184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Dai updated PIG-1184:
----------------------------

    Resolution: Fixed
    Hadoop Flags: [Reviewed]
    Status: Resolved  (was: Patch Available)

Patch committed.

> PruneColumns optimization does not handle the case of foreach flatten correctly if flattened bag is not used later
> ------------------------------------------------------------------------------------------------------------------
>
>                 Key: PIG-1184
>                 URL: https://issues.apache.org/jira/browse/PIG-1184
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.6.0
>            Reporter: Pradeep Kamath
>            Assignee: Daniel Dai
>             Fix For: 0.7.0
>
>         Attachments: PIG-1184-1.patch, PIG-1184-2.patch
>
>
> The following script:
> {noformat}
> -e "a = load 'input.txt' as (f1:chararray, f2:chararray, f3:bag{t:tuple(id:chararray)}, f4:bag{t:tuple(loc:chararray)}); b = foreach a generate f1, f2, flatten(f3), flatten(f4), 10; b = foreach b generate f1, f2, \$4; dump b;"
> {noformat}
> gives the following result:
> (oiue,M,10)
> {noformat}
> cat input.txt:
> oiue M {(3),(4)} {(toronto),(montreal)}
> {noformat}
> If the PruneColumns optimization is disabled, we get the right result:
> (oiue,M,10)
> (oiue,M,10)
> (oiue,M,10)
> (oiue,M,10)
> The flatten results in 4 records, so the output should contain 4 records.
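
The expected record count in the report can be checked with a small simulation. What follows is a minimal Python sketch, not Pig code and not the committed patch. It assumes Pig's documented behaviour that flattening several bags in one foreach produces their cross product. The function name `foreach_flatten` and the variable names are hypothetical, introduced only for illustration.

```python
from itertools import product

def foreach_flatten(f1, f2, f3, f4, const=10):
    """Mimic: foreach a generate f1, f2, flatten(f3), flatten(f4), 10 (hypothetical helper)."""
    # Each bag is modelled as a list of tuples. Flattening two bags in the
    # same foreach pairs every tuple of f3 with every tuple of f4.
    return [(f1, f2) + t3 + t4 + (const,) for t3, t4 in product(f3, f4)]

# The single input record from input.txt in the report.
row = ("oiue", "M", [("3",), ("4",)], [("toronto",), ("montreal",)])

b = foreach_flatten(*row)

# Second foreach: keep f1, f2 and $4 (the constant), dropping the flattened columns.
projected = [(r[0], r[1], r[4]) for r in b]

for r in projected:
    print(r)
```

In this model, dropping the flattened columns changes which fields each record carries but not how many records there are, which matches the four-record result the report describes as correct.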