[ https://issues.apache.org/jira/browse/PIG-5224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16024320#comment-16024320 ]
Daniel Dai commented on PIG-5224: --------------------------------- The inserted LOForEach remove all the columns which are not used in the scripts going forward. The next LOForEach is not necessary doing that. I believe this is not for performance reason (The performance gain for removing several columns might be debatable), this is to make ColumnPruner simpler. > Extra foreach from ColumnPrune preventing Accumulator usage > ----------------------------------------------------------- > > Key: PIG-5224 > URL: https://issues.apache.org/jira/browse/PIG-5224 > Project: Pig > Issue Type: Improvement > Reporter: Koji Noguchi > Assignee: Koji Noguchi > Attachments: pig-5224-v0-testonly.patch, pig-5224-v1.patch > > > {code} > A = load 'input' as (id:int, fruit); > B = foreach A generate id; -- to enable columnprune > C = group B by id; > D = foreach C { > o = order B by id; > generate org.apache.pig.test.utils.AccumulatorBagCount(o); > } > STORE D into ... > {code} > Pig fails to use Accumulator interface for this UDF. -- This message was sent by Atlassian JIRA (v6.3.15#6346)