[
https://issues.apache.org/jira/browse/PIG-5341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16500838#comment-16500838
]
Koji Noguchi commented on PIG-5341:
-----------------------------------
Attached {{pig-5341-v01.patch}} that fixes the off-by-one bug in determining
which fields to keep.
Also, tagfile/tagpath was always being added to tuple even when columnpruning
was setting {{mRequiredColumns[0]}} to false.
Separate from this, found {{testColumnPrune}} wasn't really comparing the
output. Fixed that as well.
> PigStorage with -tagFile/-tagPath produces incorrect results with column
> pruning
> --------------------------------------------------------------------------------
>
> Key: PIG-5341
> URL: https://issues.apache.org/jira/browse/PIG-5341
> Project: Pig
> Issue Type: Bug
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Priority: Critical
> Attachments: pig-5341-v01.patch
>
>
> I don't know why we didn't see this till now.
> {code}
> A = load 'test.txt' using PigStorage('\t', '-tagFile') as
> (filename:chararray, a0:int, a1:int, a2:int, a3:int);
> B = FOREACH A GENERATE a0,a2;
> dump B;
> {code}
> Input
> {noformat}
> knoguchi@pig > cat test.txt
> 0 1 2 3
> 0 1 2 3
> 0 1 2 3
> {noformat}
> Expected Results
> {noformat}
> (0,2)
> (0,2)
> (0,2)
> {noformat}
> Actual Results
> {noformat}
> (,1)
> (,1)
> (,1)
> {noformat}
> This is really bad...
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)