Koji Noguchi created PIG-3641: --------------------------------- Summary: Split "otherwise" producing incorrect output when combined with ColumnPruning Key: PIG-3641 URL: https://issues.apache.org/jira/browse/PIG-3641 Project: Pig Issue Type: Bug Affects Versions: 0.11.1, 0.12.0, 0.10.0, 0.13.0 Reporter: Koji Noguchi Assignee: Koji Noguchi
Our user was observing incorrect outputs depending on if the query had intermediate output or not. Below is a simplified testcase I came up with. {noformat} knoguchi pig> cat test.txt 9,1,ignored 9,1,ignored 9,1,ignored knoguchi pig> cat bz-6590644/test.pig A = load 'test.txt' using PigStorage(',') as (a1:int, a2:int, a3:chararray); B = foreach A generate a1,a2; SPLIT B into C1 if a2 == 1, D1 otherwise; C2 = foreach C1 generate a2; store C2 into '/tmp/testC'; store D1 into '/tmp/testD'; knoguchi@nameother-lm pig> {noformat} Incorrect output shown below. /tmp/testD should be empty but somehow has data in it. {noformat} knoguchi@nameother-lm pig> cat /tmp/testC/part-m-00000 1 1 1 knoguchi pig> cat /tmp/testD/part-m-00000 9 1 9 1 9 1 knoguchi pig> {noformat} -- This message was sent by Atlassian JIRA (v6.1.5#6160)