Koji Noguchi created PIG-5370:
---------------------------------
Summary: Union onschema + columnprune dropping used fields
Key: PIG-5370
URL: https://issues.apache.org/jira/browse/PIG-5370
Project: Pig
Issue Type: Task
Reporter: Koji Noguchi
Assignee: Koji Noguchi
After PIG-5312, the query below started returning incorrect results.
{code}
A = load 'input1.txt' as (a1:int, a2:chararray, a3:int);
B = FOREACH (GROUP A by (a1,a2)) {
A_FOREACH = FOREACH A GENERATE a2,a3;
GENERATE A, FLATTEN(A_FOREACH) as (a2,a3);
}
C = load 'input2.txt' as (A:bag{tuple:(a1:int, a2:chararray, a3:int)}, a2:chararray, a3:int);
D = UNION ONSCHEMA B, C;
dump D;
{code}
{code:title=input1.txt}
1 a 3
2 b 4
2 c 5
1 a 6
2 b 7
1 c 8
{code}
{code:title=input2.txt}
{(10,a0,30),(20,b0,40)} zzz 222
{code}
{noformat:title=Expected output}
({(10,a0,30),(20,b0,40)},zzz,222)
({(1,a,6),(1,a,3)},a,6)
({(1,a,6),(1,a,3)},a,3)
({(1,c,8)},c,8)
({(2,b,7),(2,b,4)},b,7)
({(2,b,7),(2,b,4)},b,4)
({(2,c,5)},c,5)
{noformat}
{noformat:title=Actual (incorrect) output}
({(10,a0,30),(20,b0,40)}) ****ONLY 1 Field ****
({(1,a,6),(1,a,3)},a,6)
({(1,a,6),(1,a,3)},a,3)
({(1,c,8)},c,8)
({(2,b,7),(2,b,4)},b,7)
({(2,b,7),(2,b,4)},b,4)
({(2,c,5)},c,5)
{noformat}