[ https://issues.apache.org/jira/browse/PIG-272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Arun C Murthy updated PIG-272:
------------------------------
Attachment: split.pl
            PIG-272_test.pig
            PIG-272_0_20080621.patch
Attached fix. The patch ensures we deep-copy the StreamingCommand before
optimizing it and reverts the optimization piecemeal (i.e., for input and output
separately).
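
For anyone who has not read the patch, here is a rough sketch of the idea. It is
not the patch itself: the Command class, the "text"/"binary" spec values, and
the optimize() signature are all hypothetical stand-ins, assumed only for
illustration. The point is that the optimizer works on a private copy of a
command, and that the input and output sides are reverted independently.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class DeepCopySketch {

    /** Hypothetical stand-in for a streaming command; not Pig's actual class. */
    static final class Command {
        final String executable;
        final List<String> inputSpecs;   // mutable, so a shallow copy would alias it
        final List<String> outputSpecs;

        Command(String executable, List<String> inputSpecs, List<String> outputSpecs) {
            this.executable = executable;
            // Copy the lists so that no two Command objects share mutable state.
            this.inputSpecs = new ArrayList<>(inputSpecs);
            this.outputSpecs = new ArrayList<>(outputSpecs);
        }

        /** Returns a copy whose spec lists are independent of this object's. */
        Command deepCopy() {
            return new Command(executable, inputSpecs, outputSpecs);
        }
    }

    /**
     * Optimizes a private copy of the command. Both sides are optimized first,
     * then each side is reverted on its own if it turns out not to be safe.
     * The inputOk/outputOk flags are assumed inputs for this sketch.
     */
    static Command optimize(Command original, boolean inputOk, boolean outputOk) {
        Command copy = original.deepCopy();
        copy.inputSpecs.set(0, "binary");
        copy.outputSpecs.set(0, "binary");
        if (!inputOk) {
            copy.inputSpecs.set(0, original.inputSpecs.get(0));   // revert input only
        }
        if (!outputOk) {
            copy.outputSpecs.set(0, original.outputSpecs.get(0)); // revert output only
        }
        return copy;
    }

    public static void main(String[] args) {
        Command shared = new Command("perl identity.pl",
                Arrays.asList("text"), Arrays.asList("text"));
        Command optimized = optimize(shared, true, false);
        System.out.println("shared:    in=" + shared.inputSpecs + " out=" + shared.outputSpecs);
        System.out.println("optimized: in=" + optimized.inputSpecs + " out=" + optimized.outputSpecs);
    }
}
```

Because the original object is never mutated, any other operator that still
refers to it keeps seeing the unoptimized specs.
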
The test cases are quite complex/convoluted and are pretty hard to convert to
unit tests, which is why I've attached them here and propose we integrate them
into our end-to-end tests.
> Failure running complex script with streaming
> ---------------------------------------------
>
> Key: PIG-272
> URL: https://issues.apache.org/jira/browse/PIG-272
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Arun C Murthy
> Attachments: PIG-272_0_20080621.patch, PIG-272_test.pig, split.pl
>
>
> The following script fails (stack is further down):
> define CMD `perl identity.pl`;
> define CMD1 `perl identity.pl`;
> A = load '/user/pig/tests/data/singlefile/studenttab10k' as (name, age, gpa);
> B = stream A through CMD;
> store B into 'B1';
> C = stream B through CMD1;
> D = JOIN B by name, C by name;
> store D into 'D1';
> If I remove the intermediate store, the script works fine. Also if I replace
> streaming commands with other operators such as filter and foreach, it works
> even with the intermediate store.