[ 
https://issues.apache.org/jira/browse/PIG-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15940461#comment-15940461
 ] 

liyunzhang_intel commented on PIG-5158:
---------------------------------------

[~nkollar]: LGTM.
One question about the difference between Limit_5 and Limit_12:
Limit_5
{code}
a = load ':INPATH:/singlefile/studenttab10k';
b = load ':INPATH:/singlefile/votertab10k';
a1 = foreach a generate $0, $1;
b1 = foreach b generate $0, $1;
c = union a1, b1;
d = limit c 100;
store d into ':OUTPATH:';
{code}

Limit_12
{code}
a = load ':INPATH:/singlefile/studenttab10k';
b = load ':INPATH:/singlefile/studenttab10k';
a1 = foreach a generate $0, $1;
b1 = foreach b generate $0, $1;
c = union a1, b1;
d = limit c 100;
store d into ':OUTPATH:';
{code}

In Limit_12, we load the same file for both a and b. Is that so we can avoid
the nondeterministic ordering of union output in Spark mode?

> Several e2e tests are marked to run only in Tez or MR mode only
> ---------------------------------------------------------------
>
>                 Key: PIG-5158
>                 URL: https://issues.apache.org/jira/browse/PIG-5158
>             Project: Pig
>          Issue Type: Task
>          Components: spark
>            Reporter: Nandor Kollar
>            Assignee: Nandor Kollar
>             Fix For: spark-branch
>
>         Attachments: PIG-5158_2.patch, PIG-5158_3.patch, PIG-5158.patch
>
>
> While executing the e2e tests in spark mode, I noticed that several tests are 
> marked with 'execonly' => 'mapred,local' or 'execonly' => 'mapred,tez'. Revise 
> these tests and add spark to those where it makes sense to test on Spark.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
