[ 
https://issues.apache.org/jira/browse/PIG-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15904213#comment-15904213
 ] 

liyunzhang_intel commented on PIG-5167:
---------------------------------------

[~rohini],[~xuefuz]:  the order of the result of "distinct" in pig on spark is 
not the order of input, which is different from mr or tez.  Currently, we want 
to add "order" after "distinct" to make the test pass, do you think it is 
suitable or can you provide a better approach?

> Limit_4 is failing with spark exec type
> ---------------------------------------
>
>                 Key: PIG-5167
>                 URL: https://issues.apache.org/jira/browse/PIG-5167
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: Nandor Kollar
>            Assignee: Nandor Kollar
>             Fix For: spark-branch
>
>         Attachments: PIG-5167.patch
>
>
> results are different:
> {code}
> diff <(head -n 5 Limit_4.out/out_sorted) <(head -n 5 
> Limit_4_benchmark.out/out_sorted)
> 1,5c1,5
> <     50      3.00
> <     74      2.22
> < alice carson        66      2.42
> < alice quirinius     71      0.03
> < alice van buren     28      2.50
> ---
> > bob allen           0.28
> > bob allen   22      0.92
> > bob allen   25      2.54
> > bob allen   26      2.35
> > bob allen   27      2.17
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to