[ https://issues.apache.org/jira/browse/PIG-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15904213#comment-15904213 ]
liyunzhang_intel commented on PIG-5167:
---------------------------------------

[~rohini], [~xuefuz]: With Pig on Spark, the result of "distinct" does not preserve the order of the input, which differs from the behavior on MR and Tez. To make the test pass, we currently plan to add an "order" after the "distinct". Do you think that is suitable, or can you suggest a better approach?

> Limit_4 is failing with spark exec type
> ---------------------------------------
>
>                 Key: PIG-5167
>                 URL: https://issues.apache.org/jira/browse/PIG-5167
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: Nandor Kollar
>            Assignee: Nandor Kollar
>             Fix For: spark-branch
>
>         Attachments: PIG-5167.patch
>
>
> results are different:
> {code}
> diff <(head -n 5 Limit_4.out/out_sorted) <(head -n 5 Limit_4_benchmark.out/out_sorted)
> 1,5c1,5
> < 50 3.00
> < 74 2.22
> < alice carson 66 2.42
> < alice quirinius 71 0.03
> < alice van buren 28 2.50
> ---
> > bob allen 0.28
> > bob allen 22 0.92
> > bob allen 25 2.54
> > bob allen 26 2.35
> > bob allen 27 2.17
> {code}

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)