[ https://issues.apache.org/jira/browse/PIG-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15904431#comment-15904431 ]
liyunzhang_intel edited comment on PIG-5167 at 3/10/17 8:17 AM: ---------------------------------------------------------------- [~knoguchi]: thanks your suggestion! [~nkollar]: try to add sortArgs to original script, i have not tested it and hope it works {code} { 'num' => 4, 'pig' =>q\a = load ':INPATH:/singlefile/studentnulltab10k'; b = distinct a; c = limit b 100; store c into ':OUTPATH:';\, + 'sortArgs' => ['-t', ' ', '-k', '1,2'], }, {code} was (Author: kellyzly): [~knoguchi]: thanks your suggestion! [~szita]: try to add sortArgs to original script, i have not tested it and hope it works {code} { 'num' => 4, 'pig' =>q\a = load ':INPATH:/singlefile/studentnulltab10k'; b = distinct a; c = limit b 100; store c into ':OUTPATH:';\, + 'sortArgs' => ['-t', ' ', '-k', '1,2'], }, {code} > Limit_4 is failing with spark exec type > --------------------------------------- > > Key: PIG-5167 > URL: https://issues.apache.org/jira/browse/PIG-5167 > Project: Pig > Issue Type: Sub-task > Components: spark > Reporter: Nandor Kollar > Assignee: Nandor Kollar > Fix For: spark-branch > > Attachments: PIG-5167.patch > > > results are different: > {code} > diff <(head -n 5 Limit_4.out/out_sorted) <(head -n 5 > Limit_4_benchmark.out/out_sorted) > 1,5c1,5 > < 50 3.00 > < 74 2.22 > < alice carson 66 2.42 > < alice quirinius 71 0.03 > < alice van buren 28 2.50 > --- > > bob allen 0.28 > > bob allen 22 0.92 > > bob allen 25 2.54 > > bob allen 26 2.35 > > bob allen 27 2.17 > {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346)