[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread tedyu
Github user tedyu commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60685910 --- Diff: core/src/test/scala/org/apache/spark/util/collection/unsafe/sort/RadixSortSuite.scala --- @@ -0,0 +1,264 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12490 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-213162459 Merging this into master, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-213134892 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-213134888 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-213133847 **[Test build #56581 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56581/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-213098364 **[Test build #56581 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56581/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60648020 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,32 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60647905 --- Diff: core/src/main/java/org/apache/spark/util/collection/unsafe/sort/RadixSort.java --- @@ -0,0 +1,253 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60647739 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Sort.scala --- @@ -139,11 +148,15 @@ case class Sort( val dataSize = metricTerm(ctx,

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60647734 --- Diff: core/src/test/scala/org/apache/spark/util/collection/unsafe/sort/PrefixComparatorsSuite.scala --- @@ -110,4 +112,12 @@ class PrefixComparatorsSuite

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212770972 LGTM overall, just some minor comments. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-21 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60532479 --- Diff: core/src/main/java/org/apache/spark/util/collection/unsafe/sort/RadixSort.java --- @@ -0,0 +1,253 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60529161 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,32 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60529190 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Sort.scala --- @@ -139,11 +148,15 @@ case class Sort( val dataSize =

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60528873 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,32 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60528592 --- Diff: core/src/test/scala/org/apache/spark/util/collection/unsafe/sort/PrefixComparatorsSuite.scala --- @@ -110,4 +112,12 @@ class

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60526773 --- Diff: core/src/main/java/org/apache/spark/util/collection/unsafe/sort/PrefixComparators.java --- @@ -28,88 +28,84 @@ public class PrefixComparators

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60526550 --- Diff: core/src/main/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorter.java --- @@ -87,18 +102,18 @@ public void expandPointerArray(LongArray

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread ericl
Github user ericl commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212691949 ptal --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212675549 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212675551 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212675352 **[Test build #56429 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56429/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212644491 **[Test build #56429 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56429/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212596602 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212596600 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212595870 **[Test build #56389 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56389/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212548929 **[Test build #56389 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56389/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212316191 **[Test build #56338 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56338/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212316213 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212316217 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-20 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212312260 **[Test build #56338 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56338/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212245315 Since we are using more memory, you should update the test UnsafeShuffleWriterSuite --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212245007 @ericl The precision is greater than 18, have you forgot to check the precision somewhere? --- If your project is set up for it, you can reply to this email and have

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212209007 It seems this is the failing test. @davies any idea why the decimal handling is wrong here? If I am understanding correctly when `dt.precision - dt.scale <=

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212203922 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212203921 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212203825 **[Test build #56296 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56296/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212178743 **[Test build #56296 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56296/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212177739 Done --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212169575 That makes sense. Can you put it in pr description as a todo for future pr? --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212169127 I was going to punt null handling for longs into another PR, unless you think it should be in this one? The only case that doesn't work is for LongType. --- If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212166485 Eric can you put the way you handle null in the pr description, and as some classdoc comment somewhere? --- If your project is set up for it, you can reply to this

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212163368 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212163374 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212163351 **[Test build #56288 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56288/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212162690 **[Test build #56288 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56288/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212161545 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212161537 **[Test build #56284 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56284/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212161543 Build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327136 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,29 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327140 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,29 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212161313 **[Test build #56284 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56284/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327131 --- Diff: sql/core/src/main/java/org/apache/spark/sql/execution/UnsafeKVExternalSorter.java --- @@ -85,13 +85,15 @@ public UnsafeKVExternalSorter(

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327152 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -88,6 +88,12 @@ object SQLConf { .booleanConf

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327119 --- Diff: core/src/test/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorterSuite.java --- @@ -70,14 +73,15 @@ public void testBasicSorting() throws

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327142 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,29 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327092 --- Diff: core/src/main/java/org/apache/spark/util/collection/unsafe/sort/RadixSort.java --- @@ -0,0 +1,253 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327080 --- Diff: core/src/main/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorter.java --- @@ -47,18 +48,25 @@ public int compare(PackedRecordPointer left,

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60327083 --- Diff: core/src/main/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorter.java --- @@ -148,7 +157,12 @@ public void loadNext() { * Return an

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-212036416 @ericl can you decouple the benchmark change from this pr? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60275221 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,29 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60268804 --- Diff: sql/core/src/main/java/org/apache/spark/sql/execution/UnsafeKVExternalSorter.java --- @@ -85,13 +85,15 @@ public UnsafeKVExternalSorter(

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60268406 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,29 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60268294 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/SortPrefixUtils.scala --- @@ -66,6 +66,29 @@ object SortPrefixUtils { }

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60267980 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -88,6 +88,12 @@ object SQLConf { .booleanConf

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60265931 --- Diff: core/src/main/java/org/apache/spark/util/collection/unsafe/sort/RadixSort.java --- @@ -0,0 +1,253 @@ +/* + * Licensed to the Apache

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60265463 --- Diff: core/src/main/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorter.java --- @@ -148,7 +157,12 @@ public void loadNext() { * Return

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60264951 --- Diff: core/src/main/java/org/apache/spark/util/collection/unsafe/sort/PrefixComparators.java --- @@ -28,88 +28,84 @@ public class

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60264341 --- Diff: core/src/main/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorter.java --- @@ -47,18 +48,25 @@ public int compare(PackedRecordPointer

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211785722 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211785718 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211785553 **[Test build #56201 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56201/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211784566 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211784569 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211784426 **[Test build #56200 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56200/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/12490#discussion_r60183184 --- Diff: core/src/test/java/org/apache/spark/shuffle/sort/ShuffleInMemorySorterSuite.java --- @@ -70,14 +73,15 @@ public void testBasicSorting() throws

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211757342 **[Test build #56201 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56201/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211756345 **[Test build #56200 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56200/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211755392 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211755389 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211755381 **[Test build #56199 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56199/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-19 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211754828 **[Test build #56199 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56199/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211715196 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211715186 **[Test build #56187 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56187/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211715199 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-18 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12490#issuecomment-211714809 **[Test build #56187 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56187/consoleFull)** for PR 12490 at commit

[GitHub] spark pull request: [SPARK-14724] Use radix sort for shuffles and ...

2016-04-18 Thread ericl
GitHub user ericl opened a pull request: https://github.com/apache/spark/pull/12490 [SPARK-14724] Use radix sort for shuffles and sort operator when possible ## What changes were proposed in this pull request? Spark currently uses TimSort for all in-memory sorts, including