[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629022 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629030 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629019 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -38,13 +41,45 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629028 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629123 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629117 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629129 --- Diff: core/src/test/java/org/apache/spark/JavaAPISuite.java --- @@ -140,10 +140,9 @@ public void intersection() { public void sample() {

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629120 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629128 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629116 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19629166 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19638435 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19638500 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -52,57 +87,252 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61191942 @mengxr, I changed `fractionEpsilon` to `rngEpsilon`, which is more suggestive of its purpose. I also updated its documentation, which I think is also now more

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61192204 [Test build #22576 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22576/consoleFull) for PR 2455 at commit

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61199382 [Test build #22576 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22576/consoleFull) for PR 2455 at commit

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61220683 LGTM. Merged into master. Thanks for implementing gap sampling!

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-30 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/2455

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-29 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-60978274 Was about to push, but it looks like the commit for SPARK-4022 broke my updates, so I'm going to have to make more edits to rebase

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-29 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61005185 [Test build #22475 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22475/consoleFull) for PR 2455 at commit

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-29 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61014883 [Test build #22475 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22475/consoleFull) for PR 2455 at commit

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-29 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61014891 Test PASSed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-29 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-61031984 @mengxr latest updates are rebased and passing Jenkins

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-60789726 @erikerlandson The feature freeze deadline for v1.2 is this Sat. Just want to check with you and see whether you are going to update the PR this week.

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-28 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-60793304 @mengxr, coincidentally I'm working through the PR comments today; I plan to have an update pushed this evening

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-28 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-60793728 @erikerlandson Great! Thanks for the heads up.

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-28 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19509062 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-28 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19509239 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-28 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19509394 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-17 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r19047174 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-13 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18790724 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-13 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18791243 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536661 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536639 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536657 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536653 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536673 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536693 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536679 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536671 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536659 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536697 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536688 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536690 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536685 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18536682 --- Diff: core/src/test/scala/org/apache/spark/util/random/RandomSamplerSuite.scala --- @@ -18,96 +18,547 @@ package org.apache.spark.util.random

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-07 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18537414 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57865917 Jenkins, add to whitelist.

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57865927 test this please

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57866985 Test FAILed. Refer to this link for build results (access rights to CI server needed):

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423427 --- Diff: core/src/main/scala/org/apache/spark/rdd/RDD.scala --- @@ -375,7 +376,9 @@ abstract class RDD[T: ClassTag]( val sum = weights.sum

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423440 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423426 --- Diff: core/src/main/scala/org/apache/spark/rdd/RDD.scala --- @@ -43,7 +43,8 @@ import org.apache.spark.partial.PartialResult import

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423449 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423438 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423429 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423444 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423475 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423477 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423453 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423474 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423459 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423463 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423478 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423454 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423479 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423461 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423433 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423437 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423464 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423484 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423470 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423457 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423473 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423487 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423485 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423495 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423492 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423489 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423468 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423448 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423443 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -39,13 +42,46 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423499 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423504 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423498 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423491 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423493 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18423500 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +89,238 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57874204 @erikerlandson I didn't check the test code. I will try to find another time to make a pass on the test. The implementation looks good to me except minor inline comments.

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-03 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57883154 @mengxr I'll be occupied next week but I'll try to respond asap to your feedback the week after

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-10-02 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57628456 test this please

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-30 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18202539 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -43,9 +46,34 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-30 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18202776 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +81,237 @@ trait RandomSampler[T, U] extends Pseudorandom with

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-30 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18241039 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +81,237 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-30 Thread erikerlandson
Github user erikerlandson commented on the pull request: https://github.com/apache/spark/pull/2455#issuecomment-57395923 test this please

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-29 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18188816 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +81,237 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-29 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18188928 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +81,237 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-29 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18189003 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -43,9 +46,34 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-29 Thread erikerlandson
Github user erikerlandson commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18189085 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +81,237 @@ trait RandomSampler[T, U] extends Pseudorandom

[GitHub] spark pull request: [SPARK-3250] Implement Gap Sampling optimizati...

2014-09-28 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/2455#discussion_r18135108 --- Diff: core/src/main/scala/org/apache/spark/util/random/RandomSampler.scala --- @@ -53,56 +81,237 @@ trait RandomSampler[T, U] extends Pseudorandom with
