GitHub user mmenestret opened a pull request: https://github.com/apache/spark/pull/7997
SPARK-9690 Adding the possibility to set the seed of the rand in the ⦠â¦CrossValidator fold The fold in the ML CrossValidator depends on a rand whose seed is set to 0 and it leads the sql.functions rand to call sc._jvm.functions.rand() with no seed. In order to be able to unit test a Cross Validation it would be a good idea to be able to set this seed so the output of the cross validation (with a featureSubsetStrategy set to "all") would always be the same. You can merge this pull request into a Git repository by running: $ git pull https://github.com/mmenestret/spark SPARK-9690-CrossValidator-Random-Seed Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/7997.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #7997 ---- commit 480a5d081fe34bea42dc3c474a57a811947f483f Author: Martin MENESTRET <mmenest...@ippon.fr> Date: 2015-08-06T16:28:48Z SPARK-9690 Adding the possibility to set the seed of the rand in the CrossValidator fold ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org