GitHub user mmenestret opened a pull request:

    https://github.com/apache/spark/pull/7997

    SPARK-9690 Adding the possibility to set the seed of the rand in the …

    …CrossValidator fold
    
    The fold assignment in the ML CrossValidator depends on a rand whose seed 
    is set to 0, which leads the sql.functions rand to call 
    sc._jvm.functions.rand() with no seed.
    To make a cross validation unit-testable, it would be useful to be able to 
    set this seed so that the output of the cross validation (with 
    featureSubsetStrategy set to "all") is always the same.
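
    To illustrate why a settable seed matters, here is a minimal sketch in 
    plain Python rather than Spark. It assumes folds are assigned by drawing 
    one uniform random number per row and bucketing it into equal-width 
    intervals. The names assign_folds, n_rows, n_folds and seed are 
    hypothetical and are not part of this patch or of Spark's API.

```python
import random


def assign_folds(n_rows, n_folds, seed=None):
    """Assign each row index to a fold by bucketing a uniform random draw.

    Hypothetical helper for illustration only; this is not Spark code.
    """
    rng = random.Random(seed)  # seed=None means the generator is not reproducible
    h = 1.0 / n_folds          # width of each fold's interval within [0, 1)
    folds = []
    for _ in range(n_rows):
        r = rng.random()       # uniform draw in [0.0, 1.0)
        # Clamp to the last fold to guard against floating-point rounding.
        folds.append(min(int(r / h), n_folds - 1))
    return folds


# With a fixed seed, the same split is produced on every call.
first = assign_folds(n_rows=20, n_folds=3, seed=42)
second = assign_folds(n_rows=20, n_folds=3, seed=42)
```

    Without a seed, two runs would generally yield different fold 
    assignments, which is what makes the cross-validation output hard to 
    assert on in a unit test.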

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mmenestret/spark SPARK-9690-CrossValidator-Random-Seed

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/7997.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #7997
    
----
commit 480a5d081fe34bea42dc3c474a57a811947f483f
Author: Martin MENESTRET <mmenest...@ippon.fr>
Date:   2015-08-06T16:28:48Z

    SPARK-9690 Adding the possibility to set the seed of the rand in the CrossValidator fold

----


