Github user holdenk commented on the pull request:

    https://github.com/apache/incubator-spark/pull/572#issuecomment-34671873
  
    Sure, I'll take a look at that tonight. From the earlier pull request that
    was abandoned someone had asked that its PartionedRDD (which only did it
    for k=2) be in the core rather than mllib.
    
    
    On Mon, Feb 10, 2014 at 11:04 AM, Xiangrui Meng 
<notificati...@github.com>wrote:
    
    > @holdenk <https://github.com/holdenk> , the PartitionwiseSampledRDD was
    > designed with this use case in mind. Both the folded RDD and its 
complement
    > can be represented by PartitionwiseSampledRDD with BernoulliSamplers. Do
    > you mind modifying your code to use it? Also, cross-validation is a 
machine
    > learning specific operation. spark.rdd.RDD may not be a good place for it.
    >
    > --
    > Reply to this email directly or view it on 
GitHub<https://github.com/apache/incubator-spark/pull/572#issuecomment-34668194>
    > .
    >
    
    
    
    -- 
    Cell : 425-233-8271

Reply via email to