[GitHub] spark pull request #17819: [SPARK-20542][ML][SQL] Add an API to Bucketizer t...

viirya Wed, 11 Oct 2017 00:02:41 -0700

Github user viirya commented on a diff in the pull request:

    https://github.com/apache/spark/pull/17819#discussion_r143926137
  
    --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/Bucketizer.scala ---
    @@ -24,20 +24,23 @@ import org.apache.spark.annotation.Since
     import org.apache.spark.ml.Model
     import org.apache.spark.ml.attribute.NominalAttribute
     import org.apache.spark.ml.param._
    -import org.apache.spark.ml.param.shared.{HasHandleInvalid, HasInputCol, HasOutputCol}
    +import org.apache.spark.ml.param.shared.{HasHandleInvalid, HasInputCol, HasInputCols, HasOutputCol}
     import org.apache.spark.ml.util._
     import org.apache.spark.sql._
     import org.apache.spark.sql.expressions.UserDefinedFunction
     import org.apache.spark.sql.functions._
     import org.apache.spark.sql.types.{DoubleType, StructField, StructType}
     
     /**
    - * `Bucketizer` maps a column of continuous features to a column of feature buckets.
    + * `Bucketizer` maps a column of continuous features to a column of feature buckets. Since 2.3.0,
    + * `Bucketizer` can also map multiple columns at once. Whether it goes to map a column or multiple
    --- End diff --
    
    Ok. Looks better.



