Github user jkbradley commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21129#discussion_r189052646
  
    --- Diff: mllib/src/main/scala/org/apache/spark/ml/tree/treeParams.scala ---
    @@ -460,18 +461,37 @@ private[ml] trait RandomForestRegressorParams
      *
      * Note: Marked as private and DeveloperApi since this may be made public 
in the future.
      */
    -private[ml] trait GBTParams extends TreeEnsembleParams with HasMaxIter 
with HasStepSize {
    +private[ml] trait GBTParams extends TreeEnsembleParams with HasMaxIter 
with HasStepSize
    +  with HasValidationIndicatorCol {
     
    -  /* TODO: Add this doc when we add this param.  SPARK-7132
    -   * Threshold for stopping early when runWithValidation is used.
    +  /**
    +   * Threshold for stopping early when fit with validation is used.
        * If the error rate on the validation input changes by less than the 
validationTol,
    --- End diff --
    
    This doc is not quite accurate.  Can you please update it to:
    ```
    Threshold for stopping early when fit with validation is used.
    (This parameter is ignored when fit without validation is used.)
    The decision to stop early is decided based on this logic:
    If the current loss on the validation set is greater than 0.01, the diff
    of validation error is compared to relative tolerance which is
    validationTol * (current loss on the validation set).
    If the current loss on the validation set is less than or equal to 0.01,
    the diff of validation error is compared to absolute tolerance which is
    validationTol * 0.01.
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to