[ https://issues.apache.org/jira/browse/SPARK-6004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14337604#comment-14337604 ]
Liang-Chi Hsieh commented on SPARK-6004: ---------------------------------------- Why do we want to stop early? I think we want to get the best model not the model at the training moment first showing decreasing performance on validation dataset? If you need to tune the model, in practice you would tune the iteration number, not the validationTol. But it is still okay to have an option for it, if not a default behavior. > Pick the best model when training GradientBoostedTrees with validation > ---------------------------------------------------------------------- > > Key: SPARK-6004 > URL: https://issues.apache.org/jira/browse/SPARK-6004 > Project: Spark > Issue Type: Improvement > Components: MLlib > Reporter: Liang-Chi Hsieh > Priority: Minor > > Since the validation error does not change monotonically, in practice, it > should be proper to pick the best model when training GradientBoostedTrees > with validation instead of stopping it early. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org