[ 
https://issues.apache.org/jira/browse/SPARK-6004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14337604#comment-14337604
 ] 

Liang-Chi Hsieh commented on SPARK-6004:
----------------------------------------

Why would we want to stop early? I think we want the best model, not the 
model from the first iteration at which performance on the validation 
dataset degrades. If you need to tune the model, in practice you would tune 
the number of iterations, not validationTol. It is still fine to offer early 
stopping as an option, as long as it is not the default behavior.
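
To make the difference concrete, here is a minimal sketch in plain Python. It 
is not MLlib code: the function names and the error values are hypothetical, 
and the stopping rule is a simplified stand-in for a validationTol-style check 
(stop as soon as the improvement over the previous iteration drops below a 
tolerance). It only illustrates that, when validation error is not monotonic, 
stopping at the first degradation can keep fewer trees than the best 
iteration would.

```python
from typing import Sequence


def early_stop_num_trees(val_errors: Sequence[float], tol: float = 0.0) -> int:
    """Number of trees kept when training stops at the first iteration whose
    improvement over the previous iteration is smaller than `tol`.

    Assumption: a simplified stopping rule, not the actual MLlib check.
    """
    if not val_errors:
        raise ValueError("val_errors must not be empty")
    for i in range(1, len(val_errors)):
        improvement = val_errors[i - 1] - val_errors[i]
        if improvement < tol:
            # Iteration i did not improve enough: keep iterations 0..i-1.
            return i
    return len(val_errors)


def best_num_trees(val_errors: Sequence[float]) -> int:
    """Number of trees kept when the iteration with the lowest validation
    error is selected after training through all iterations."""
    if not val_errors:
        raise ValueError("val_errors must not be empty")
    best_index = min(range(len(val_errors)), key=lambda i: val_errors[i])
    return best_index + 1


if __name__ == "__main__":
    # Hypothetical, non-monotonic validation errors, one per boosting iteration.
    errors = [0.40, 0.31, 0.33, 0.27, 0.25, 0.29]
    print("early stopping keeps", early_stop_num_trees(errors), "trees")
    print("best-model selection keeps", best_num_trees(errors), "trees")
```

With these made-up errors, the early-stopping rule halts at the temporary 
bump in the third iteration, while best-model selection trains through it and 
keeps the later, lower-error iteration.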


> Pick the best model when training GradientBoostedTrees with validation
> ----------------------------------------------------------------------
>
>                 Key: SPARK-6004
>                 URL: https://issues.apache.org/jira/browse/SPARK-6004
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>            Reporter: Liang-Chi Hsieh
>            Priority: Minor
>
> Since the validation error does not change monotonically, in practice it 
> is better to pick the best model when training GradientBoostedTrees with 
> validation instead of stopping early.


