[ 
https://issues.apache.org/jira/browse/SPARK-20199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15966926#comment-15966926
 ] 

Yan Facai (颜发才) commented on SPARK-20199:
-----------------------------------------

It's not hard, and I can work on it.

However, there are two possible solutions:

1. Add a `setFeatureSubsetStrategy` method to DecisionTree, so that GBT creates 
each DecisionTree through that method, with code like 
`val dt = new DecisionTreeRegressor().setFeatureSubsetStrategy(xxx)`.

2. Add a `featureSubsetStrategy` param to the `train` method of DecisionTree. 
This requires minimal changes.

Which one is better? I prefer the first.
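
To make the difference concrete, here is a rough sketch of the two API shapes. 
It does not use Spark at all: `SketchTreeRegressor`, `SketchTreeTrainer` and 
`featuresPerNode` are hypothetical stand-ins made up for illustration, and the 
strategy names and rounding are assumptions, not a copy of the real implementation.

```scala
// Hypothetical stand-ins, NOT Spark's real classes. They only model the
// two API shapes discussed above.
object FeatureSubsetSketch {

  // Illustrative mapping from a strategy name to the number of features
  // considered per split (names and rounding are assumptions).
  def featuresPerNode(strategy: String, numFeatures: Int): Int = strategy match {
    case "all"      => numFeatures
    case "sqrt"     => math.ceil(math.sqrt(numFeatures.toDouble)).toInt
    case "onethird" => math.ceil(numFeatures / 3.0).toInt
    case other      => throw new IllegalArgumentException(s"Unknown strategy: $other")
  }

  // Option 1: the strategy is a settable param on the tree estimator, so a
  // caller such as GBT configures the tree before training it.
  final class SketchTreeRegressor {
    private var featureSubsetStrategy: String = "all"

    def setFeatureSubsetStrategy(value: String): this.type = {
      featureSubsetStrategy = value
      this
    }

    // Training reads the strategy from the estimator's own state.
    def train(numFeatures: Int): Int =
      featuresPerNode(featureSubsetStrategy, numFeatures)
  }

  // Option 2: the strategy is an extra argument of `train`, with a default
  // so that existing callers keep working unchanged.
  object SketchTreeTrainer {
    def train(numFeatures: Int, featureSubsetStrategy: String = "all"): Int =
      featuresPerNode(featureSubsetStrategy, numFeatures)
  }
}
```

With option 1 a caller writes 
`new SketchTreeRegressor().setFeatureSubsetStrategy("sqrt").train(16)`; with 
option 2 it writes `SketchTreeTrainer.train(16, "sqrt")`. Both select the same 
number of features; they differ only in where the setting lives.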

> GradientBoostedTreesModel doesn't have Column Sampling Rate Parameter
> ---------------------------------------------------------------------
>
>                 Key: SPARK-20199
>                 URL: https://issues.apache.org/jira/browse/SPARK-20199
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML, MLlib
>    Affects Versions: 2.1.0
>            Reporter: pralabhkumar
>
> Spark GradientBoostedTreesModel doesn't have a column sampling rate 
> parameter. This parameter is available in H2O and XGBoost. 
> Sample from H2O.ai: 
> gbmParams._col_sample_rate
> Please provide the parameter. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
