[ https://issues.apache.org/jira/browse/SPARK-20199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16025932#comment-16025932 ]
pralabhkumar commented on SPARK-20199: -------------------------------------- 1) Have Created pull request. Basically Moved 1) featureSubsetStrategy to TreeEnsembleParams instead of having it on RandomForestParams . So that it can be used for both Random Forest and GBT 2 ) Changed DecisionTreeRegressor private train method to pass featureSubsetStrategy 3) To Test changed GradientBoostedTreeClassifierExample with val gbt = new GBTClassifier() .setLabelCol("indexedLabel") .setFeaturesCol("indexedFeatures") .setMaxIter(10) .setFeatureSubsetStrategy("auto") > GradientBoostedTreesModel doesn't have featureSubsetStrategy parameter > ----------------------------------------------------------------------- > > Key: SPARK-20199 > URL: https://issues.apache.org/jira/browse/SPARK-20199 > Project: Spark > Issue Type: Improvement > Components: ML, MLlib > Affects Versions: 2.1.0 > Reporter: pralabhkumar > > Spark GradientBoostedTreesModel doesn't have Column sampling rate parameter > . This parameter is available in H2O and XGBoost. > Sample from H2O.ai > gbmParams._col_sample_rate > Please provide the parameter . -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org