Julian King created SPARK-23730: ----------------------------------- Summary: Save and expose "in bag" tracking for random forest model Key: SPARK-23730 URL: https://issues.apache.org/jira/browse/SPARK-23730 Project: Spark Issue Type: Improvement Components: ML Affects Versions: 2.3.0 Reporter: Julian King
In a random forest model, it is often useful to be able to keep track of which samples ended up in each of the bootstrap replications (and how many times this happened). For instance, in the R randomForest package this is accomplished through the option keep.inbag=TRUE Similar functionality in Spark ML's random forest would be helpful -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org