[ https://issues.apache.org/jira/browse/SPARK-6884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14492892#comment-14492892 ]
Joseph K. Bradley commented on SPARK-6884: ------------------------------------------ Is this not a duplicate of [SPARK-3727]? Perhaps the best way to split up the work will be to make a subtask for trees, and a separate subtask for ensembles. I'll go ahead and do that. > random forest predict probabilities functionality (like in sklearn) > ------------------------------------------------------------------- > > Key: SPARK-6884 > URL: https://issues.apache.org/jira/browse/SPARK-6884 > Project: Spark > Issue Type: New Feature > Components: MLlib > Affects Versions: 1.3.0 > Environment: cross-platform > Reporter: Max Kaznady > Labels: prediction, probability, randomforest, tree > Original Estimate: 72h > Remaining Estimate: 72h > > Currently, there is no way to extract the class probabilities from the > RandomForest classifier. I implemented a probability predictor by counting > votes from individual trees and adding up their votes for "1" and then > dividing by the total number of votes. > I opened this ticked to keep track of changes. Will update once I push my > code to master. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org