[ https://issues.apache.org/jira/browse/SPARK-3251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14162410#comment-14162410 ]
Apache Spark commented on SPARK-3251:
-------------------------------------

User 'BigCrunsh' has created a pull request for this issue:
https://github.com/apache/spark/pull/2137

> Clarify learning interfaces
> ----------------------------
>
>                 Key: SPARK-3251
>                 URL: https://issues.apache.org/jira/browse/SPARK-3251
>             Project: Spark
>          Issue Type: Improvement
>          Components: MLlib
>    Affects Versions: 1.1.0, 1.1.1
>            Reporter: Christoph Sawade
>
> *Make threshold mandatory*
> Currently, the output of predict for an example is either the score
> or the class, depending on whether clearThreshold has been called.
> To remove this side effect, three separate predict methods
> (predictScore, predictClass, predictProbability) were introduced; the
> threshold is no longer optional.
>
> *Clarify classification interfaces*
> Currently, some functionality is spread over multiple models.
> In order to clarify the structure and simplify the implementation of
> more complex models (like multinomial logistic regression), two new
> classes are introduced:
> - BinaryClassificationModel: for all models that derive a binary
>   classification from a single weight vector. It comprises the
>   thresholding functionality that derives a prediction from a score,
>   and basically captures SVMModel and LogisticRegressionModel.
> - ProbabilisticClassificationModel: this trait defines the interface
>   for models that return a calibrated confidence score (aka
>   probability).
>
> *Misc*
> - some renaming
> - add test for probabilistic output
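
For readers who want a concrete picture of the proposed split, here is a minimal sketch. It is written in plain Python for brevity and is not the code from the pull request, which targets MLlib's Scala API. The class names follow the issue description; the snake_case method names, the constructor signature, and the choice to threshold the raw margin (w . x + b) are assumptions made for illustration only.

```python
import math
from abc import ABC, abstractmethod


class BinaryClassificationModel:
    """Binary classifier derived from a single weight vector (illustrative sketch)."""

    def __init__(self, weights, intercept, threshold):
        # The threshold is a required argument: there is no clearThreshold-style
        # switch that silently changes what a predict call returns.
        self.weights = list(weights)
        self.intercept = float(intercept)
        self.threshold = float(threshold)

    def predict_score(self, features):
        # Assumption: the "score" is the raw margin w . x + b.
        return sum(w * x for w, x in zip(self.weights, features)) + self.intercept

    def predict_class(self, features):
        # Always returns a class label (0.0 or 1.0), never a score.
        return 1.0 if self.predict_score(features) >= self.threshold else 0.0


class ProbabilisticClassificationModel(ABC):
    """Interface for models that return a calibrated confidence score."""

    @abstractmethod
    def predict_probability(self, features):
        ...


class SVMModel(BinaryClassificationModel):
    # An SVM margin is not a calibrated probability, so this sketch
    # deliberately does not implement the probabilistic interface.
    pass


class LogisticRegressionModel(BinaryClassificationModel,
                              ProbabilisticClassificationModel):
    def predict_probability(self, features):
        # Logistic function applied to the margin.
        return 1.0 / (1.0 + math.exp(-self.predict_score(features)))


# Hypothetical usage with made-up weights.
lr = LogisticRegressionModel(weights=[2.0, -1.0], intercept=0.5, threshold=0.0)
svm = SVMModel(weights=[2.0, -1.0], intercept=0.5, threshold=0.0)
example = [1.0, 1.0]
lr_score = lr.predict_score(example)
lr_class = lr.predict_class(example)
lr_prob = lr.predict_probability(example)
svm_class = svm.predict_class(example)
```

With this layout each method has exactly one return type, and only models that can justify a probability expose predict_probability.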