[ https://issues.apache.org/jira/browse/SPARK-23528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16380426#comment-16380426 ]
Marco Gaido commented on SPARK-23528:
-------------------------------------

The log likelihood is already available in the summary (e.g. {{model.summary.logLikelihood}}). I will soon submit a PR adding the number of iterations. Thanks for reporting this.

> Expose vital statistics of GaussianMixtureModel
> -----------------------------------------------
>
>                 Key: SPARK-23528
>                 URL: https://issues.apache.org/jira/browse/SPARK-23528
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.2.1
>            Reporter: Erich Schubert
>            Priority: Minor
>
> Spark ML should expose vital statistics of the GMM model:
> * *Number of iterations* (actual, not max) until the tolerance threshold was hit: we can set a maximum, but how do we know the limit was large enough, and how many iterations it really took?
> * Final *log likelihood* of the model: if we run multiple times with different starting conditions, how do we know which run converged to the better fit?
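
The summary property mentioned in the comment can be used to compare runs started from different seeds. Below is a minimal PySpark sketch, not taken from the thread. It assumes that `summary.logLikelihood` is available on a freshly trained `GaussianMixtureModel` in PySpark as it is in the Scala API, and that `df` is a DataFrame with a `features` vector column. The helper names `fit_with_seeds` and `best_fit` are hypothetical and not part of Spark.

```python
def fit_with_seeds(df, k, seeds, max_iter=100, tol=0.01):
    """Fit one GaussianMixture per seed (hypothetical helper, not part of Spark)."""
    # Imported lazily so that best_fit below can be used without a Spark install.
    from pyspark.ml.clustering import GaussianMixture

    return [
        GaussianMixture(k=k, seed=seed, maxIter=max_iter, tol=tol).fit(df)
        for seed in seeds
    ]


def best_fit(models):
    """Return the model whose training summary reports the highest log likelihood."""
    # Assumes every model still carries its training summary (model.hasSummary).
    return max(models, key=lambda m: m.summary.logLikelihood)
```

A call such as `best_fit(fit_with_seeds(df, k=3, seeds=[1, 2, 3]))` would then return the run with the best final log likelihood. The actual number of iterations is not covered here, since the comment above says it was not yet exposed at the time of writing.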