Joseph K. Bradley created SPARK-18328:
-----------------------------------------
             Summary: CLONE - Additional fixes to Mllib 2.0 documentation
                 Key: SPARK-18328
                 URL: https://issues.apache.org/jira/browse/SPARK-18328
             Project: Spark
          Issue Type: Sub-task
          Components: Documentation, GraphX, ML, MLlib, SparkR
            Reporter: Timothy Hunter
            Assignee: Joseph K. Bradley
             Fix For: 2.0.0

While reviewing the documentation of MLlib, I found some additional issues.

Important issues that affect the binary signatures:
 - GBTClassificationModel: all the setters should be overridden
 - LogisticRegressionModel: setThreshold(s)
 - RandomForestClassificationModel: all the setters should be overridden
 - org.apache.spark.ml.stat.distribution.MultivariateGaussian is exposed, but most of its methods are private[ml] -> do we need to expose this class for now?
 - GeneralizedLinearRegressionModel: linkObj, familyObj, familyAndLink should not be exposed
 - sqlDataTypes: name does not follow conventions. Do we need to expose it?

Issues that involve only documentation:
 - Evaluator:
   1. inconsistent doc between evaluate and isLargerBetter
 - MinMaxScaler: math rendering
 - GeneralizedLinearRegressionSummary: aic doc is incorrect

The reference documentation that was used was:
http://people.apache.org/~pwendell/spark-releases/spark-2.0.0-rc2-docs/
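
For readers wondering why overriding setters counts as a binary-signature issue, here is a minimal Java sketch of one plausible reading. It assumes the concern is that a setter declared only on a parent class compiles to a method returning the parent type, so callers lose the concrete model type when chaining. All class and method names below (ToyModel, ToyGbtModel, setMaxDepth) are hypothetical and are not Spark APIs.

```java
// Hypothetical toy classes for illustration only; not Spark code.
class ToyModel {
    protected double threshold = 0.5;

    // Declared once on the base class: the compiled signature returns ToyModel.
    ToyModel setThreshold(double value) {
        this.threshold = value;
        return this;
    }

    double getThreshold() {
        return threshold;
    }
}

class ToyGbtModel extends ToyModel {
    private int maxDepth = 5;

    // Overriding with a covariant return type gives this class its own
    // setThreshold that returns ToyGbtModel, so chained calls keep the
    // concrete type.
    @Override
    ToyGbtModel setThreshold(double value) {
        super.setThreshold(value);
        return this;
    }

    ToyGbtModel setMaxDepth(int depth) {
        this.maxDepth = depth;
        return this;
    }

    int getMaxDepth() {
        return maxDepth;
    }
}

class SetterDemo {
    public static void main(String[] args) {
        // Compiles only because setThreshold is overridden in ToyGbtModel;
        // with the base-class version alone, the chain would have static
        // type ToyModel and setMaxDepth would not be visible.
        ToyGbtModel m = new ToyGbtModel().setThreshold(0.3).setMaxDepth(4);
        System.out.println(m.getThreshold() + " " + m.getMaxDepth());
    }
}
```

Adding or removing such an override changes the set of method signatures in the compiled class, which may be why the issue groups these items apart from the documentation-only ones.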