[ https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15381438#comment-15381438 ]
Joseph K. Bradley commented on SPARK-14816: ------------------------------------------- I'd say the updates for [http://spark.apache.org/mllib/] are: * Ease of Use: "MLlib fits into Spark's APIs and interoperates with NumPy in Python (starting in Spark 0.9)." --> change to "MLlib fits into Spark's APIs and interoperates with NumPy in Python (as of Spark 0.9) and R (as of Spark 1.5)." * Algorithms list: Change to a list of categories, not specific algs * Calling MLlib in Python code snippet: Change to: {code} data = spark.read.format("libsvm").load("hdfs://...") model = new KMeans().setK(10).fit(data) {code} If this sounds good, I can make the change. [~shivaram] SparkR does not really have a website. Should we add one? > Update MLlib, GraphX, SparkR websites for 2.0 > --------------------------------------------- > > Key: SPARK-14816 > URL: https://issues.apache.org/jira/browse/SPARK-14816 > Project: Spark > Issue Type: Sub-task > Components: Documentation, GraphX, ML, MLlib, SparkR > Reporter: Joseph K. Bradley > Priority: Critical > > Update the sub-projects' websites to include new features in this release. > For MLlib, make it clear that the DataFrame-based API is the primary one now. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org