[ 
https://issues.apache.org/jira/browse/SPARK-14816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15381438#comment-15381438
 ] 

Joseph K. Bradley commented on SPARK-14816:
-------------------------------------------

I'd say the updates for [http://spark.apache.org/mllib/] are:
* Ease of Use: "MLlib fits into Spark's APIs and interoperates with NumPy in 
Python (starting in Spark 0.9)." --> change to "MLlib fits into Spark's APIs 
and interoperates with NumPy in Python (as of Spark 0.9) and R (as of Spark 
1.5)."
* Algorithms list: Change to a list of categories, not specific algorithms
* "Calling MLlib in Python" code snippet: Change to:
{code}
data = spark.read.format("libsvm").load("hdfs://...")
model = KMeans().setK(10).fit(data)
{code}

If this sounds good, I can make the change.

[~shivaram] SparkR does not really have a website.  Should we add one?

> Update MLlib, GraphX, SparkR websites for 2.0
> ---------------------------------------------
>
>                 Key: SPARK-14816
>                 URL: https://issues.apache.org/jira/browse/SPARK-14816
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Documentation, GraphX, ML, MLlib, SparkR
>            Reporter: Joseph K. Bradley
>            Priority: Critical
>
> Update the sub-projects' websites to include new features in this release.
> For MLlib, make it clear that the DataFrame-based API is the primary one now.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)