[ https://issues.apache.org/jira/browse/SPARK-6192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14707233#comment-14707233 ]
Joseph K. Bradley commented on SPARK-6192: ------------------------------------------ I'll mark it resolved. Thanks again for all of your help this summer---we really appreciate it. Good luck with everything, and I hope you're able to keep contributing. > Enhance MLlib's Python API (GSoC 2015) > -------------------------------------- > > Key: SPARK-6192 > URL: https://issues.apache.org/jira/browse/SPARK-6192 > Project: Spark > Issue Type: Umbrella > Components: ML, MLlib, PySpark > Reporter: Xiangrui Meng > Assignee: Manoj Kumar > Labels: gsoc, gsoc2015, mentor > > This is an umbrella JIRA for [~MechCoder]'s GSoC 2015 project. The main theme > is to enhance MLlib's Python API, to make it on par with the Scala/Java API. > The main tasks are: > 1. For all models in MLlib, provide save/load method. This also > includes save/load in Scala. > 2. Python API for evaluation metrics. > 3. Python API for streaming ML algorithms. > 4. Python API for distributed linear algebra. > 5. Simplify MLLibPythonAPI using DataFrames. Currently, we use > customized serialization, making MLLibPythonAPI hard to maintain. It > would be nice to use the DataFrames for serialization. > I'll link the JIRAs for each of the tasks. > Note that this doesn't mean all these JIRAs are pre-assigned to [~MechCoder]. > The TODO list will be dynamic based on the backlog. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org