[ https://issues.apache.org/jira/browse/PIO-116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144973#comment-16144973 ]
ASF GitHub Bot commented on PIO-116: ------------------------------------ GitHub user marevol opened a pull request: https://github.com/apache/incubator-predictionio/pull/427 [PIO-116] PySpark Support This PR provides PySpark support with minimum PIO changes. 1. Support pyspark on pio-shell 2. Add python files to use pyspark 3. Add --main-py-file option to "pio train" to submit .py file to spark Note that this provides only fixes for Spark 2.x. (because this fixes expect to use SparkML) Sample project is: https://github.com/jpioug/predictionio-template-iris (For prediction API, Scala code is used.) You can merge this pull request into a Git repository by running: $ git pull https://github.com/marevol/incubator-predictionio pyspark Alternatively you can review and apply these changes as the patch at: https://github.com/apache/incubator-predictionio/pull/427.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #427 ---- commit ee28fcf139c6ac8184d990cbdc4d43b00ff483fd Author: Shinsuke Sugaya <shins...@apache.org> Date: 2017-08-22T09:47:05Z add pyspark sub-command commit 97f0343691ff1ca98f1ce65fc8ad3e25df6cd15b Author: Shinsuke Sugaya <shins...@apache.org> Date: 2017-08-27T14:16:18Z replace with values.toString commit 2970397a6024f17872011979edcae1712f8a4362 Author: Shinsuke Sugaya <shins...@apache.org> Date: 2017-08-28T10:04:24Z add --main-py-file option to train ---- > PySpark Support > --------------- > > Key: PIO-116 > URL: https://issues.apache.org/jira/browse/PIO-116 > Project: PredictionIO > Issue Type: New Feature > Components: Core > Reporter: Shinsuke Sugaya > Assignee: Shinsuke Sugaya > > This provides PySpark support with minimum PIO changes. > 1. Support pyspark on pio-shell > 2. Add python files to use pyspark > 3. Add --main-py-file option to "pio train" to submit .py file to spark > Note that this provides only fixes for Spark 2.x. > (because this fixes expect to use SparkML) > Sample project is: > https://github.com/jpioug/predictionio-template-iris > (For prediction API, Scala code is used.) -- This message was sent by Atlassian JIRA (v6.4.14#64029)