GitHub user ajaysaini725 opened a pull request: https://github.com/apache/spark/pull/18888
    [Spark-17025][ML][Python] Persistence for Custom Python-only Pipelines

    ## What changes were proposed in this pull request?

    Implemented a Python-only persistence framework for pipelines containing stages that cannot be saved using Java.

    ## How was this patch tested?

    Created a custom Python-only UnaryTransformer, included it in a Pipeline, and saved/loaded the pipeline. The loaded pipeline was compared against the original using _compare_pipelines() in tests.py.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ajaysaini725/spark PythonPipelines

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/18888.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18888

----
commit 840a19324f54a1077d59a7cd6f8e911f57505370
Author: Ajay Saini <ajays...@gmail.com>
Date:   2017-08-08T23:24:23Z

    Pipeline persistence commit with tests.

commit 85a98d6fa45e7f6bb32af9b949913e70b32f8ca9
Author: Ajay Saini <ajays...@gmail.com>
Date:   2017-08-08T23:30:33Z

    Fixed import

----
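
For readers unfamiliar with the idea, the save/load round trip described under "How was this patch tested?" can be illustrated without Spark. The following is a minimal standard-library sketch, not the code in this pull request and not Spark's on-disk format: `ShiftTransformer`, `STAGE_REGISTRY`, `save_stage`, `load_stage`, `save_pipeline`, `load_pipeline`, and the `metadata.json` / `pipeline.json` file names are all hypothetical names chosen for illustration. It assumes each Python-only stage can be rebuilt from its class name plus a JSON-serializable parameter dictionary.

```python
import json
import os
import tempfile

# Hypothetical registry mapping a saved class name back to its Python class.
STAGE_REGISTRY = {}


def register(cls):
    """Class decorator that records a stage class under its name."""
    STAGE_REGISTRY[cls.__name__] = cls
    return cls


@register
class ShiftTransformer:
    """Hypothetical Python-only stage: adds a constant to every value."""

    def __init__(self, shift=0.0):
        self.shift = shift

    def transform(self, values):
        return [v + self.shift for v in values]

    def params(self):
        # Everything needed to reconstruct the stage, as plain JSON types.
        return {"shift": self.shift}


def save_stage(stage, path):
    """Write the stage's class name and params to <path>/metadata.json."""
    os.makedirs(path, exist_ok=True)
    metadata = {"class": type(stage).__name__, "params": stage.params()}
    with open(os.path.join(path, "metadata.json"), "w") as f:
        json.dump(metadata, f)


def load_stage(path):
    """Rebuild a stage from the metadata written by save_stage."""
    with open(os.path.join(path, "metadata.json")) as f:
        metadata = json.load(f)
    cls = STAGE_REGISTRY[metadata["class"]]
    return cls(**metadata["params"])


def save_pipeline(stages, path):
    """Save the stage count, then each stage in its own numbered directory."""
    os.makedirs(path, exist_ok=True)
    with open(os.path.join(path, "pipeline.json"), "w") as f:
        json.dump({"num_stages": len(stages)}, f)
    for i, stage in enumerate(stages):
        save_stage(stage, os.path.join(path, "stage_%d" % i))


def load_pipeline(path):
    """Load the stages back in their original order."""
    with open(os.path.join(path, "pipeline.json")) as f:
        num_stages = json.load(f)["num_stages"]
    return [load_stage(os.path.join(path, "stage_%d" % i))
            for i in range(num_stages)]


def run(stages, values):
    """Apply each stage's transform in sequence."""
    for stage in stages:
        values = stage.transform(values)
    return values


def roundtrip_demo():
    """Save a two-stage pipeline to a temp directory and load it back."""
    original = [ShiftTransformer(shift=1.5), ShiftTransformer(shift=-0.5)]
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "pipeline")
        save_pipeline(original, path)
        loaded = load_pipeline(path)
    return original, loaded
```

Comparing `[s.params() for s in loaded]` against the same list for `original` plays the role that `_compare_pipelines()` plays in the test described above. A registry is used here instead of dynamic imports only to keep the sketch self-contained; the pull request itself may resolve classes differently.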