GitHub user holdenk opened a pull request: https://github.com/apache/spark/pull/19876
[WIP][ML][SPARK-11171] spark 11237 Add PMML export to Spark ML pipelines ## What changes were proposed in this pull request? Adds PMML export support to Spark ML pipelines in the style of Spark's DataSource API to allow library authors to add their own model export formats. This is a WIP to see if this is the design we want to go with. ## How was this patch tested? Basic unit test. You can merge this pull request into a Git repository by running: $ git pull https://github.com/holdenk/spark SPARK-11171-SPARK-11237-Add-PMML-export-for-ML-KMeans-r2 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/19876.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #19876 ---- commit 43ae30f08aed921178da07a5e982297b272c7c8f Author: Holden Karau <holdenka...@google.com> Date: 2017-11-24T14:00:16Z Initial attempt at allowing Spark ML writers to be slightly more pluggable commit 9fec08fbd2dd1c980d5862f0b4521213e1e9349c Author: Holden Karau <holdenka...@google.com> Date: 2017-11-25T12:55:19Z The LinearRegression suite passes commit 0075bf4776ecffa7fcb24a6f74c0e96161d6221c Author: Holden Karau <holdenka...@google.com> Date: 2017-11-25T13:00:18Z Add missing META-INFO for MLFormatRegister commit c68880d6d982c56934f4b583263ed5cd4e8329d6 Author: Holden Karau <holdenka...@google.com> Date: 2017-11-25T16:19:35Z Add a (untested) PMMLLinearRegressionModelWriter commit c2108df2b499bd45dff0e8add789f01d8c3c2c48 Author: Holden Karau <holdenka...@google.com> Date: 2017-12-04T10:00:56Z Basic PMML export test commit de8619098eeb01ff86b54753f27c29729935bb94 Author: Holden Karau <holdenka...@google.com> Date: 2017-12-04T11:27:03Z Add PMML testing utils for Spark ML that were accidently left out ---- --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org