GitHub user yogeshg opened a pull request: https://github.com/apache/spark/pull/20970
[SPARK-23562][ML] Forward RFormula handleInvalid Param to VectorAssembler to handle invalid values in non-string columns ## What changes were proposed in this pull request? `handleInvalid` Param was forwarded to the VectorAssembler used by RFormula. ## How was this patch tested? added a test and ran all tests for RFormula and VectorAssembler You can merge this pull request into a Git repository by running: $ git pull https://github.com/yogeshg/spark spark_23562 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/20970.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #20970 ---- commit 555be7900312cc594dae6f86140bd0f2fbf7cdd9 Author: Yogesh Garg <yogesh(dot)garg()databricks(dot)com> Date: 2018-03-20T01:27:13Z add test case commit b53cbfaa2323da5c46059a6e3439ded84cb23eeb Author: Yogesh Garg <yogesh(dot)garg()databricks(dot)com> Date: 2018-04-03T18:07:15Z pass handleInvalid param to VectorAssembler ---- --- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org