GitHub user wangyum opened a pull request: https://github.com/apache/spark/pull/22530

[SPARK-24869][SQL] Fix SaveIntoDataSourceCommand's input Dataset does not use Cached Data

## What changes were proposed in this pull request?

This PR fixes two things:
1. Fix `SaveIntoDataSourceCommand`'s input Dataset not using cached data.
2. Fix the web UI for `SaveIntoDataSourceCommand` when writing data.

## How was this patch tested?

Unit tests and manual tests.

Manual test of the web UI:
```scala
import java.util.Properties
import org.apache.spark.sql.SaveMode
import org.apache.spark.sql.functions.{col, lit, udf}
val udf1 = udf({ (x: Int, y: Int) => x + y })
val df = spark.range(0, 3).toDF("a").withColumn("b", udf1(col("a"), lit(10)))
df.write.mode(SaveMode.Overwrite).jdbc(jdbcUrl, jdbcTable, new Properties)
```

Before this PR:
![image](https://user-images.githubusercontent.com/5399861/45927685-539bf080-bf69-11e8-8700-f32430d9aeae.png)

After this PR:
![image](https://user-images.githubusercontent.com/5399861/45927682-4b43b580-bf69-11e8-9161-64399542cf77.png)

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/wangyum/spark SPARK-24869

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22530.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22530

----
commit 44f6a6ce5d4bdd4ac02d84526b5c4cdf85af2886
Author: Yuming Wang <yumwang@...>
Date:   2018-09-23T11:27:08Z

    SPARK-24869 SaveIntoDataSourceCommand's input Dataset does not use Cached Data

commit 9b1cc1d826cb89f0ed6021ae6c8cddc978c0173e
Author: Yuming Wang <yumwang@...>
Date:   2018-09-23T11:34:58Z

    Fix UI

----