GitHub user wangyum opened a pull request:

    https://github.com/apache/spark/pull/22530

    [SPARK-24869][SQL] Fix SaveIntoDataSourceCommand's input Dataset does not 
use Cached Data

    ## What changes were proposed in this pull request?
    
    This PR fix 2 things:
    1. Fix `SaveIntoDataSourceCommand`'s input Dataset does not use Cached Data.
    2. Fix `SaveIntoDataSourceCommand`s web ui when writing data.
    
    ## How was this patch tested?
    unit tests and  manual tests
    
    Manual test web UI:
    ```scala
    val udf1 = udf({ (x: Int, y: Int) => x + y })
    val df = spark.range(0, 3).toDF("a").withColumn("b", udf1(col("a"), 
lit(10)))
    df.write.mode(SaveMode.Overwrite).jdbc(jdbcUrl, jdbcTable, new Properties)
    ```
    Before this PR:
    
![image](https://user-images.githubusercontent.com/5399861/45927685-539bf080-bf69-11e8-8700-f32430d9aeae.png)
    
    
    After this PR:
    
![image](https://user-images.githubusercontent.com/5399861/45927682-4b43b580-bf69-11e8-9161-64399542cf77.png)
    
    


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/wangyum/spark SPARK-24869

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22530.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22530
    
----
commit 44f6a6ce5d4bdd4ac02d84526b5c4cdf85af2886
Author: Yuming Wang <yumwang@...>
Date:   2018-09-23T11:27:08Z

    SPARK-24869 SaveIntoDataSourceCommand's input Dataset does not use Cached 
Data

commit 9b1cc1d826cb89f0ed6021ae6c8cddc978c0173e
Author: Yuming Wang <yumwang@...>
Date:   2018-09-23T11:34:58Z

    Fix UI

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to