GitHub user rdblue opened a pull request:

    https://github.com/apache/spark/pull/21305

    [SPARK-24251][SQL] Add AppendData logical plan.

    ## What changes were proposed in this pull request?
    
    This adds a new logical plan, AppendData, that was proposed in SPARK-23521: 
Standardize SQL logical plans.
    
    * DataFrameWriter uses the new AppendData plan for DataSourceV2 appends
    * AppendData is resolved if its output columns match the incoming data frame
    * A new analyzer rule, ResolveOutputColumns, validates data before it is 
appended. This rule will add safe casts, rename columns, and checks nullability
    
    ## How was this patch tested?
    
    Existing tests for v2 appends. Will add AppendData tests to validate 
logical plan analysis.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/rdblue/spark SPARK-24251-add-append-data

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21305.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21305
    
----
commit a3bf5300e4a1c9c05a50273f530eaade00d57659
Author: Ryan Blue <blue@...>
Date:   2018-05-07T15:54:37Z

    SPARK-24251: Add AppendData logical plan.
    
    This adds a new logical plan, AppendData, that was proposed in
    SPARK-23521. This also adds an analyzer rule to validate data written
    with AppendData against the target table. DataFrameWriter is also
    updated so that v2 writes use the new AppendData logical plan.

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to