[ https://issues.apache.org/jira/browse/SPARK-54172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18037547#comment-18037547 ]
Szehon Ho commented on SPARK-54172:
-----------------------------------
Hi [~dongjoon], this is the other major feature I worked on for Spark 4.1.
Could you also help me make the following issues subtasks of this one:
* https://issues.apache.org/jira/browse/SPARK-52403
* https://issues.apache.org/jira/browse/SPARK-52578
* https://issues.apache.org/jira/browse/SPARK-53891
* https://issues.apache.org/jira/browse/SPARK-54310 (in progress; may
slip past 4.1)
Please also link this to https://issues.apache.org/jira/browse/SPARK-51166
for 4.1 release note organization.
Thanks!
Szehon
> Merge Into schema evolution should only add referenced fields
> -------------------------------------------------------------
>
> Key: SPARK-54172
> URL: https://issues.apache.org/jira/browse/SPARK-54172
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Affects Versions: 4.1.0
> Reporter: Szehon Ho
> Assignee: Szehon Ho
> Priority: Major
> Labels: pull-request-available
>
> SPARK-52991 added schema evolution support for MERGE INTO statements.
> However, it is too broad. In some cases the source table has many more
> fields than the target table, but only a few of the new ones need to be
> added to the target. We should limit the scope of schema evolution to
> adding only the columns and nested fields that are referenced in the
> MERGE INTO query via UPDATE or INSERT clauses.
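
To illustrate the rule described in the issue, here is a minimal sketch in plain Python. It is not Spark's implementation. The schema model (nested dicts), the helper names `missing_paths` and `fields_to_add`, and the example tables are all hypothetical. It also assumes that assigning a whole struct counts as referencing every new nested field inside it, which the issue does not state.

```python
from typing import Dict, Set, Tuple, Union

# Hypothetical schema model: column name -> type name, or a nested dict for a struct.
Schema = Dict[str, Union[str, "Schema"]]
Path = Tuple[str, ...]


def missing_paths(source: Schema, target: Schema, prefix: Path = ()) -> Set[Path]:
    """Return paths of columns/nested fields present in source but absent from target."""
    out: Set[Path] = set()
    for name, src_type in source.items():
        path = prefix + (name,)
        if name not in target:
            out.add(path)
        elif isinstance(src_type, dict) and isinstance(target[name], dict):
            # Both sides are structs: look for new nested fields.
            out |= missing_paths(src_type, target[name], path)
    return out


def fields_to_add(source: Schema, target: Schema, referenced: Set[Path]) -> Set[Path]:
    """Keep only the missing paths that an UPDATE/INSERT assignment references.

    Assumption: a reference to a path also covers everything nested under it.
    """
    def covered(path: Path) -> bool:
        return any(path[: len(ref)] == ref for ref in referenced)

    return {p for p in missing_paths(source, target) if covered(p)}


# Hypothetical tables: the source has several fields the target lacks.
target = {"id": "int", "name": "string", "address": {"city": "string"}}
source = {
    "id": "int",
    "name": "string",
    "email": "string",
    "phone": "string",
    "address": {"city": "string", "zip": "string", "country": "string"},
}

# Paths assigned by the MERGE's UPDATE/INSERT clauses (hypothetical).
referenced = {("email",), ("address", "zip")}

print(sorted(fields_to_add(source, target, referenced)))
```

Under these assumptions, `phone` and `address.country` exist only in the source but are never assigned, so they are left out of the evolved target schema.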