[jira] [Updated] (SPARK-49687) Delay sorting in validateAndMaybeEvolveStateSchema

Zhihong Yu (Jira) Tue, 17 Sep 2024 08:46:04 -0700


     [ 
https://issues.apache.org/jira/browse/SPARK-49687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Zhihong Yu updated SPARK-49687:
-------------------------------
    Description: 
In `validateAndMaybeEvolveStateSchema`, existing schema and new schema 
are sorted by column family name.
The sorting can be delayed until `createSchemaFile` is called.

When computing `colFamiliesAddedOrRemoved`, we can use `toSet` 
to compare column families.

  was:
In `validateAndMaybeEvolveStateSchema`, existing schema and new schema 
are sorted by column family name.
The sorting can be skipped.


When computing `colFamiliesAddedOrRemoved`, we can use `toSet` 
to compare column families.


> Delay sorting in validateAndMaybeEvolveStateSchema
> --------------------------------------------------
>
>                 Key: SPARK-49687
>                 URL: https://issues.apache.org/jira/browse/SPARK-49687
>             Project: Spark
>          Issue Type: Improvement
>          Components: Structured Streaming
>    Affects Versions: 3.4.3
>            Reporter: Zhihong Yu
>            Priority: Minor
>              Labels: pull-request-available
>
> In `validateAndMaybeEvolveStateSchema`, existing schema and new 
> schema are sorted by column family name.
> The sorting can be delayed until `createSchemaFile` is called.
> When computing `colFamiliesAddedOrRemoved`, we can use 
> `toSet` to compare column families.
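
The idea in the description can be illustrated with a minimal Scala sketch. It is not the Spark implementation: `ColFamilySchema`, `DelaySortingSketch`, and `sortedForWrite` are hypothetical names introduced here, and the sketch assumes that a deterministic order only matters at the point where the schema file is written.

```scala
// Hypothetical stand-in for a column family schema entry; not Spark's actual type.
final case class ColFamilySchema(colFamilyName: String, valueSchema: String)

object DelaySortingSketch {
  // Order-insensitive check: true if any column family name was added or removed.
  // Comparing name sets avoids sorting both lists up front.
  def colFamiliesAddedOrRemoved(
      existing: Seq[ColFamilySchema],
      updated: Seq[ColFamilySchema]): Boolean =
    existing.map(_.colFamilyName).toSet != updated.map(_.colFamilyName).toSet

  // Sort only where a stable order is needed (assumed here: just before writing the file).
  def sortedForWrite(schemas: Seq[ColFamilySchema]): Seq[ColFamilySchema] =
    schemas.sortBy(_.colFamilyName)
}
```

Note that a set comparison ignores duplicate names, so this sketch assumes column family names are unique within each schema list.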



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
