[ https://issues.apache.org/jira/browse/SPARK-49687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zhihong Yu updated SPARK-49687: ------------------------------- Description: In `{{{}validateAndMaybeEvolveStateSchema`{}}}, existing schema and new schema are sorted by column family name. The sorting can be delayed until `createSchemaFile` is called. When computing `{{{}colFamiliesAddedOrRemoved`{}}}, we can use `{{{}toSet`{}}} to compare column families. was: In `{{{}validateAndMaybeEvolveStateSchema`{}}}, existing schema and new schema are sorted by column family name. The sorting can be skipped. When computing `{{{}colFamiliesAddedOrRemoved`{}}}, we can use `{{{}toSet`{}}} to compare column families. > Delay sorting in validateAndMaybeEvolveStateSchema > -------------------------------------------------- > > Key: SPARK-49687 > URL: https://issues.apache.org/jira/browse/SPARK-49687 > Project: Spark > Issue Type: Improvement > Components: Structured Streaming > Affects Versions: 3.4.3 > Reporter: Zhihong Yu > Priority: Minor > Labels: pull-request-available > > In `{{{}validateAndMaybeEvolveStateSchema`{}}}, existing schema and new > schema are sorted by column family name. > The sorting can be delayed until `createSchemaFile` is called. > When computing `{{{}colFamiliesAddedOrRemoved`{}}}, we can use > `{{{}toSet`{}}} to compare column families. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org