Dear community, Nice to share Hudi community bi-weekly updates for 2022-12-05 ~ 2022-12-18 with updates on bug fixes.
======================================= Features [Core] add call help procedure [1] [Spark] Hudi supports Spark TVF [2] [Spark] Add new bulk insert sort modes repartitioning data by partition path [3] [Spark] Upgrade to spark 3.3.1 & 3.2.2 [4] [1] https://issues.apache.org/jira/browse/HUDI-5314 [2] https://issues.apache.org/jira/browse/HUDI-5340 [3] https://issues.apache.org/jira/browse/HUDI-5342 [4] https://issues.apache.org/jira/browse/HUDI-4411 ======================================= Bugs [Spark] Support type change for schema on read + reconcile schema [1] [Spark] Fix checkpoint reading for structured streaming [2] [Core] Flink async compaction is not thread safe when use watermark [3] [Spark] Fix failure handling with spark datasource write [4] [Spark] FIxing performance traps in Spark SQL MERGE INTO implementation [5] [Spark] Fixing Create Table as Select (CTAS) performance gaps [6] [Flink] Fix oom cause compaction event lost problem [7] [Spark] Checkpoint management for muti-writer scenario [8] [1] https://issues.apache.org/jira/browse/HUDI-5294 [2] https://issues.apache.org/jira/browse/HUDI-5334 [3] https://issues.apache.org/jira/browse/HUDI-3661 [4] https://issues.apache.org/jira/browse/HUDI-5163 [5] https://issues.apache.org/jira/browse/HUDI-5347 [6] https://issues.apache.org/jira/browse/HUDI-5346 [7] https://issues.apache.org/jira/browse/HUDI-5350 [8] https://issues.apache.org/jira/browse/HUDI-4432 Best, Leesf
