Dear community, Nice to share Hudi community bi-weekly updates for 2022-09-12 ~ 2022-09-25 with updates on bug fixes.
======================================= Features [Core] Consistent bucket index: bucket resizing (split&merge) & concurrent write during resizing [1] [Core] Add Postgres Schema Name to Postgres Debezium Source [2] [Core] Support compaction strategy based on delta log file num [3] [Core] Support partial update payload [4] [Core] Implement CDC Write in Spark [5] [Core] Supporting delete savepoint for MOR [6] [Core] Support hiveSync command based on Call Produce Command [7] [1] https://issues.apache.org/jira/browse/HUDI-3558 [2] https://issues.apache.org/jira/browse/HUDI-4833 [3] https://issues.apache.org/jira/browse/HUDI-4842 [4] https://issues.apache.org/jira/browse/HUDI-3304 [5] https://issues.apache.org/jira/browse/HUDI-3478 [6] https://issues.apache.org/jira/browse/HUDI-4883 [7] https://issues.apache.org/jira/browse/HUDI-4559 ======================================= Bugs [Core] Fix AWSDmsAvroPayload#getInsertValue,combineAndGetUpdateValue to invoke correct api [1] [Flink] Hudi-flink support GLOBAL_BLOOM,GLOBAL_SIMPLE,BUCKET index type [2] [Core] hoodie.logfile.max.size It does not take effect, causing the log file to be too large [3] [Spark] Fix infer keygen not work in sparksql side issue [4] [Core] Fix HoodieSimpleBucketIndex not consider bucket num in log file issue [5] [Core] Fix file group pending compaction cannot be queried when query _ro table [6] [Spark] Support Clustering row writer to improve performance [7] [1] https://issues.apache.org/jira/browse/HUDI-4831 [2] https://issues.apache.org/jira/browse/HUDI-4628 [3] https://issues.apache.org/jira/browse/HUDI-4780 [4] https://issues.apache.org/jira/browse/HUDI-4813 [5] https://issues.apache.org/jira/browse/HUDI-4808 [6] https://issues.apache.org/jira/browse/HUDI-4729 [7] https://issues.apache.org/jira/browse/HUDI-4363 Best, Leesf