Dear community,

Here are the Hudi community bi-weekly updates for 2022-06-20 ~ 2022-07-03, covering new features and bug fixes.
=======================================
Features

[Spark] Support export command based on Call Produce Command [1]
[Spark] Initialize hudi table management module [2]
[Spark] Add call procedure for FileSystemViewCommand [3]
[Spark] Add call procedure for HoodieLogFileCommand [4]
[Flink] Support inline schedule clustering for Flink stream [5]
[Spark] Add call procedure for StatsCommand [6]
[Spark] Support hdfs parquet import command based on Call Produce Command [7]
[Spark] Add call procedure for CommitsCommand [8]
[Flink] Column stats data skipping for flink [9]
[Spark] Add call procedure for UpgradeOrDowngradeCommand [10]

[1] https://issues.apache.org/jira/browse/HUDI-3507
[2] https://issues.apache.org/jira/browse/HUDI-3475
[3] https://issues.apache.org/jira/browse/HUDI-3508
[4] https://issues.apache.org/jira/browse/HUDI-3509
[5] https://issues.apache.org/jira/browse/HUDI-4273
[6] https://issues.apache.org/jira/browse/HUDI-3512
[7] https://issues.apache.org/jira/browse/HUDI-3502
[8] https://issues.apache.org/jira/browse/HUDI-3506
[9] https://issues.apache.org/jira/browse/HUDI-4353
[10] https://issues.apache.org/jira/browse/HUDI-3505

=======================================
Bugs

[Flink] Fix when HoodieTable removes data file before the end of Flink job [1]
[Spark] Fix wrong results if the user read no base files hudi table by glob paths [2]
[Core] Bootstrap op data loading missing [3]
[Flink] Fix Flink lose data on some rollback scene [4]
[Core] Fix records overwritten bug with binary primary key [5]
[Flink] Flink Hudi module should support low-level source and sink api [6]

[1] https://issues.apache.org/jira/browse/HUDI-4258
[2] https://issues.apache.org/jira/browse/HUDI-4173
[3] https://issues.apache.org/jira/browse/HUDI-4270
[4] https://issues.apache.org/jira/browse/HUDI-4311
[5] https://issues.apache.org/jira/browse/HUDI-4336
[6] https://issues.apache.org/jira/browse/HUDI-3953

Best,
Leesf
