Dear community, Nice to share Hudi community bi-weekly update for 2020-09-27 ~ 2020-10-11 with updates on features, bugfixs and tests.
======================================= Features [Hive Sync] Make create hive database automatically configurable [1] [Writer Core] Deltastreamer Kafka consumption delay reporting indicators [2] [Writer Core] Introduce REPLACE top level action. Implement insert_overwrite operation on top of replace action [3] [Writer Core] Refactor hudi-client to support multi-engine [4] [Metrics] Added an API to shutdown and remove the metrics reporter [5] [Writer Core] add port configuration for EmbeddedTimelineService [6] [Spark Integration] use spark INCREMENTAL mode query hudi dataset support schema version [7] ======================================= Bugs [Writer Core] Avoid blank file created by HoodieLogFormatWriter [8] [Writer Core] relocated jetty in hudi-utilities-bundle pom [9] [Writer Core] Ordering Field should be optional when precombine is turned off [10] [DeltaStreamer] DeltaStreamer can now fetch schema before every run in continuous mode [11] ====================================== Tests [Test] Some improvements for the HUDI Test Suite [12] [Test] Migrate HoodieTestUtils APIs to HoodieTestTable [13] [1] https://issues.apache.org/jira/browse/HUDI-1192 [2] https://issues.apache.org/jira/browse/HUDI-1233 [3] https://issues.apache.org/jira/browse/HUDI-1072 [4] https://issues.apache.org/jira/browse/HUDI-1089 [5] https://issues.apache.org/jira/browse/HUDI-1305 [6] https://issues.apache.org/jira/browse/HUDI-1203 [7] https://issues.apache.org/jira/browse/HUDI-1301 [8] https://issues.apache.org/jira/browse/HUDI-840 [9] https://issues.apache.org/jira/browse/HUDI-1199 [10] https://issues.apache.org/jira/browse/HUDI-1208 [11] https://issues.apache.org/jira/browse/HUDI-603 [12] https://issues.apache.org/jira/browse/HUDI-1303 [13] https://issues.apache.org/jira/browse/HUDI-995 Best, Leesf
