Dear community, Nice to share Hudi community weekly update for 2020-09-13 ~ 2020-09-20 with updates on features, bugfixs and tests.
======================================= Features [Integration Test] Check whether the topic exists before deltastrmer consumes Kafka [1] [Hudi CLI] Add deduping logic for upserts case [2] [Writer Core] Adding a way to post process schema after it is fetched [3] ======================================= Bugs [Spark Integration] Fix for preventing MOR datasource jobs from hanging via spark-submit [4] ====================================== Tests [Test] Change timestamp field in HoodieTestDataGenerator from double to long [5] [Test] Use HoodieTestTable in more classes [6] [Test] Migrate HoodieTestUtils APIs to HoodieTestTable [7] [1] https://issues.apache.org/jira/browse/HUDI-1228 [2] https://issues.apache.org/jira/browse/HUDI-976 [3] https://issues.apache.org/jira/browse/HUDI-801 [4] https://issues.apache.org/jira/browse/HUDI-1230 [5] https://issues.apache.org/jira/browse/HUDI-1143 [6] https://issues.apache.org/jira/browse/HUDI-995 [7] https://issues.apache.org/jira/browse/HUDI-995 Best, Leesf