Dear community, Nice to share Hudi community weekly update for 2020-04-05 ~ 2020-04-12 with updates on features, bug fix and tests, also with jiras help wanted [1], please feel free to pick up.
===================================== Help wanted [Writer Core] Remove Rolling Stat management from Hudi Writer [2] [Code Cleanup] Introduce abstraction for writing and reading and compacting from FileGroups [3] [Code Cleanup] Abstract/Refactor all transaction management logic into a set of classes [4] [Index] Introduce ability to compress bloom filters while storing in parquet [5] [DeltaStreamer] Implement support for bootstrapping in HoodieDeltaStreamer [6] ===================================== Features [Hive Integration] Stop incremental reader on RO table when there is a pending compaction [7] [FileSystem] Add support of Aliyun object storage service [8] ===================================== Bugs [CLI] Fix can not specify the sparkMaster and code clean for SparkUtil [9] [DeltaStreamer] Add validation to DeltaStreamer to fail fast when filterDupes is enabled on UPSERT mode [10] ===================================== Tests [Tests] Modify Integration test to include incremental queries for MOR tables[11] [Tests] Improve unit test coverage [12] [Tests] Add unit test for FileSystemViewCommand [13] [1] https://jira.apache.org/jira/browse/HUDI-760?jql=project%20%3D%20HUDI%20AND%20labels%20%3D%20help-wanted [2] https://jira.apache.org/jira/browse/HUDI-760 [3] https://jira.apache.org/jira/browse/HUDI-684 [4] https://jira.apache.org/jira/browse/HUDI-677 [5] https://jira.apache.org/jira/browse/HUDI-558 [6] https://jira.apache.org/jira/browse/HUDI-425 [7] https://jira.apache.org/jira/browse/HUDI-687 [8] https://jira.apache.org/jira/browse/HUDI-782 [9] https://jira.apache.org/jira/browse/HUDI-740 [10] https://jira.apache.org/jira/browse/HUDI-738 [11] https://jira.apache.org/jira/browse/HUDI-758 [12] https://jira.apache.org/jira/browse/HUDI-568 [13] https://jira.apache.org/jira/browse/HUDI-700 Best, Leesf
