Dear community, Nice to share Hudi community weekly update for 2020-03-29 ~ 2020-04-05 with updates on development, features, bug fix and tests.
==================================== Development [Troubleshooting] A discussion about troubleshooting flow, currently we use slack to answer all kinds of questions, however, the questions cann't be indexed by search engines, thus there are some repetitive workload, so we could would like to answer simple questions and move debugging type of questions to GH issues. [1] ===================================== Features [Code Cleanup] Restructure hudi-common and clean up files under util packages [2] [Writer Core] Parallelize getSmallFiles for partitions [3] [Utilities] Add ChainedTransformer [4] [Utilities] kafka connect checkpoint provider [5] [Code Cleanup] Organize Cleaning Action execution into a single package in hudi-client[6] ===================================== Bugs [DeltaStreamer] Not an Avro data file when running HoodieCleanClient.runClean [7] [Writer Core] Fix conversion of Spark array of struct type to Avro schema [8] [Spark Integration] Fix Java Math Exception [9] [Hive Integration] Fixed usage of HiveDriver for DDL statements [10] ===================================== Tests [Tests] Enable testing at debug log level [11] [1] https://lists.apache.org/thread.html/r0de5b576ea3db07e663d76d72196404b65f1624c298a6b335229c05d%40%3Cdev.hudi.apache.org%3E [2] https://jira.apache.org/jira/browse/HUDI-744 [3] https://jira.apache.org/jira/browse/HUDI-724 [4] https://jira.apache.org/jira/browse/HUDI-731 [5] https://jira.apache.org/jira/browse/HUDI-644 [6] https://jira.apache.org/jira/browse/HUDI-756 [7] https://jira.apache.org/jira/browse/HUDI-716 [8] https://jira.apache.org/jira/browse/HUDI-713 [9] https://jira.apache.org/jira/browse/HUDI-742 [10] https://jira.apache.org/jira/browse/HUDI-717 [11] https://jira.apache.org/jira/browse/HUDI-562 Best, Leesf