hudi-bot opened a new issue, #14693: URL: https://github.com/apache/hudi/issues/14693
disscuss in https://github.com/apache/hudi/pull/2196#issuecomment-722512937 A. Thanks so much. This pr need to solved the issue with better approach. Now I am more clear about overwrite semantic between table.overwrite and spark sql overwrite for hudi. B. Also spark sql for hudi overwrite should have the ability just like spark sql 、hive 、 delta lake. these engine have three mode for overwrite about partition: 1. Dynamic Partition : delete all partition data ,and the insert the new data for different 2. Static partition: just overwrite the partition which is user specified 3. Mixed partition: mixed of 1 and 2 more detail in : [https://spark.apache.org/docs/3.0.0-preview/sql-ref-syntax-dml-insert-overwrite-table.html] [https://www.programmersought.com/article/47155360487/] Just fyi, in the [RFC|https://cwiki.apache.org/confluence/display/HUDI/RFC+-+18+Insert+Overwrite+API#RFC18InsertOverwriteAPI-API] we discussed having 'insert_overwrite_table' operation to support dynamic partitioning. static partitioning is supported by 'insert_overwrite'. ## JIRA info - Link: https://issues.apache.org/jira/browse/HUDI-1374 - Type: New Feature - Fix version(s): - 0.16.0 - 1.1.0 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
