flashJd opened a new pull request, #9113:
URL: https://github.com/apache/hudi/pull/9113

   ### Change Logs
   
   After upgrading Hudi from 0.12.2 to 0.13.1, I found that Spark lost the 
capability to insert overwrite a partitioned table with dynamic partitions; 
   see https://github.com/apache/hudi/issues/8283#issuecomment-1482745990: 
     This causes serious data problems after upgrading to 0.13.0, because users 
can delete all data by mistake.
   As https://github.com/apache/hudi/pull/7365#issuecomment-1338371540 
mentioned, 
     `insert_overwrite_table` overwrites the entire table, while 
`insert_overwrite_partition` overwrites only the matching partitions.
   Currently, only the static partition syntax gives 
`insert_overwrite_partition` semantics.
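
To make the difference between the two semantics concrete, here is a toy model in plain Python. It is only an illustration and does not use Hudi or Spark: the table is assumed to be a dict from partition path to rows, and the function names and the `dt=...` partition values are hypothetical.

```python
def overwrite_matching_partitions(table, incoming):
    """Toy `insert_overwrite_partition`: replace only the partitions
    that appear in the incoming data; other partitions are kept."""
    result = {part: list(rows) for part, rows in table.items()}
    for part, rows in incoming.items():
        result[part] = list(rows)
    return result


def overwrite_whole_table(table, incoming):
    """Toy `insert_overwrite_table`: the incoming data becomes the
    entire table, so partitions absent from it are dropped."""
    return {part: list(rows) for part, rows in incoming.items()}


# Hypothetical table with two partitions, and a write touching one of them.
existing = {"dt=2023-01-01": ["a"], "dt=2023-01-02": ["b"]}
new_data = {"dt=2023-01-02": ["c"]}

kept = overwrite_matching_partitions(existing, new_data)
lost = overwrite_whole_table(existing, new_data)
```

In this model `kept` still contains `dt=2023-01-01`, while `lost` contains only `dt=2023-01-02`. The second outcome is the accidental data loss described above when a dynamic-partition write is treated as a table overwrite.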
   
   ### Impact
   
   1) Keep insert overwrite semantics forward compatible (restore the dynamic 
partition capability) 
   2) Provide `insert_overwrite_table` semantics for partitioned tables, as 
https://github.com/apache/hudi/pull/7365#issue-1472566386 mentioned, through a 
config (set hoodie.datasource.write.operation = insert_overwrite_table)
   3) Use append mode so that the whole table's data is not deleted, as 
https://github.com/apache/hudi/pull/8076#discussion_r1127283648 mentioned
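
The behavior in points 1) and 2) can be sketched as a small decision function. This is a hypothetical illustration of the intended outcome, not the code in this pull request: the function name and the dict-based session config are assumptions, and only the config key and the two operation values come from Hudi.

```python
OPERATION_KEY = "hoodie.datasource.write.operation"


def resolve_overwrite_operation(session_conf):
    """Pick the write operation for an INSERT OVERWRITE statement.

    Partition-level overwrite is the default; the whole table is
    overwritten only when the user asks for it through the config.
    """
    requested = session_conf.get(OPERATION_KEY, "").strip().lower()
    if requested == "insert_overwrite_table":
        return "insert_overwrite_table"
    return "insert_overwrite"


default_op = resolve_overwrite_operation({})
explicit_op = resolve_overwrite_operation(
    {OPERATION_KEY: "insert_overwrite_table"}
)
```

Under this sketch a plain insert overwrite touches only the matching partitions, and replacing the whole table requires an explicit `set hoodie.datasource.write.operation = insert_overwrite_table`.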
   
   ### Risk level (write none, low, medium or high below)
   
   Medium
   
   ### Documentation Update
   
   N/A
   
   ### Contributor's checklist
   
   - [ ] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [ ] Change Logs and Impact were stated clearly
   - [ ] Adequate tests were added if applicable
   - [ ] CI passed
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org