bschell opened a new pull request #1194: [HUDI-326] Add support to delete 
records with only record_key
URL: https://github.com/apache/incubator-hudi/pull/1194
 
 
   Allows HoodieSparkSqlWriter to delete records from a dataframe containing 
only record keys when using global indices because delete does not use 
partition from global indices.
   
   ## What is the purpose of the pull request
   
   This pull request allows Hudi deletes with only record_key for global 
indices. I don't see any way to avoid this because keyGenerators will always 
fail if the partition value field is not included. Having an API with 
delete(List<RecordKey> deleteKeys) will not be sufficient either as any 
keyGenerator transformations on the record_key would not happen then. This 
works around this by allowing the keyGenerators to always pass.
   
   ## Verify this pull request
   
   This pull request has minimal changes. The changes have been manually tested 
and verified. 
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to