hudi-bot opened a new issue, #15251:
URL: https://github.com/apache/hudi/issues/15251

   We need to support a new compaction strategy called 
LogFileModTimeBasedCompactionStrategy.
   
   Using this strategy, we want to choose for compaction the file slices with 
the earliest log file modification times. 
   
   This will be similar to LogFileSizeBasedCompactionStrategy, except that 
instead of comparing the total log file size of a given file slice, we will 
compare its earliest log file modification time. 
   
   
   The goal is to compact some part of the whole change set (say, 20%) in one 
batch.
   The compaction plan for the next batch should include incomplete operations 
from previous plans.
   Operations should be processed in order of earliest log file modification 
time.
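
The ordering and batching described above could be sketched roughly as follows. This is a standalone illustration, not Hudi code: `SliceInfo`, `selectBatch`, and the `percent` parameter are hypothetical names, and it assumes the pending list already contains operations left incomplete by earlier plans.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

class ModTimeOrderingSketch {

    // Hypothetical stand-in for a file slice: an id plus the modification
    // times (epoch millis) of its log files. This is not a Hudi class.
    static final class SliceInfo {
        final String fileId;
        final List<Long> logFileModTimes;

        SliceInfo(String fileId, List<Long> logFileModTimes) {
            this.fileId = fileId;
            this.logFileModTimes = logFileModTimes;
        }

        // Earliest log file modification time of this slice.
        // Only called on slices that have at least one log file.
        long earliestModTime() {
            return Collections.min(logFileModTimes);
        }
    }

    // Returns the ids of the slices to compact in this batch: roughly
    // `percent` percent of the eligible slices, oldest log file first.
    static List<String> selectBatch(List<SliceInfo> pending, int percent) {
        List<SliceInfo> eligible = new ArrayList<>();
        for (SliceInfo s : pending) {
            // Slices without log files have nothing to compact.
            if (!s.logFileModTimes.isEmpty()) {
                eligible.add(s);
            }
        }
        eligible.sort(Comparator.comparingLong(SliceInfo::earliestModTime));

        // Integer ceiling of eligible.size() * percent / 100.
        int batchSize = Math.min(eligible.size(), (eligible.size() * percent + 99) / 100);
        List<String> selected = new ArrayList<>();
        for (SliceInfo s : eligible.subList(0, batchSize)) {
            selected.add(s.fileId);
        }
        return selected;
    }

    public static void main(String[] args) {
        List<SliceInfo> pending = Arrays.asList(
            new SliceInfo("a", Arrays.asList(300L, 500L)),
            new SliceInfo("b", Arrays.asList(100L, 900L)),
            new SliceInfo("c", Arrays.asList(200L)));
        System.out.println(selectBatch(pending, 50));
    }
}
```

Because operations left over from an earlier plan keep their old log file modification times, they would sort to the front of the next batch under this ordering, provided they are still part of the pending list.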
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-4313
   - Type: New Feature


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
