Aitozi opened a new pull request, #4386:
URL: https://github.com/apache/paimon/pull/4386

   …r strategies.
   
   <!-- Please specify the module before the PR name: [core] ... or [flink] ... 
-->
   
   ### Purpose
   
   <!-- Linking this pull request to the issue -->
   Linked issue: close #xxx
   
   In our company, we have encountered an issue with the HMS partition 
statistic being incorrect. This is because during the writing process, we only 
update the metastore partition when it is first written to.
   Therefore, we would like to implement the PartitionMarkDone strategy to 
update the statistics in HMS after a short idle period for each partition.
   We need a separate configuration for PartitionMarkDone due to differing 
requirements:
   - We do not want to rely on extracting accurate timestamps from partition 
names.
   - We aim to store each partition's last access time in state so that it can 
be reported back to HMS.
   To achieve this, we plan on extending `PartitionMarkDone` functionality to 
support different trigger strategies for partitions. Following this extension, 
we will be able to introduce custom triggers such as 
`PartitionHmsReporterTrigger`.
   
   <!-- What is the purpose of the change -->
   
   ### Tests
   
   <!-- List UT and IT cases to verify this change -->
   
   ### API and Format
   
   <!-- Does this change affect API or storage format -->
   
   ### Documentation
   
   <!-- Does this change introduce a new feature -->
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to