tsreaper opened a new pull request, #4907:
URL: https://github.com/apache/paimon/pull/4907

   ### Purpose
   
   In #4380 we introduced pre-commit compaction for changelog files. Multiple 
changelog files from the same partition are merged into one large file by a 
single worker subtask, which reduces the number of small files.
   
   However, when the number of changelog files to merge is large (even though 
each file is small), the copying process is slow, because opening so many 
files from the filesystem takes a lot of time.
   
   In this PR, we add a thread pool to the worker operator, so that during 
pre-commit compaction of changelogs the bytes are copied by multiple threads, 
which speeds up the process.
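
   The idea can be illustrated with a minimal, self-contained sketch. This is 
not the code from this PR: the class `ParallelChangelogCopy`, the method 
`mergeFiles`, and the use of local `java.nio.file` paths are all hypothetical 
stand-ins chosen for illustration. The sketch assumes each source file is small 
enough to hold in memory, so that the file opens and reads can overlap on the 
pool while the output is still written sequentially in the original order.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical illustration only; names do not come from the Paimon code base.
final class ParallelChangelogCopy {

    // Concatenates many small files into one target file, reading the
    // sources concurrently and writing them out in their original order.
    static void mergeFiles(List<Path> sources, Path target, int threads)
            throws IOException, InterruptedException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit one read task per source file; opening and reading
            // the files is the slow part, so it runs on the pool.
            List<Future<byte[]>> pending = new ArrayList<>();
            for (Path source : sources) {
                Callable<byte[]> readTask = () -> Files.readAllBytes(source);
                pending.add(pool.submit(readTask));
            }
            // Append results in submission order so the merged file has
            // a deterministic layout.
            try (OutputStream out = Files.newOutputStream(target)) {
                for (Future<byte[]> result : pending) {
                    out.write(result.get());
                }
            } catch (ExecutionException e) {
                throw new IOException("Failed to read a source file", e.getCause());
            }
        } finally {
            // Stop the worker threads whether or not the merge succeeded.
            pool.shutdownNow();
        }
    }
}
```

   Calling `ParallelChangelogCopy.mergeFiles(sources, target, 4)` would read 
the sources on four threads. Writing stays single-threaded here because the 
bottleneck described above is opening the many input files, not writing the 
output.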
   
   ### Tests
   
   Existing IT cases should cover this change. This PR also adds a unit test 
for the coordinator operator.
   
   ### API and Format
   
   No format changes.
   
   ### Documentation
   
   The documentation is also updated.
   


