GitHub user danny0405 added a comment to the discussion: Make cleaning adaptive 
to workload

Actually a more costly operation of cleaning is the file listing, if we infer 
the files to be cleaned just from the plan, that would be great, for e.g, only 
compaction and clustering yield legacy files, we can check these plan to see 
which files have been replaced?

GitHub link: 
https://github.com/apache/hudi/discussions/13846#discussioncomment-14325122

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to