GitHub user danny0405 added a comment to the discussion: Make cleaning adaptive to workload
Actually a more costly operation of cleaning is the file listing, if we infer the files to be cleaned just from the plan, that would be great, for e.g, only compaction and clustering yield legacy files, we can check these plan to see which files have been replaced? GitHub link: https://github.com/apache/hudi/discussions/13846#discussioncomment-14325122 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected]
