danny0405 commented on issue #5020:
URL: https://github.com/apache/hudi/issues/5020#issuecomment-1121853297
> @danny0405 : going back to your original example in the description. If a
file slice was written in C1, and never updated in any of the future commits,
at C10 or C11, even if cleane
danny0405 commented on issue #5020:
URL: https://github.com/apache/hudi/issues/5020#issuecomment-1120787401
> yes, to ease the pressure for file IO and metadata scanning. @danny0405
That is a valid improvement from my side, but it does not solve the problem
I mentioned in this issue,
danny0405 commented on issue #5020:
URL: https://github.com/apache/hudi/issues/5020#issuecomment-1120582884
> We have 3 files in requested, inflight and completed states. How about
having a mechanism where, after a certain threshold in terms of number of commits,
we archive or simply delete on
danny0405 commented on issue #5020:
URL: https://github.com/apache/hudi/issues/5020#issuecomment-1120348627
Yes, we may need to work on finding a solution in release 0.12.0.
danny0405 commented on issue #5020:
URL: https://github.com/apache/hudi/issues/5020#issuecomment-1066289540
> If we want to guarantee that the data a user is reading is not
cleaned, we may need to add other third-party components, for example a
ZooKeeper ephemeral node. Otherw