[GitHub] [hudi] danny0405 commented on issue #5020: [SUPPORT] The cleaning strategy breaks the reader view completeness

2022-05-09 Thread GitBox
danny0405 commented on issue #5020: URL: https://github.com/apache/hudi/issues/5020#issuecomment-1121853297 > @danny0405 : going back to your original example in the description. If a file slice was written in C1, and never updated in any of the future commits, at C10 or C11, even if cleane

[GitHub] [hudi] danny0405 commented on issue #5020: [SUPPORT] The cleaning strategy breaks the reader view completeness

2022-05-09 Thread GitBox
danny0405 commented on issue #5020: URL: https://github.com/apache/hudi/issues/5020#issuecomment-1120787401 > yes, to ease the pressure for file IO and metadata scanning. @danny0405 That is a valid improvement from my side, but it does not solve the problem i mentioned in this issue,

[GitHub] [hudi] danny0405 commented on issue #5020: [SUPPORT] The cleaning strategy breaks the reader view completeness

2022-05-08 Thread GitBox
danny0405 commented on issue #5020: URL: https://github.com/apache/hudi/issues/5020#issuecomment-1120582884 > we have 3 files in requested, inflight and completed states. How about have a mechanism where after a certain threshold in terms of number of commits, we archive or simply delete on

[GitHub] [hudi] danny0405 commented on issue #5020: [SUPPORT] The cleaning strategy breaks the reader view completeness

2022-05-07 Thread GitBox
danny0405 commented on issue #5020: URL: https://github.com/apache/hudi/issues/5020#issuecomment-1120348627 Yes, we may need to work to find a solution in release 0.12.0 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [hudi] danny0405 commented on issue #5020: [SUPPORT] The cleaning strategy breaks the reader view completeness

2022-03-13 Thread GitBox
danny0405 commented on issue #5020: URL: https://github.com/apache/hudi/issues/5020#issuecomment-1066289540 > If we want to realize that when the user reads, the data being read is not clean. We may need to add other third-party components. For example, the zookeeper temporary node. Otherw