hudi-bot opened a new issue, #14746: URL: https://github.com/apache/hudi/issues/14746
As of now, the hoodie test suite job supports only inline clustering. We need to add support for async clustering. This might be tricky because regular writes must not overlap with clustering; if they do, the pipeline will fail. So data generation has to go hand in hand with the clustering configs. For example, if clustering is triggered every 4 commits, data generation should switch partitions every 4 batches of input. That way there is no overlap, and the pipeline can run for as many iterations as needed.

## JIRA info

- Link: https://issues.apache.org/jira/browse/HUDI-1590
- Type: Test
- Epic: https://issues.apache.org/jira/browse/HUDI-3303
- Affects version(s):
  - 0.9.0

---

## Comments

08/Mar/21 13:54;legendtkl;Hi, I would like to take this one. I have requested "assign to me", thanks.

---

08/Mar/21 14:48;309637554;[~legendtkl] try it. :D

---

27/Mar/22 14:41;xushiyan;[~legendtkl] did you get a chance to work on this?

---

28/Mar/22 02:25;legendtkl;Hi [~xushiyan], I'm very sorry about this. I'm afraid I don't have much time to work on it. Could you assign it to someone else who is interested? Thanks.

---

28/Mar/22 03:39;xushiyan;[~legendtkl] no worries.
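
---

The partition-switching idea in the issue description could be sketched roughly as follows. This is only an illustration, not code from the test suite: `partition_for_batch`, `plan_batches`, and the `partition_` prefix are hypothetical names, and `commits_per_clustering` is assumed to mirror whatever interval the clustering config uses (for instance the value given to `hoodie.clustering.async.max.commits`). It also assumes each input batch produces exactly one commit.

```python
from typing import List


def partition_for_batch(batch_index: int,
                        commits_per_clustering: int = 4,
                        prefix: str = "partition_") -> str:
    """Return the (hypothetical) partition path for a given input batch.

    Batches are grouped into windows of `commits_per_clustering`; every
    batch in a window writes to the same partition, and the next window
    moves on to a fresh partition.
    """
    if batch_index < 0:
        raise ValueError("batch_index must be non-negative")
    if commits_per_clustering <= 0:
        raise ValueError("commits_per_clustering must be positive")
    window = batch_index // commits_per_clustering
    return f"{prefix}{window}"


def plan_batches(num_batches: int,
                 commits_per_clustering: int = 4) -> List[str]:
    """List the target partition for each batch, in write order."""
    return [partition_for_batch(i, commits_per_clustering)
            for i in range(num_batches)]


if __name__ == "__main__":
    # Show which partition each of the first 9 batches would write to.
    for i, partition in enumerate(plan_batches(9)):
        print(f"batch {i} -> {partition}")
```

With an interval of 4, batches 0 to 3 write to one partition and batch 4 moves to the next, so a clustering plan scheduled after the fourth commit would only cover a partition that later batches no longer touch. Whether that is sufficient depends on how long clustering takes relative to the following window, which this sketch does not model.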
