Re: [PR] [HUDI-7957] fix data skew when writing with bulk_insert + bucket_inde… [hudi]

via GitHub Tue, 09 Jul 2024 04:58:43 -0700


KnightChess commented on PR #11578:
URL: https://github.com/apache/hudi/pull/11578#issuecomment-2217454238


   Both of these algorithms are better than the original Spark bulk bucket
partitioner algorithm, and I think both can address the skew issue to some
extent. If we want to keep the original Flink implementation, I will modify
the logic and the unit tests, because half of the current unit test scenarios
cannot pass with it. I am inclined towards the current fix. What do you think?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
