KnightChess commented on PR #8856: URL: https://github.com/apache/hudi/pull/8856#issuecomment-1641295662
> @KnightChess do you have any number on the performance improvement on updating MDT from this PR? parallelism compute: ```java parallelism = Math.max(Math.min(partitionToAppendedFilesList.size(), recordsGenerationParams.getBloomIndexParallelism()), 1); ``` @yihua in this picture, total file is more than 5000 with on partitions, before it fix, bloom and col stats parallelism is 1, and now, bloom filter is 200, col stat is 10, which from default value ![image](https://github.com/apache/hudi/assets/20125927/ff4cb9e4-d595-4294-83e7-cb42c73c40ff) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org