KnightChess created HUDI-7957:
---------------------------------

             Summary: data skew when writing with bulk_insert + bucket_index enabled
                 Key: HUDI-7957
                 URL: https://issues.apache.org/jira/browse/HUDI-7957
             Project: Apache Hudi
          Issue Type: Improvement
          Components: spark-sql
            Reporter: KnightChess
            Assignee: KnightChess

As described in [https://github.com/apache/hudi/issues/11565], when bulk insert as row is used on a table with the bucket index, the written data is skewed because of the partitioner algorithm.

--
This message was sent by Atlassian Jira (v8.20.10#820010)