wang-zhun created SPARK-39921: --------------------------------- Summary: SkewJoin--Stream side skew in BroadcastJoin Key: SPARK-39921 URL: https://issues.apache.org/jira/browse/SPARK-39921 Project: Spark Issue Type: Improvement Components: SQL Affects Versions: 3.1.2 Reporter: wang-zhun Fix For: 3.3.0, 3.1.2
There is data skew in BroadcastJoin. Since LocalShuffle cannot cover all cases, we can refer to the solution of SortMergeJoin skew to consider solving the data skew problem in BroadcastJoin. h3. senairo !https://git.n.xiaomi.com/computing/spark/uploads/ee0bf49e1c2a378592c6885c1a37a14c/image.png! h3. !image-2022-07-29-14-52-10-659.png! h3. Effect !image-2022-07-29-14-53-27-021.png! !image-2022-07-29-14-52-58-810.png! -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org