wang-zhun created SPARK-39921:
---------------------------------

             Summary: SkewJoin--Stream side skew in BroadcastJoin
                 Key: SPARK-39921
                 URL: https://issues.apache.org/jira/browse/SPARK-39921
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.1.2
            Reporter: wang-zhun
             Fix For: 3.3.0, 3.1.2


Data skew can occur on the stream side of a BroadcastJoin. Because LocalShuffle 
does not cover all cases, we can borrow the approach already used for 
SortMergeJoin skew handling to address data skew in BroadcastJoin.
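
As a rough illustration of the idea (not the proposed patch), the sketch below applies a SortMergeJoin-style skew check to stream-side partition sizes and decides how many chunks each partition would be split into; every chunk would then be joined against the same broadcast side. The function name plan_stream_side_splits and the byte-based splitting are assumptions made for illustration. The default values mirror the documented defaults of spark.sql.adaptive.skewJoin.skewedPartitionFactor (5), spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes (256 MB) and spark.sql.adaptive.advisoryPartitionSizeInBytes (64 MB).

```python
from statistics import median

MB = 1024 * 1024


def plan_stream_side_splits(partition_sizes, factor=5,
                            threshold=256 * MB, target_size=64 * MB):
    """Return {partition_id: number_of_chunks} for the stream side.

    Hypothetical helper: a partition counts as skewed when it is larger
    than both `factor` times the median size and `threshold` bytes,
    which is the same style of check used for SortMergeJoin skew.
    """
    if not partition_sizes:
        return {}
    med = median(partition_sizes)
    plan = {}
    for pid, size in enumerate(partition_sizes):
        skewed = size > med * factor and size > threshold
        # Skewed partitions are cut into ceil(size / target_size) chunks;
        # all other partitions are left as a single chunk.
        plan[pid] = -(-size // target_size) if skewed else 1
    return plan


# Three small partitions and one 1 GB partition on the stream side.
sizes = [10 * MB, 12 * MB, 11 * MB, 1024 * MB]
plan = plan_stream_side_splits(sizes)
print(plan)  # {0: 1, 1: 1, 2: 1, 3: 16}
```

Because the broadcast side is already available in full on every executor, splitting only the stream side should not require duplicating the other side's partitions the way SortMergeJoin skew handling does; this is an assumption of the sketch, not a statement about the actual implementation.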
h3. Scenario

!https://git.n.xiaomi.com/computing/spark/uploads/ee0bf49e1c2a378592c6885c1a37a14c/image.png!
!image-2022-07-29-14-52-10-659.png!
h3. Effect

!image-2022-07-29-14-53-27-021.png!

!image-2022-07-29-14-52-58-810.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

