[ 
https://issues.apache.org/jira/browse/SPARK-39921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wang-zhun updated SPARK-39921:
------------------------------
    Description: 
The stream side of a BroadcastJoin can suffer from data skew. Since LocalShuffle 
cannot cover all cases, we can follow the approach already used to handle skew in 
SortMergeJoin to address data skew in BroadcastJoin.
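
As a rough illustration of the idea, the sketch below applies the skew test that AQE uses for SortMergeJoin (a partition is skewed when it is larger than both the median size times a factor and an absolute threshold) to the stream-side partitions of a broadcast join. The default values mirror the existing settings spark.sql.adaptive.skewJoin.skewedPartitionFactor, spark.sql.adaptive.skewJoin.skewedPartitionThresholdInBytes and spark.sql.adaptive.advisoryPartitionSizeInBytes. The helper names are hypothetical and are not Spark APIs, and splitting evenly by byte size is a simplifying assumption, not the proposed implementation. Because the broadcast side is available in full to every task, each stream-side slice can be joined independently.

```python
from statistics import median

MB = 1024 * 1024

# Defaults mirror Spark's AQE skew-join settings for SortMergeJoin.
SKEW_FACTOR = 5               # skewedPartitionFactor
SKEW_THRESHOLD = 256 * MB     # skewedPartitionThresholdInBytes
ADVISORY_SIZE = 64 * MB       # advisoryPartitionSizeInBytes


def is_skewed(size, median_size, factor=SKEW_FACTOR, threshold=SKEW_THRESHOLD):
    """Same test AQE applies to SortMergeJoin partitions."""
    return size > median_size * factor and size > threshold


def plan_stream_side_splits(partition_sizes, target=ADVISORY_SIZE):
    """Hypothetical helper: number of tasks to use per stream-side partition.

    Non-skewed partitions keep one task. Skewed partitions are cut into
    slices of roughly `target` bytes (assumption: an even split by size).
    """
    med = median(partition_sizes)
    plan = []
    for size in partition_sizes:
        if is_skewed(size, med):
            plan.append(-(-size // target))  # ceiling division
        else:
            plan.append(1)
    return plan
```

For example, with stream-side partition sizes of 100, 120, 90, 2048 and 110 MB, the median is 110 MB, so only the 2048 MB partition passes both checks, and plan_stream_side_splits returns [1, 1, 1, 32, 1].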
h3. Scenario

!1.png!

!2.png!
h3. Effect

!4.png!

!3.png!

  was:
There is data skew in BroadcastJoin. Since LocalShuffle cannot cover all cases, 
we can refer to the solution of SortMergeJoin skew to consider solving the data 
skew problem in BroadcastJoin.
h3. Scenario

!https://git.n.xiaomi.com/computing/spark/uploads/ee0bf49e1c2a378592c6885c1a37a14c/image.png!
h3. !image-2022-07-29-14-52-10-659.png!
h3. Effect

!image-2022-07-29-14-53-27-021.png!

!image-2022-07-29-14-52-58-810.png!


> SkewJoin--Stream side skew in BroadcastJoin
> -------------------------------------------
>
>                 Key: SPARK-39921
>                 URL: https://issues.apache.org/jira/browse/SPARK-39921
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.1.2
>            Reporter: wang-zhun
>            Priority: Major
>             Fix For: 3.1.2, 3.3.0
>
>         Attachments: 1.png, 2.png, 3.png, 4.png
>
>
> The stream side of a BroadcastJoin can suffer from data skew. Since 
> LocalShuffle cannot cover all cases, we can follow the approach already used to 
> handle skew in SortMergeJoin to address data skew in BroadcastJoin.
> h3. Scenario
> !1.png!
> !2.png!
> h3. Effect
> !4.png!
> !3.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
