(spark) branch master updated: [SPARK-46590][SQL][FOLLOWUP] Update CoalesceShufflePartitions comments

wenchen Tue, 23 Jan 2024 23:07:14 -0800

This is an automated email from the ASF dual-hosted git repository.

wenchen pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git



The following commit(s) were added to refs/heads/master by this push:
     new fe4f8eac3efe [SPARK-46590][SQL][FOLLOWUP] Update 
CoalesceShufflePartitions comments
fe4f8eac3efe is described below

commit fe4f8eac3efee42d53f7f24763a59c82ef03d343
Author: zml1206 <zhuml1...@gmail.com>
AuthorDate: Wed Jan 24 15:06:55 2024 +0800

    [SPARK-46590][SQL][FOLLOWUP] Update CoalesceShufflePartitions comments
    
    ### What changes were proposed in this pull request?
    After #44661 ,In addition to Union, children of CartesianProduct, 
BroadcastHashJoin and BroadcastNestedLoopJoin can also be coalesced 
independently, update comments.
    
    ### Why are the changes needed?
    Improve the readability and maintainability.
    
    ### Does this PR introduce _any_ user-facing change?
    No.
    
    ### How was this patch tested?
    N/A
    
    ### Was this patch authored or co-authored using generative AI tooling?
    No.
    
    Closes #44854 from zml1206/SPARK-46590-FOLLOWUP.
    
    Authored-by: zml1206 <zhuml1...@gmail.com>
    Signed-off-by: Wenchen Fan <wenc...@databricks.com>
---
 .../sql/execution/adaptive/CoalesceShufflePartitions.scala    | 11 ++++++-----
 1 file changed, 6 insertions(+), 5 deletions(-)

diff --git 
a/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CoalesceShufflePartitions.scala
 
b/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CoalesceShufflePartitions.scala
index 26e5ac649dbb..db4a6b7dcf2e 100644
--- 
a/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CoalesceShufflePartitions.scala
+++ 
b/sql/core/src/main/scala/org/apache/spark/sql/execution/adaptive/CoalesceShufflePartitions.scala
@@ -66,9 +66,9 @@ case class CoalesceShufflePartitions(session: SparkSession) 
extends AQEShuffleRe
       }
     }
 
-    // Sub-plans under the Union operator can be coalesced independently, so 
we can divide them
-    // into independent "coalesce groups", and all shuffle stages within each 
group have to be
-    // coalesced together.
+    // Sub-plans under the 
Union/CartesianProduct/BroadcastHashJoin/BroadcastNestedLoopJoin
+    // operator can be coalesced independently, so we can divide them into 
independent
+    // "coalesce groups", and all shuffle stages within each group have to be 
coalesced together.
     val coalesceGroups = collectCoalesceGroups(plan)
 
     // Divide minimum task parallelism among coalesce groups according to 
their data sizes.
@@ -137,8 +137,9 @@ case class CoalesceShufflePartitions(session: SparkSession) 
extends AQEShuffleRe
   }
 
   /**
-   * Gather all coalesce-able groups such that the shuffle stages in each 
child of a Union operator
-   * are in their independent groups if:
+   * Gather all coalesce-able groups such that the shuffle stages in each 
child of a
+   * Union/CartesianProduct/BroadcastHashJoin/BroadcastNestedLoopJoin operator 
are in their
+   * independent groups if:
    * 1) all leaf nodes of this child are exchange stages; and
    * 2) all these shuffle stages support coalescing.
    */


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

(spark) branch master updated: [SPARK-46590][SQL][FOLLOWUP] Update CoalesceShufflePartitions comments

Reply via email to