cloud-fan commented on a change in pull request #32875:
URL: https://github.com/apache/spark/pull/32875#discussion_r746380755



##########
File path: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/physical/partitioning.scala
##########
@@ -171,6 +148,27 @@ trait Partitioning {
     required.requiredNumPartitions.forall(_ == numPartitions) && 
satisfies0(required)
   }
 
+  /**
+   * Creates a shuffle spec for this partitioning and its required 
distribution. The
+   * spec is used in the scenario where an operator has multiple children 
(e.g., join), and is
+   * used to decide whether this child is co-partitioned with others, 
therefore whether extra
+   * shuffle shall be introduced.
+   *
+   * @param defaultNumPartitions the default number of partitions to use when 
creating a new

Review comment:
       Why do we need this parameter? I think we need to always provide the 
numPartitions when creating a new partitioning from a shuffle spec.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to