ueshin commented on code in PR #47349:
URL: https://github.com/apache/spark/pull/47349#discussion_r1678189190


##########
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala:
##########
@@ -641,14 +641,19 @@ object SQLConf {
     .checkValue(_ > 0, "The value of spark.sql.leafNodeDefaultParallelism must 
be positive.")
     .createOptional
 
+  // Lazily get the number of cores to make sure SparkContext created first.
+  private lazy val defaultShufflePartition: Option[Int] = 
SparkContext.getActive.flatMap { sc =>
+    if (sc.isLocal) Some(SparkContext.numDriverCores(sc.master)) else None
+  }
+
   val SHUFFLE_PARTITIONS = buildConf("spark.sql.shuffle.partitions")
     .doc("The default number of partitions to use when shuffling data for 
joins or aggregations. " +
       "Note: For structured streaming, this configuration cannot be changed 
between query " +
       "restarts from the same checkpoint location.")
     .version("1.1.0")
     .intConf
     .checkValue(_ > 0, "The value of spark.sql.shuffle.partitions must be 
positive")
-    .createWithDefault(200)
+    .createWithDefaultFunction(() => defaultShufflePartition.getOrElse(200))

Review Comment:
   Will this change also affect Spark Connect local mode?
   
   E.g.,
   
   ```
   $ ./bin/pyspark --remote local
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to