peter-toth commented on code in PR #55519:
URL: https://github.com/apache/spark/pull/55519#discussion_r3166930325
##########
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala:
##########
@@ -2142,15 +2142,18 @@ object SQLConf {
.booleanConf
.createWithDefault(false)
- val V2_BUCKETING_ALLOW_JOIN_KEYS_SUBSET_OF_PARTITION_KEYS =
-
buildConf("spark.sql.sources.v2.bucketing.allowJoinKeysSubsetOfPartitionKeys.enabled")
- .doc("Whether to allow storage-partition join in the case where join
keys are " +
- "a subset of the partition keys of the source tables. At planning
time, " +
- "Spark will group the partitions by only those keys that are in the
join keys. " +
+ val V2_BUCKETING_ALLOW_KEYS_SUBSET_OF_PARTITION_KEYS =
+
buildConf("spark.sql.sources.v2.bucketing.allowKeysSubsetOfPartitionKeys.enabled")
+
.withAlternative("spark.sql.sources.v2.bucketing.allowJoinKeysSubsetOfPartitionKeys.enabled")
+ .doc("Whether to allow storage-partitioned operations (joins and
aggregates) in the case " +
+ "where the operation's keys are a subset of the partition keys of the
source tables. At " +
+ "planning time, Spark will group the partitions by only those keys
that are in the " +
+ "operation's keys. " +
s"This is currently enabled only if
${REQUIRE_ALL_CLUSTER_KEYS_FOR_DISTRIBUTION.key} " +
"is false."
)
.version("4.0.0")
Review Comment:
Makes sense, added in
https://github.com/apache/spark/pull/55519/commits/bcf0be11509865da1ff96ea43bb2b066608a8771.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]