Re: [PR] [SPARK-46367][SQL] Support narrowing projection of `KeyedPartitioning` in `PartitioningPreservingUnaryExecNode` [spark]

via GitHub Fri, 01 May 2026 21:48:18 -0700


pan3793 commented on code in PR #55519:
URL: https://github.com/apache/spark/pull/55519#discussion_r3176130249



##########
sql/core/src/test/scala/org/apache/spark/sql/connector/KeyGroupedPartitioningSuite.scala:
##########
@@ -3988,4 +3988,161 @@ class KeyGroupedPartitioningSuite extends DistributionAndOrderingSuiteBase with
       }
     }
   }
+

Review Comment:
   Thanks for adding the `bucket` coverage.
   
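   To be clear, this is a separate issue that should be fixed independently: `SHOW CREATE TABLE` renders only one of the two `bucket` transforms (`bucket(4, user_id)` is missing from the output).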
   ```
   spark-sql (default)> create table t1(id int, user_id int, item_id int, dt string) using iceberg partitioned by (bucket(4, user_id), bucket(2, item_id), dt);
   Time taken: 1.397 seconds
   spark-sql (default)> show create table t1;
   CREATE TABLE spark_catalog.default.t1 (
     id INT,
     user_id INT,
     item_id INT,
     dt STRING COLLATE UTF8_BINARY)
   USING iceberg
   PARTITIONED BY (dt)
   CLUSTERED BY (item_id)
   INTO 2 BUCKETS
   LOCATION 'hdfs://hadoop-master1.orb.local:8020/warehouse/t1'
   TBLPROPERTIES (
     'current-snapshot-id' = 'none',
     'format' = 'iceberg/parquet',
     'format-version' = '2',
     'write.parquet.compression-codec' = 'zstd')
   
   Time taken: 0.253 seconds, Fetched 1 row(s)
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

