[ 
https://issues.apache.org/jira/browse/SPARK-44886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Terry Kim updated SPARK-44886:
------------------------------
    Description: 
This proposes to introduce CLUSTER BY clause to CREATE/REPLACE SQL syntax:
{code:java}
CREATE TABLE tbl(a int, b string) CLUSTER BY (a, b){code}
This doesn't introduce a default implementation for clustering, but it's up to 
the catalog/datasource implementation to utilize the clustering information 
(e.g., Delta, Iceberg, etc.).

  was:
This proposes to introduce CLUSTER BY clause to CREATE/REPLACE SQL syntax:
{code:java}
CREATE TABLE tbl(a int, b string) CLUSTER BY (a, b){code}
There will not be an implementation, but it's up to the catalog implementation 
to utilize the clustering information (e.g., Delta, Iceberg, etc.).

Note that specifying CLUSTER BY will throw an exception if the table being 
created is for v1 source or session catalog (e.g., v2 source w/ session 
catalog).


> Introduce CLUSTER BY SQL clause to CREATE/REPLACE TABLE
> -------------------------------------------------------
>
>                 Key: SPARK-44886
>                 URL: https://issues.apache.org/jira/browse/SPARK-44886
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 4.0.0
>            Reporter: Terry Kim
>            Priority: Major
>
> This proposes to introduce CLUSTER BY clause to CREATE/REPLACE SQL syntax:
> {code:java}
> CREATE TABLE tbl(a int, b string) CLUSTER BY (a, b){code}
> This doesn't introduce a default implementation for clustering, but it's up 
> to the catalog/datasource implementation to utilize the clustering 
> information (e.g., Delta, Iceberg, etc.).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to