[jira] [Commented] (FLINK-22827) Hive dialect supports CLUSTERED BY clause of CREATE TABLE DDL

Rui Li (Jira) Tue, 01 Jun 2021 05:00:08 -0700


    [ 
https://issues.apache.org/jira/browse/FLINK-22827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355059#comment-17355059
 ]


Rui Li commented on FLINK-22827:
--------------------------------

Hi [~aidenma], currently Flink doesn't support the semantics of bucketed/sorted 
hive tables. So even if the DDL allows you to create such a table, you probably 
won't get the desired behavior when writing to this table with Flink, e.g. the 
data won't be shuffled/sorted according to the bucket/sort key you specified.

> Hive dialect supports CLUSTERED BY clause of CREATE TABLE DDL
> -------------------------------------------------------------
>
>                 Key: FLINK-22827
>                 URL: https://issues.apache.org/jira/browse/FLINK-22827
>             Project: Flink
>          Issue Type: New Feature
>          Components: Connectors / Hive
>    Affects Versions: 1.13.1
>            Reporter: Ma Jun
>            Priority: Major
>
> {code:java}
> # hive syntax:
> CREATE [ EXTERNAL ] TABLE [ IF NOT EXISTS ] table_identifier
>     [ ( col_name1[:] col_type1 [ COMMENT col_comment1 ], ... ) ]
>     [ COMMENT table_comment ]
>     [ PARTITIONED BY ( col_name2[:] col_type2 [ COMMENT col_comment2 ], ... ) 
>         | ( col_name1, col_name2, ... ) ]
>     [ CLUSTERED BY ( col_name1, col_name2, ...) 
>         [ SORTED BY ( col_name1 [ ASC | DESC ], col_name2 [ ASC | DESC ], ... 
> ) ] 
>         INTO num_buckets BUCKETS ]
>     [ ROW FORMAT row_format ]
>     [ STORED AS file_format ]
>     [ LOCATION path ]
>     [ TBLPROPERTIES ( key1=val1, key2=val2, ... ) ]
>     [ AS select_statement ]
> {code}
>  
> {code:java}
> [ CLUSTERED BY ( col_name1, col_name2, ...) [ SORTED BY ( col_name1 [ ASC | 
> DESC ], col_name2 [ ASC | DESC ], ... ) ] 
> {code}
> Will Flink support the way of creating tables and supporting clustered by | 
> sort by into buckets in later versions？



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (FLINK-22827) Hive dialect supports CLUSTERED BY clause of CREATE TABLE DDL

Reply via email to