This is an automated email from the ASF dual-hosted git repository.
blue pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/iceberg.git
The following commit(s) were added to refs/heads/main by this push:
new 3684152113 Docs: Enhance documentation on identifier fields (#9478)
3684152113 is described below
commit 36841521137713d3962bb08a5d5df6490c8ac466
Author: Manu Zhang <[email protected]>
AuthorDate: Wed Jan 17 02:26:22 2024 +0800
Docs: Enhance documentation on identifier fields (#9478)
---
docs/spark-ddl.md | 8 +++++---
1 file changed, 5 insertions(+), 3 deletions(-)
diff --git a/docs/spark-ddl.md b/docs/spark-ddl.md
index 57f279435f..0501bede01 100644
--- a/docs/spark-ddl.md
+++ b/docs/spark-ddl.md
@@ -36,7 +36,7 @@ Spark 3 can create tables in any Iceberg catalog with the clause `USING iceberg`
```sql
CREATE TABLE prod.db.sample (
- id bigint COMMENT 'unique id',
+ id bigint NOT NULL COMMENT 'unique id',
data string)
USING iceberg;
```
@@ -459,7 +459,8 @@ ALTER TABLE prod.db.sample WRITE DISTRIBUTED BY PARTITION LOCALLY ORDERED BY cat
### `ALTER TABLE ... SET IDENTIFIER FIELDS`
-Iceberg supports setting identifier fields to a spec using `SET IDENTIFIER FIELDS`:
+Iceberg supports setting [identifier fields](https://iceberg.apache.org/spec/#identifier-field-ids) to a spec using `SET IDENTIFIER FIELDS`:
+Spark table can support Flink SQL upsert operation if the table has identifier fields.
```sql
ALTER TABLE prod.db.sample SET IDENTIFIER FIELDS id
@@ -468,7 +469,8 @@ ALTER TABLE prod.db.sample SET IDENTIFIER FIELDS id, data
-- multiple columns
```
-identifier fields must be `NOT NULL`, The later `ALTER` statement will overwrite the previous setting.
+Identifier fields must be `NOT NULL` columns when they are created or added.
+The later `ALTER` statement will overwrite the previous setting.
### `ALTER TABLE ... DROP IDENTIFIER FIELDS`