This is an automated email from the ASF dual-hosted git repository.

blue pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/iceberg.git


The following commit(s) were added to refs/heads/main by this push:
     new 3684152113 Docs: Enhance documentation on identifier fields (#9478)
3684152113 is described below

commit 36841521137713d3962bb08a5d5df6490c8ac466
Author: Manu Zhang <[email protected]>
AuthorDate: Wed Jan 17 02:26:22 2024 +0800

    Docs: Enhance documentation on identifier fields (#9478)
---
 docs/spark-ddl.md | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/docs/spark-ddl.md b/docs/spark-ddl.md
index 57f279435f..0501bede01 100644
--- a/docs/spark-ddl.md
+++ b/docs/spark-ddl.md
@@ -36,7 +36,7 @@ Spark 3 can create tables in any Iceberg catalog with the 
clause `USING iceberg`
 
 ```sql
 CREATE TABLE prod.db.sample (
-    id bigint COMMENT 'unique id',
+    id bigint NOT NULL COMMENT 'unique id',
     data string)
 USING iceberg;
 ```
@@ -459,7 +459,8 @@ ALTER TABLE prod.db.sample WRITE DISTRIBUTED BY PARTITION 
LOCALLY ORDERED BY cat
 
 ### `ALTER TABLE ... SET IDENTIFIER FIELDS`
 
-Iceberg supports setting identifier fields to a spec using `SET IDENTIFIER 
FIELDS`:
+Iceberg supports setting [identifier 
fields](https://iceberg.apache.org/spec/#identifier-field-ids) to a spec using 
`SET IDENTIFIER FIELDS`:
+Spark table can support Flink SQL upsert operation if the table has identifier 
fields.
 
 ```sql
 ALTER TABLE prod.db.sample SET IDENTIFIER FIELDS id
@@ -468,7 +469,8 @@ ALTER TABLE prod.db.sample SET IDENTIFIER FIELDS id, data
 -- multiple columns
 ```
 
-identifier fields must be `NOT NULL`, The later `ALTER` statement will 
overwrite the previous setting.
+Identifier fields must be `NOT NULL` columns when they are created or added. 
+The later `ALTER` statement will overwrite the previous setting.
 
 ### `ALTER TABLE ... DROP IDENTIFIER FIELDS`
 

Reply via email to