[iceberg-docs] branch main updated: Update main specs to 1.2.0 (#216)

jackye Wed, 22 Mar 2023 13:21:16 -0700

This is an automated email from the ASF dual-hosted git repository.

jackye pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/iceberg-docs.git



The following commit(s) were added to refs/heads/main by this push:
     new 1cb7e113 Update main specs to 1.2.0 (#216)
1cb7e113 is described below

commit 1cb7e1135c24672fd5be30227df669a88852f975
Author: Jack Ye <[email protected]>
AuthorDate: Wed Mar 22 13:20:24 2023 -0700

    Update main specs to 1.2.0 (#216)
---
 landing-page/content/common/spec.md      | 10 +++++-----
 landing-page/content/common/view-spec.md | 13 +++++--------
 2 files changed, 10 insertions(+), 13 deletions(-)

diff --git a/landing-page/content/common/spec.md 
b/landing-page/content/common/spec.md
index 56abfc6a..58cfc229 100644
--- a/landing-page/content/common/spec.md
+++ b/landing-page/content/common/spec.md
@@ -41,7 +41,7 @@ All version 1 data and metadata files are valid after 
upgrading a table to versi
 
 Version 2 of the Iceberg spec adds row-level updates and deletes for analytic 
tables with immutable files.
 
-The primary change in version 2 adds delete files to encode that rows that are 
deleted in existing data files. This version can be used to delete or replace 
individual rows in immutable data files without rewriting the files.
+The primary change in version 2 adds delete files to encode rows that are 
deleted in existing data files. This version can be used to delete or replace 
individual rows in immutable data files without rewriting the files.
 
 In addition to row-level deletes, version 2 makes some requirements stricter 
for writers. The full set of changes are listed in [Appendix E](#version-2).
 
@@ -581,7 +581,7 @@ Scan predicates are also used to filter data and delete 
files using column bound
 
 Data files that match the query filter must be read by the scan. 
 
-Note that for any snapshot, all file paths marked with "ADDED" or "EXISTING" 
may appear at most once across all manifest files in the snapshot. If a file 
path appears more then once, the results of the scan are undefined. Reader 
implementations may raise an error in this case, but are not required to do so.
+Note that for any snapshot, all file paths marked with "ADDED" or "EXISTING" 
may appear at most once across all manifest files in the snapshot. If a file 
path appears more than once, the results of the scan are undefined. Reader 
implementations may raise an error in this case, but are not required to do so.
 
 
 Delete files that match the query filter must be applied to data files at read 
time, limited by the scope of the delete file using the following rules.
@@ -684,7 +684,7 @@ Statistics files metadata within `statistics` table 
metadata field is a struct w
 
 | v1 | v2 | Field name | Type | Description |
 |----|----|------------|------|-------------|
-| _required_ | _required_ | **`snapshot-id`** | `string` | ID of the Iceberg 
table's snapshot the statistics were computed from. |
+| _required_ | _required_ | **`snapshot-id`** | `string` | ID of the Iceberg 
table's snapshot the statistics file is associated with. |
 | _required_ | _required_ | **`statistics-path`** | `string` | Path of the 
statistics file. See [Puffin file format](../puffin-spec). |
 | _required_ | _required_ | **`file-size-in-bytes`** | `long` | Size of the 
statistics file. |
 | _required_ | _required_ | **`file-footer-size-in-bytes`** | `long` | Total 
size of the statistics file's footer (not the footer payload size). See [Puffin 
file format](../puffin-spec) for footer definition. |
@@ -777,10 +777,10 @@ When the deleted row column is present, its schema may be 
any subset of the tabl
 
 To ensure the accuracy of statistics, all delete entries must include row 
values, or the column must be omitted (this is why the column type is 
`required`).
 
-The rows in the delete file must be sorted by `file_path` then `position` to 
optimize filtering rows while scanning. 
+The rows in the delete file must be sorted by `file_path` then `pos` to 
optimize filtering rows while scanning. 
 
 *  Sorting by `file_path` allows filter pushdown by file in columnar storage 
formats.
-*  Sorting by `position` allows filtering rows while scanning, to avoid 
keeping deletes in memory.
+*  Sorting by `pos` allows filtering rows while scanning, to avoid keeping 
deletes in memory.
 
 #### Equality Delete Files
 
diff --git a/landing-page/content/common/view-spec.md 
b/landing-page/content/common/view-spec.md
index c2ae5c0c..5490a7b2 100644
--- a/landing-page/content/common/view-spec.md
+++ b/landing-page/content/common/view-spec.md
@@ -145,11 +145,10 @@ The metadata directory contains View Version Metadata 
files. The text after '=>'
   },
   "versions" : [ { => Last few versions of the view.
     "version-id" : 1,
-    "parent-version-id" : -1,
     "timestamp-ms" : 1573518431292,
     "summary" : {
       "operation" : "create", => View operation that caused this metadata to 
be created
-      "engineVersion" : "presto-350", => Version of the engine that performed 
the operation (create / replace)
+      "engineVersion" : "presto-350" => Version of the engine that performed 
the operation (create / replace)
     },
     "representations" : [ { => SQL metadata of the view
       "type" : "sql",
@@ -158,7 +157,7 @@ The metadata directory contains View Version Metadata 
files. The text after '=>'
       "schema-id" : 1,
       "default-catalog" : "iceberg",
       "default-namespace" : [ "anorwood" ]
-    } ],
+    } ]
   } ],
   "version-log" : [ { => Log of the created versions
     "timestamp-ms" : 1573518431292,
@@ -197,11 +196,10 @@ The Iceberg / view library creates a new metadata JSON 
file every time the view
   },
   "versions" : [ {
     "version-id" : 1,
-    "parent-version-id" : -1,
     "timestamp-ms" : 1573518431292,
     "summary" : {
       "operation" : "create",
-      "engineVersion" : "presto-350",
+      "engineVersion" : "presto-350"
     },
     "representations" : [ {
       "type" : "sql",
@@ -214,11 +212,10 @@ The Iceberg / view library creates a new metadata JSON 
file every time the view
     "properties" : { }
   }, {
     "version-id" : 2,
-    "parent-version-id" : 1, => Version 2 was created on top of version 1, 
making parent-version-id 1
     "timestamp-ms" : 1573518440265,
     "summary" : {
       "operation" : "replace", => The ‘replace’ operation caused this latest 
version creation
-      "engineVersion" : "spark-2.4.4",
+      "engineVersion" : "spark-2.4.4"
     },
     "representations" : [ {
       "type" : "sql",
@@ -227,7 +224,7 @@ The Iceberg / view library creates a new metadata JSON file 
every time the view
       "schema-id" : 2,
       "default-catalog" : "iceberg",
       "default-namespace" : [ "anorwood" ]
-    },
+    } ]
   } ],
   "version-log" : [ {
     "timestamp-ms" : 1573518431292,

[iceberg-docs] branch main updated: Update main specs to 1.2.0 (#216)

Reply via email to