Hi Chen, Right now, the main parts of the v2 spec are the addition of sequence numbers and delete files. We're also making some other requirements more strict, but those are mainly cleaning up problems and not related to row-level deletes.
Upserts would be encoded as a delete and an insert. Deletes are stored in delete files, and inserts are normal data files. Delete files are valid within a partition, and apply to all data files with the same or lower sequence number. I'm planning on updating what's currently in the spec now that we have sequence numbers and delete file metadata committed in master, but right now I'm working on getting the 0.9.0 release out with support for Spark 3. The documentation should be coming in the next couple of weeks. rb On Wed, Jul 1, 2020 at 6:28 AM Chen Song <[email protected]> wrote: > I saw Table Spec V2 > <https://iceberg.apache.org/spec/#version-2-row-level-deletes> was > mentioned in the official iceberg doc. I know it is incomplete and wip. Is > there any to-be-reviewed or proposed version for public view? I am > interested to understand how row level upserts are supported? > > Thanks > -- > Chen Song > > -- Ryan Blue Software Engineer Netflix
