This is a nice doc and it covers many different options. Upon first skim, I don't see a strong argument for particular approach. D
In our own development, we've been leaning heavily towards what you describe in the document as "lazy with SRI". I believe this is consistent with what the Hive community did on top of Orc. It's interesting because my (maybe incorrect) understanding of the Databricks Delta approach is they chose what you title "eager" in their approach to upserts. They may also have a lazy approach for other types of mutations but I don't think they do. Thanks again for putting this together! Jacques -- Jacques Nadeau CTO and Co-Founder, Dremio On Wed, May 8, 2019 at 3:42 AM Anton Okolnychyi <aokolnyc...@apple.com.invalid> wrote: > Hi folks, > > Miguel (cc) and I have spent some time thinking about how to perform > updates/deletes/upserts on top of Iceberg tables. This functionality is > essential for many modern use cases. We've summarized our ideas in a doc > [1], which, hopefully, will trigger a discussion in the community. The > document presents different conceptual approaches alongside their > trade-offs. We will be glad to consider any other ideas as well. > > Thanks, > Anton > > [1] - > https://docs.google.com/document/d/1Pk34C3diOfVCRc-sfxfhXZfzvxwum1Odo-6Jj9mwK38/ > > >