This is a nice doc and it covers many different options. Upon first skim, I
don't see a strong argument for particular approach. D

In our own development, we've been leaning heavily towards what you
describe in the document as "lazy with SRI". I believe this is consistent
with what the Hive community did on top of Orc. It's interesting because my
(maybe incorrect) understanding of the Databricks Delta approach is they
chose what you title "eager" in their approach to upserts. They may also
have a lazy approach for other types of mutations but I don't think they do.

Thanks again for putting this together!
Jacques
--
Jacques Nadeau
CTO and Co-Founder, Dremio


On Wed, May 8, 2019 at 3:42 AM Anton Okolnychyi
<aokolnyc...@apple.com.invalid> wrote:

> Hi folks,
>
> Miguel (cc) and I have spent some time thinking about how to perform
> updates/deletes/upserts on top of Iceberg tables. This functionality is
> essential for many modern use cases. We've summarized our ideas in a doc
> [1], which, hopefully, will trigger a discussion in the community. The
> document presents different conceptual approaches alongside their
> trade-offs. We will be glad to consider any other ideas as well.
>
> Thanks,
> Anton
>
> [1] -
> https://docs.google.com/document/d/1Pk34C3diOfVCRc-sfxfhXZfzvxwum1Odo-6Jj9mwK38/
>
>
>

Reply via email to