Re: [DISCUSS] FLIP-143: Unified Sink API

Aljoscha Krettek Fri, 18 Sep 2020 02:44:37 -0700

Steven,

we were also wondering if it is a strict requirement that "later"updates to Iceberg subsume earlier updates. In the current version, youonly check whether checkpoint X made it to Iceberg and then discard allcommittable state from Flink state for checkpoints smaller X.

If we go with a (somewhat random) nonce, this would not work. Insteadthe sink would have to check for each set of committables seperately ifthey had already been committed. Do you think this is feasible? Duringnormal operation this set would be very small, it would usually only bethe committables for the last checkpoint. Only when there is an outagewould multiple sets of committables pile up.

We were thinking to extend the GlobalCommitter interface to allow it toreport success or failure and then let the framework retry. I think thisis something that you would need for the Iceberg case. The signaturecould be like this:


CommitStatus commitGlobally(List<Committable>, Nonce)

where CommitStatus could be an enum of SUCCESS, TERMINAL_FAILURE, andRETRY.


Best,
Aljoscha

Re: [DISCUSS] FLIP-143: Unified Sink API

Reply via email to