I thought about this some more. One of the important parts of the
Iceberg sink is to know whether we have already committed some
DataFiles. Currently, this is implemented by writing a (JobId,
MaxCheckpointId) tuple to the Iceberg table when committing. When
restoring from a failure we check this and discard committables
(DataFiles) that we know have already been committed.
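To make the discussion concrete, here is a rough sketch of that
de-duplication check, assuming a (JobId, MaxCheckpointId) marker stored
with the table; all names (CommittedMarker, Committable,
discardAlreadyCommitted) are illustrative, not the actual sink code:

```java
import java.util.*;
import java.util.stream.*;

public class CommitDedup {
    // What the table remembers from the last successful commit.
    record CommittedMarker(String jobId, long maxCheckpointId) {}

    // A pending committable (stand-in for a set of DataFiles).
    record Committable(String jobId, long checkpointId, String dataFile) {}

    static List<Committable> discardAlreadyCommitted(
            CommittedMarker marker, List<Committable> restored) {
        return restored.stream()
                // Only filters when the job id matches; if the JobID changed
                // (e.g. stop/start-from-savepoint), everything passes through,
                // which is one of the failure modes described above.
                .filter(c -> !(c.jobId().equals(marker.jobId())
                        && c.checkpointId() <= marker.maxCheckpointId()))
                .collect(Collectors.toList());
    }
}
```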
I think this scheme can have some problems, for example when checkpoint
ids are not strictly sequential, when they wrap around, or when the
JobID changes. The latter will happen when doing a
stop/start-from-savepoint cycle, for example.
I think we could fix this by having Flink provide a nonce to the
GlobalCommitter where Flink guarantees that this nonce is unique and
will not change for repeated invocations of the GlobalCommitter with the
same set of committables. The GlobalCommitter could use this to
determine whether a set of committables has already been committed to
the Iceberg table.
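A sketch of what such a nonce-based commit could look like; the names
(commitWithNonce, the in-memory stand-in for the table) are hypothetical
and not part of any actual Flink or Iceberg API:

```java
import java.util.*;

public class NonceCommitter {
    // Stand-in for the Iceberg table: remembers which nonces were committed.
    // In reality the nonce would be persisted atomically with the snapshot.
    private final Set<String> committedNonces = new HashSet<>();
    private final List<String> committedFiles = new ArrayList<>();

    // Idempotent commit: a repeated invocation with the same nonce is a
    // no-op, independent of job ids or checkpoint id semantics.
    public boolean commitWithNonce(String nonce, List<String> dataFiles) {
        if (committedNonces.contains(nonce)) {
            return false; // this set of committables was already committed
        }
        committedFiles.addAll(dataFiles);
        committedNonces.add(nonce);
        return true;
    }

    public List<String> committedFiles() {
        return committedFiles;
    }
}
```

The key point is that the guarantee ("unique, and stable across repeated
invocations with the same committables") comes from Flink, so the
committer only needs a set-membership check.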
It seems very tailor-made to Iceberg for now, but other systems should
suffer from the same problem.
Best,
Aljoscha