Re: RMQSource synchronous message ack

Chesnay Schepler Wed, 06 Mar 2019 04:11:31 -0800

The acknowledgement has to be synchronous since Flink assume that afternotifyCheckpointComplete() all data has been persisted to externalsystems. For example, if record 1 to 100 were passed to the sink and acheckpoint occurs and completed, on restart Flink would continue withrecord 101. But if the sink does not synchronously waits for all updatesto be persisted the checkpoint may finish, and if then send asynchronousupdate (say for record 99) then Flink will _still_ resume from record 101.


On 05.03.2019 15:07, Gabriel Candal wrote:

Hi,
Recently I've opened a Stack Overflow question<https://stackoverflow.com/questions/54909315/why-does-checkpointing-impact-latency-so-much> aboutlatency spikes (~500ms) after a checkpoint operation, even though theoperation itself was relatively fast (~50ms).
I've come to realize that the cause for the latency was that the jobwas waiting for the RMQSource to acknowledgeSessionIDs duringnotifyCheckpointComplete.
I've noticed that the Kafka connectors do the equivalent operation(committing offsets) asynchronously, at least from 09 onwards. Myquestion to you is: can you see any reason why does thisacknowledgement have to synchronous on RabbitMQ?
I believe it should be ok, given that those messages are alreadyreflected in the checkpointed state, but I'm not sure if there are anynegatives consequences correctness-wise.
Thanks,

Re: RMQSource synchronous message ack

Reply via email to