Re: Spike in checkpoint start delay every 15 minutes

2022-06-16 Thread Hangxiang Yu
Are the 4th checkpoint's "checkpointed size" and "checkpoint duration" bigger than the others? If so, I guess it's related to RocksDB flushes, which may delay the next checkpoint. Best, Hangxiang. On Fri, Jun 17, 2022 at 2:31 PM Hangxiang Yu wrote: > Is the 4th "checkpointed size" and "checkpoint durat

Re: Spike in checkpoint start delay every 15 minutes

2022-06-14 Thread Hangxiang Yu
Hi, Jai. Could you share your checkpoint configuration (interval, min-pause, and so on) and the checkpoint details from the Flink UI? I guess the checkpoint delay may be related to the completion time of the last checkpoint, as you can see in the CheckpointRequestDecider#chooseRequestToE
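
The guess in the message above, that the completion time of the last checkpoint together with the min-pause can push back the next trigger, can be illustrated with a toy model. This is only a sketch under stated assumptions, not Flink's actual CheckpointRequestDecider logic: the function name start_delays, the fixed-rate schedule, and all numbers (60s interval, 30s min-pause, one 150s checkpoint) are hypothetical and chosen for illustration.

```python
def start_delays(interval, min_pause, durations):
    """Toy model (an assumption, not Flink code).

    Checkpoint i is scheduled at i * interval, but may not be triggered
    earlier than the previous checkpoint's completion time plus min_pause.
    Returns, per checkpoint, how long its trigger was pushed back.
    """
    delays = []
    last_completion = None
    for i, duration in enumerate(durations):
        scheduled = i * interval
        if last_completion is None:
            actual = scheduled
        else:
            # The previous checkpoint must have finished, and the
            # minimum pause must have elapsed, before the next trigger.
            actual = max(scheduled, last_completion + min_pause)
        delays.append(actual - scheduled)
        last_completion = actual + duration
    return delays


# Hypothetical run: the 4th checkpoint takes 150s instead of 15s,
# so only the checkpoint after it is triggered late.
print(start_delays(60, 30, [15, 15, 15, 150, 15]))  # [0, 0, 0, 0, 120]
```

In this model a single slow checkpoint is enough to delay the one that follows it, which is why the interval, the min-pause, and the per-checkpoint durations from the Flink UI are the first things worth comparing.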

Spike in checkpoint start delay every 15 minutes

2022-06-14 Thread Jai Patel
We've noticed a spike in the start delays in our incremental checkpoints every 15 minutes. The Flink job seems to start out smooth, with checkpoints in the 15s range and negligible start delays. Then every 3rd or 4th checkpoint has a long start delay (~2-3 minutes). The checkpoints in between