Is the 4th "checkpointed size" and "checkpoint duration" bigger than
others? If it is true, I guess it's related to the flush of rocksdb.
It may delay the next checkpoint.
Best,
Hangxiang.
On Fri, Jun 17, 2022 at 2:31 PM Hangxiang Yu wrote:
> Is the 4th "checkpointed size" and "checkpoint durat
Hi, Jai.
Could you share your configuration about the checkpoint (interval,
min-pause, and so on) and the checkpoint details in the Flink UI ?
I guess the delay of the checkpoint may be related to the last checkpoint
completion time as you could see in the
CheckpointRequestDecider#chooseRequestToE
We've noticed a spike in the start delays in our incremental checkpoints
every 15 minutes. The Flink job seems to start out smooth, with
checkpoints in in the 15s range and negligible start delays. Then every
3rd or 4th checkpoint has a long start delay (~2-3 minutes). Teh
checkpoints in between