Hi Lars,
Currently, there is no configuration available to trigger a checkpoint
immediately after the job starts in Flink.
But we can address this issue from multiple perspectives using the
insights provided in this document [1].
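
As a rough illustration of one such angle (this is my own sketch, not
something taken from [1]): you could give the job more headroom while the
state is still being restored by tolerating more consecutive checkpoint
failures and spacing checkpoint attempts further apart. The option keys
below are standard Flink checkpointing options; the values are placeholders
I picked for the example and would need tuning to your state size and
restore time.

```yaml
# Flink configuration fragment (illustrative values, not recommendations).

# How often a checkpoint is triggered.
execution.checkpointing.interval: 5 min

# Number of consecutive checkpoint failures tolerated before the job fails.
# Raising it keeps failures during task initialization from failing the job.
execution.checkpointing.tolerable-failed-checkpoints: 10

# Minimum pause between the end of one checkpoint and the start of the next,
# so failed attempts do not pile up back to back.
execution.checkpointing.min-pause: 1 min
```

Note that this does not stop Flink from attempting checkpoints during
initialization; it only makes it less likely that those attempts exhaust
the failure budget.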

[1] https://nightlies.apache.org/flink/flink-docs-release-1.19/

Hello,
When restarting jobs (e.g. after an upgrade) with "large" state, a task can
take some time to "initialize" (depending on the state size). During this
time, I noticed that Flink attempts to checkpoint. In many cases,
checkpointing will fail repeatedly and cause the job to hit the
tolerable-failed