各位好,
两个流进行interval join,时间窗口是 -23h,+1h,任务可以正常运行23小时左右,之后便报错checkpoint失败,jobmanager
log中的报错信息为:

2020-12-10 10:46:51,813 INFO org.apache.flink.runtime.checkpoint.
CheckpointCoordinator - Checkpoint 143 of job
ee4114a1c5413bd02a68b1165090578e expired before completing.


无其他报错信息,最大checkpoint时间为10min;


flink版本:1.9.0

checkpooint配置信息为:

env.enableCheckpointing(600000);
env.getCheckpointConfig().setCheckpointingMode(CheckpointingMode.AT_LEAST_ONCE);
env.getCheckpointConfig().setMinPauseBetweenCheckpoints(1000);
env.setStreamTimeCharacteristic(TimeCharacteristic.EventTime);


各位大佬能否给些排查建议呢?

回复