各位好, 两个流进行interval join,时间窗口是 -23h,+1h,任务可以正常运行23小时左右,之后便报错checkpoint失败,jobmanager log中的报错信息为:
2020-12-10 10:46:51,813 INFO org.apache.flink.runtime.checkpoint. CheckpointCoordinator - Checkpoint 143 of job ee4114a1c5413bd02a68b1165090578e expired before completing. 无其他报错信息,最大checkpoint时间为10min; flink版本:1.9.0 checkpooint配置信息为: env.enableCheckpointing(600000); env.getCheckpointConfig().setCheckpointingMode(CheckpointingMode.AT_LEAST_ONCE); env.getCheckpointConfig().setMinPauseBetweenCheckpoints(1000); env.setStreamTimeCharacteristic(TimeCharacteristic.EventTime); 各位大佬能否给些排查建议呢?