Behavior for flink job running on K8S failed after restart strategy exhausted

Eleanore Jin Fri, 31 Jul 2020 09:21:44 -0700

Hi Experts,

I have a flink cluster (per job mode) running on kubernetes. The job is
configured with restart strategy


restart-strategy.fixed-delay.attempts: 3restart-strategy.fixed-delay.delay: 10 s


So after 3 times retry, the job will be marked as FAILED, hence the pods
are not running. However, kubernetes will then restart the job again as the
available replicas do not match the desired one.

I wonder what are the suggestions for such a scenario? How should I
configure the flink job running on k8s?

Thanks a lot!
Eleanore

Behavior for flink job running on K8S failed after restart strategy exhausted

Reply via email to