Are your containers on separate nodes? Are you running in Kubernetes? Have you
set hard resource limits?
When I’ve run into this issue it’s been because the JobManager was restarted (I
wasn’t running in HA mode). Your node could have been restarted or Docker could
have OOM-killed the process.
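
One way to check the Docker OOM-kill theory is to look at the `State.OOMKilled` field that `docker inspect` reports for a container. The snippet below is only a rough sketch: it assumes the Docker CLI is on the PATH, that the stopped container has not been removed yet, and the container name `flink-jobmanager` is a made-up placeholder. It also only covers kills done at the container level, not an OutOfMemoryError inside the JVM.

```python
import json
import subprocess


def oom_killed_from_inspect(inspect_output: str) -> bool:
    """Return True if `docker inspect` JSON output reports an OOM kill."""
    # `docker inspect` prints a JSON array with one object per container.
    containers = json.loads(inspect_output)
    state = containers[0].get("State", {})
    return bool(state.get("OOMKilled", False))


def container_was_oom_killed(container_name: str) -> bool:
    """Run `docker inspect` for one container and check its OOMKilled flag."""
    # Needs the docker CLI and permission to talk to the Docker daemon.
    result = subprocess.run(
        ["docker", "inspect", container_name],
        check=True,
        capture_output=True,
        text=True,
    )
    return oom_killed_from_inspect(result.stdout)


# Usage (hypothetical container name, adjust to your setup):
#   container_was_oom_killed("flink-jobmanager")
```

If this returns True for the JobManager or a TaskManager container, the container hit its memory limit, and raising the limit or lowering the JVM heap would be the next thing to try.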
If a TaskManager ran out of memory there should be something in the
JobManager logs about an unreachable TaskManager.
That said, there should also be something in the JobManager logs about
the job disappearing...
Could you set the logging level to DEBUG, run the job again, and provide
us (or
Hi Chesnay,
I am currently using Flink 1.3 in Docker containers. I am not using it
in HA mode. I have 3 TaskManagers and one JobManager. This happens
randomly, not every time. Does it mean the TaskManager ran out of
memory? I am using more slots than the available cores, I hope
Hello,
could you tell us a bit more about your setup? Which Flink version
you're using, whether HA is enabled, whether this happens every time, etc.
Regards,
Chesnay
On 06.07.2017 21:43, G.S.Vijay Raajaa wrote:
Hi,
I am running the Flink TaskManager and JobManager as Docker containers.
Strangely, I find that jobs disappear from the web portal after some
time. The jobs don't move to the failed state either. Any pointers would be
really helpful. I am not able to get a clue from the logs.
Kindly let me know if I