
Re: Taskmanagers are quarantined

2017-12-07 Thread T Obi

pendency and just drop in the CDH dependency or > jars. > > Stephan > > > On Nov 29, 2017 14:34, "Till Rohrmann" wrote: > > Hi, > > you could also try increasing the heartbeat timeout via > `akka.watch.heartbeat.pause`. Maybe this helps to overcome the GC paus
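
For reference, a minimal sketch of how the heartbeat setting suggested in this message might look in `flink-conf.yaml`. The value shown is purely illustrative and is not taken from the thread; it assumes the `<number> s` duration format used by Flink 1.3.

```yaml
# flink-conf.yaml (illustrative value; an assumption, not from the thread)
# Acceptable heartbeat pause for Akka's death watch before a remote
# taskmanager is considered unreachable. It should be noticeably larger
# than akka.watch.heartbeat.interval and than the longest expected GC pause.
akka.watch.heartbeat.pause: 120 s
```

A longer pause only hides long GC stops from the failure detector; it does not shorten them, so it is usually combined with GC tuning.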

Re: Taskmanagers are quarantined

2017-11-29 Thread T Obi

ime. I will try running several taskmanagers on each machine, dividing the memory among them. I will also tune the JVM memory parameters to reduce the frequency of "Full GC (Metadata GC Threshold)". Best, Tetsuya 2017-11-28 16:30 GMT+09:00 T Obi : > Hello Chesnay, > > Thank you for answ
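
As a rough sketch of the kind of JVM tuning described here, the fragment below raises the initial Metaspace threshold through `flink-conf.yaml`. The option values are hypothetical and not from the thread; suitable sizes depend on the jobs and on the memory available per taskmanager.

```yaml
# flink-conf.yaml (hypothetical values; assumptions, not from the thread)
# A "Full GC (Metadata GC Threshold)" is triggered when Metaspace usage
# reaches the current high-water mark. A larger initial MetaspaceSize
# raises that mark, so these collections should occur less often.
env.java.opts: -XX:MetaspaceSize=256m

# Smaller heap per taskmanager, for running several taskmanagers on one machine.
taskmanager.heap.mb: 8192
```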

Re: Taskmanagers are quarantined

2017-11-27 Thread T Obi

m the stack-trace it appears that multiple hdfs nodes are being > corrupted. > The taskmanagers time out because the connection to zookeeper breaks down, > at which point they no longer know who the leading jobmanager is and > subsequently shut down. > > > On 27.11.2017 08:02,

Taskmanagers are quarantined

2017-11-26 Thread T Obi

Hello all, We run jobs on a standalone cluster with Flink 1.3.2 and we're facing a problem. Suddenly, the connection between a taskmanager and the jobmanager times out and the taskmanager is "quarantined" by the jobmanager. Once a taskmanager is quarantined, the jobs are of course restarted, but the timeo