Thanks for the clarification, Rong!

As I understand it, the Docker containers monitor the Flink jobs that are running in the YARN cluster. The Flink JMs have HA enabled, so there is a standby JM in case the active JM fails, and if a TM fails, that TM will be redeployed. All good.

My concern is what happens if the YARN master node goes down. Is the YARN cluster running with multiple masters, or do you migrate your jobs to a different cluster in case of failure? If so, is this failover to a different cluster built into AthenaX?

Regards,
Anil