Susan X. Huynh created SPARK-21419: -------------------------------------- Summary: Support Mesos failover_timeout in driver (Mesos cluster mode) Key: SPARK-21419 URL: https://issues.apache.org/jira/browse/SPARK-21419 Project: Spark Issue Type: Improvement Components: Mesos Affects Versions: 2.2.0 Reporter: Susan X. Huynh
In Mesos cluster mode, the driver framework's failover_timeout is currently set to zero. This means that if the driver temporarily loses connectivity with the master, the driver is considered disconnected, and the master will immediately kill all tasks and executors associated with the framework. To avoid this behavior, I would like to make this failover_timeout configurable. A user could then set it to a non-zero value, so that during a temporary disconnection the master would wait before tearing down the framework. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org