Susan X. Huynh created SPARK-21419:
--------------------------------------

             Summary: Support Mesos failover_timeout in driver (Mesos cluster 
mode)
                 Key: SPARK-21419
                 URL: https://issues.apache.org/jira/browse/SPARK-21419
             Project: Spark
          Issue Type: Improvement
          Components: Mesos
    Affects Versions: 2.2.0
            Reporter: Susan X. Huynh


In Mesos cluster mode, the driver framework's failover_timeout is currently set 
to zero. This means that if the driver temporarily loses connectivity with the 
master, the driver is considered disconnected, and the master will immediately 
kill all tasks and executors associated with the framework.

To avoid this behavior, I would like to make this failover_timeout 
configurable. A user could then set it to a non-zero value, so that during a 
temporary disconnection the master would wait before tearing down the framework.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to