GitHub user mccheah opened a pull request:

    https://github.com/apache/spark/pull/21241

    [SPARK-24135][K8s] Resilience to init-container errors on executors.

    ## What changes were proposed in this pull request?
    
    Spark doesn't attach init-containers. But if a custom webhook or pod
    preset adds init-containers, we need to be resilient to transient
    failures of these containers and to at least retry them.
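
For illustration only, the retry behaviour described above could be sketched as below. This is not the code in the patch: the class name, the `max_failures` parameter, and the choice to keep a single running count are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class InitContainerFailureTracker:
    """Hypothetical helper that counts init-container failures on executors."""

    # Assumed configurable limit on tolerated failures (illustrative name,
    # not a real Spark configuration key).
    max_failures: int
    failures: int = 0

    def record_failure(self) -> bool:
        """Record one init-container failure.

        Returns True while the failure is still within the tolerated limit,
        meaning the executor should be retried, and False once the limit
        has been exceeded.
        """
        self.failures += 1
        return self.failures <= self.max_failures


tracker = InitContainerFailureTracker(max_failures=2)
decisions = [tracker.record_failure() for _ in range(3)]
print(decisions)
```

With a limit of two, the first two failures are treated as transient and retried, and the third is treated as fatal.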
    
    ## How was this patch tested?
    
    Unit tests.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/palantir/spark handle-init-errors

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21241.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21241
    
----
commit 27cfc943e684ac6c20949b300767bb1b29b496e6
Author: mcheah <mcheah@...>
Date:   2018-05-01T16:22:19Z

    [SPARK-24135][K8s] Resilience to init-container errors on executors.
    
    Spark doesn't attach init-containers. But if a custom web hook or pod
    preset adds init-containers, we need to be resilient to transient
    failures of these containers and to at least retry them.

commit c9f7e102dee6ab453f97401d274a05cd23a2c3e2
Author: mcheah <mcheah@...>
Date:   2018-05-04T22:16:41Z

    Make the failure count configurable.

commit 52df0f24d66e97be73d57e7121195170e3b0960b
Author: mcheah <mcheah@...>
Date:   2018-05-04T22:19:53Z

    Fix compilation

----

