[ https://issues.apache.org/jira/browse/MESOS-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Greg Mann reassigned MESOS-9847:
--------------------------------
Assignee: Andrei Budnik
> Docker executor doesn't wait for status updates to be ack'd before shutting
> down.
> ---------------------------------------------------------------------------------
>
> Key: MESOS-9847
> URL: https://issues.apache.org/jira/browse/MESOS-9847
> Project: Mesos
> Issue Type: Bug
> Components: executor
> Reporter: Meng Zhu
> Assignee: Andrei Budnik
> Priority: Major
> Labels: containerization
>
> The Docker executor doesn't wait for pending status updates to be
> acknowledged before shutting down. Instead, it sleeps for one second and
> then terminates:
> {noformat}
> void _stop()
> {
>   // A hack for now ... but we need to wait until the status update
>   // is sent to the slave before we shut ourselves down.
>   // TODO(tnachen): Remove this hack and also the same hack in the
>   // command executor when we have the new HTTP APIs to wait until
>   // an ack.
>   os::sleep(Seconds(1));
>   driver.get()->stop();
> }
> {noformat}
> This results in a race between the task status update (e.g. TASK_FINISHED)
> and executor exit. The latter would lead the agent to generate a
> `TASK_FAILED` status update by itself, leading to the confusing case where
> the agent handles two different terminal status updates.
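
One possible way to replace the fixed sleep is to track unacknowledged status updates and block shutdown until the set drains or a bounded timeout expires. The following is only a minimal standalone sketch of that idea, not the Mesos implementation: `PendingUpdates`, `sent`, `acknowledged`, and `waitForAcks` are hypothetical names, and update identifiers are assumed to be plain strings.

```cpp
#include <chrono>
#include <condition_variable>
#include <mutex>
#include <set>
#include <string>

// Hypothetical helper (not part of Mesos): tracks status updates that
// have been sent but not yet acknowledged.
class PendingUpdates
{
public:
  // Record an update, identified by an assumed string UUID, as sent.
  void sent(const std::string& uuid)
  {
    std::lock_guard<std::mutex> lock(mutex_);
    pending_.insert(uuid);
  }

  // Record an acknowledgement. Unknown UUIDs are ignored. Waiters are
  // woken once nothing is pending.
  void acknowledged(const std::string& uuid)
  {
    std::lock_guard<std::mutex> lock(mutex_);
    pending_.erase(uuid);
    if (pending_.empty()) {
      drained_.notify_all();
    }
  }

  // Block until every update is acknowledged or `timeout` elapses.
  // Returns true if all updates were acknowledged in time.
  bool waitForAcks(std::chrono::milliseconds timeout)
  {
    std::unique_lock<std::mutex> lock(mutex_);
    return drained_.wait_for(
        lock, timeout, [this] { return pending_.empty(); });
  }

private:
  std::mutex mutex_;
  std::condition_variable drained_;
  std::set<std::string> pending_;
};
```

Under these assumptions, `_stop()` would call something like `waitForAcks(...)` with a bounded timeout in place of `os::sleep(Seconds(1))`, and stop the driver only afterwards. The timeout keeps the executor from hanging indefinitely if the agent never acknowledges.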