Meng Zhu created MESOS-9847:
-------------------------------
Summary: Docker executor doesn't wait for status updates to be
ack'd before shutting down.
Key: MESOS-9847
URL: https://issues.apache.org/jira/browse/MESOS-9847
Project: Mesos
Issue Type: Bug
Components: executor
Reporter: Meng Zhu
The docker executor doesn't wait for pending status updates to be acknowledged
before shutting down, instead it sleeps for one second and then terminates:
{noformat}
void _stop()
{
// A hack for now ... but we need to wait until the status update
// is sent to the slave before we shut ourselves down.
// TODO(tnachen): Remove this hack and also the same hack in the
// command executor when we have the new HTTP APIs to wait until
// an ack.
os::sleep(Seconds(1));
driver.get()->stop();
}
{noformat}
This would result in racing between task status update (e.g. TASK_FINISHED) and
executor exit. The latter would lead agent generating a `TASK_FAILED` status
update by itself, leading to the confusing case where the agent handles two
different terminal status updates.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)