Meng Zhu created MESOS-9847:
-------------------------------

             Summary: Docker executor doesn't wait for status updates to be 
ack'd before shutting down.
                 Key: MESOS-9847
                 URL: https://issues.apache.org/jira/browse/MESOS-9847
             Project: Mesos
          Issue Type: Bug
          Components: executor
            Reporter: Meng Zhu


The docker executor doesn't wait for pending status updates to be acknowledged 
before shutting down, instead it sleeps for one second and then terminates:

{noformat}
  void _stop()
  {
    // A hack for now ... but we need to wait until the status update
    // is sent to the slave before we shut ourselves down.
    // TODO(tnachen): Remove this hack and also the same hack in the
    // command executor when we have the new HTTP APIs to wait until
    // an ack.
    os::sleep(Seconds(1));
    driver.get()->stop();
  }
{noformat}

This would result in racing between task status update (e.g. TASK_FINISHED) and 
executor exit. The latter would lead agent generating a `TASK_FAILED` status 
update by itself, leading to the confusing case where the agent handles two 
different terminal status updates.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to