Tobias Weingartner created MESOS-1819:
-----------------------------------------

             Summary: Ignore signals during executor critical startup
                 Key: MESOS-1819
                 URL: https://issues.apache.org/jira/browse/MESOS-1819
             Project: Mesos
          Issue Type: Bug
          Components: containerization, isolation, slave
            Reporter: Tobias Weingartner
            Priority: Minor


If the slave receives a SIGTERM between the time that it checkpoints a PID of a 
new task/container, and the time that the container is fully functional, the 
task will end up getting lost upon recovery.

Possibly handle this via either a graceful shutdown hook (via signal handler, 
or possibly web endpoint), or possibly defer signals during the critical 
section.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to