Jan Schlicht created MESOS-4673:
-----------------------------------

             Summary: Agent fails to register after re-registering period 
timed-out.
                 Key: MESOS-4673
                 URL: https://issues.apache.org/jira/browse/MESOS-4673
             Project: Mesos
          Issue Type: Bug
            Reporter: Jan Schlicht
            Assignee: Jan Schlicht


Under certain conditions, when a mesos agent looses connection to the master 
for an extended period of time (Say a switch fails), the master will 
de-register the agent, and then when the agent comes back up, refuse to let it 
register: {{Slave asked to shut down by master@10.102.25.1:5050 because 'Slave 
attempted to re-register after removal'}}.

The agent doesn't seem to be able to properly shutdown and remove running tasks 
as it should do to register as a new agent. Hence this message will persist 
until it's resolved by manual intervetion.

This seems to be caused by Docker tasks that couldn't shutdown cleanly when the 
agent is asked to shutdown running tasks to be able to register as a new agent 
with the master.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to