GitHub user m9a opened a pull request:
https://github.com/apache/mesos/pull/279
WIP: Remove unknown unreachable tasks when agent re-registers.
A RunTask messsage could get dropped for an agent while it's
disconnected from the master and when such an agent goes unreachable
then this dropped task message gets added to the unreachable tasks.
When the agent re-registers, the master sends status updates for the
tasks that the agent reported when re-registering and these tasks are
also removed from the unreachableTasks on the framework but since the
agent doesn't know about the dropped task so it doesn't get removed
from the unreachableTasks leading to a check failure when
this inconsistency is detected during framework removal.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/m9a/mesos bug-fix
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/mesos/pull/279.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #279
commit 76c0c34bb4f676b960532d6d0dd86308cd59b0ad
Author: Megha Sharma <msharma3@...>
Date: 2018-03-16T00:27:06Z
Remove unknown unreachable tasks when agent re-registers.
A RunTask messsage could get dropped for an agent while it's
disconnected from the master and when such an agent goes unreachable
then this dropped task message gets added to the unreachable tasks.
When the agent re-registers, the master sends status updates for the
tasks that the agent reported when re-registering and these tasks are
also removed from the unreachableTasks on the framework but since the
agent doesn't know about the dropped task so it doesn't get removed
from the unreachableTasks leading to a check failure when
this inconsistency is detected during framework removal.
---