Bill Farner created AURORA-99:
---------------------------------
Summary: Do not queue task for local gc if it started after the
AdjustRetainedTask message was sent
Key: AURORA-99
URL: https://issues.apache.org/jira/browse/AURORA-99
Project: Aurora
Issue Type: Task
Components: Executor
Reporter: Bill Farner
Priority: Minor
Quoting [~wickman]:
Hmm, reading the logic in reconcile_states:
{noformat}
Local vs Scheduler => Action
===================================
ACTIVE ACTIVE => no-op
ACTIVE STARTING => no-op
ACTIVE TERMINAL => maybe kill task*
ACTIVE !EXISTS => maybe kill task*
TERMINAL ACTIVE => send actual status**
TERMINAL STARTING => send actual status**
TERMINAL TERMINAL => no-op
TERMINAL !EXISTS => gc locally
!EXISTS ACTIVE => send LOST**
!EXISTS STARTING => no-op
!EXISTS TERMINAL => gc remotely
{noformat}
What happened (I believe) is that when the AdjustRetainedTasks was sent, the
task did not exist, but by the time that the GC was run, the task had run to
completion locally.
We should probably not queue anything for local deletion if its start time
occurred after the AdjustRetainedTasks was sent.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)