Bill Farner created AURORA-1149:
-----------------------------------

             Summary: Decide the appropriate way to handle TASK_ERROR
                 Key: AURORA-1149
                 URL: https://issues.apache.org/jira/browse/AURORA-1149
             Project: Aurora
          Issue Type: Story
          Components: Scheduler, Usability
            Reporter: Bill Farner


Handling TASK_ERROR may mean the state machine needs to accommodate this
state.  The outcome should depend on whether we believe TASK_ERROR is truly
permanently fatal, or whether it could represent a transient configuration
error in some component (e.g. scheduler, master).  If we believe TASK_ERROR is
indeed always fatal, this should be a terminal and non-reschedulable state
regardless of whether the task is a service.  However, if it can be transient,
we may still want to try to revive the task, to keep temporary configuration
errors from tearing down services.

Personally, I'm leaning towards the retry case.
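
To make the two options concrete, here is a rough sketch of the decision
described above.  It is not Aurora scheduler code: the names ErrorPolicy and
should_reschedule are hypothetical, and the choice to revive only services
under the transient interpretation is an assumption made for illustration,
since the ticket leaves the handling of non-service tasks open.

```python
from enum import Enum


class ErrorPolicy(Enum):
    """Hypothetical switch between the two interpretations of TASK_ERROR."""
    FATAL = "fatal"          # TASK_ERROR is always permanently fatal
    TRANSIENT = "transient"  # TASK_ERROR may be a temporary configuration error


def should_reschedule(is_service: bool, policy: ErrorPolicy) -> bool:
    """Return True if a task that reported TASK_ERROR should be rescheduled."""
    if policy is ErrorPolicy.FATAL:
        # Terminal and non-reschedulable, regardless of whether it is a service.
        return False
    # Transient interpretation: try to revive the task so a temporary
    # configuration error does not tear down a service.
    # Assumption: non-service tasks are left in the failed state.
    return is_service
```

Under the retry case, should_reschedule(True, ErrorPolicy.TRANSIENT) is True,
while under the fatal interpretation the result is False for every task.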



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
