[ 
https://issues.apache.org/jira/browse/AURORA-894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14361577#comment-14361577
 ] 

Moses Nakamura commented on AURORA-894:
---------------------------------------

I took a first stab at the python bits here: 
https://reviews.apache.org/r/31104/ but I didn't have time to write tests or do 
any of the java bits.  Let me know if you need any help.

> Server updater should watch healthy instances
> ---------------------------------------------
>
>                 Key: AURORA-894
>                 URL: https://issues.apache.org/jira/browse/AURORA-894
>             Project: Aurora
>          Issue Type: Task
>          Components: Scheduler
>            Reporter: Maxim Khutornenko
>            Assignee: Maxim Khutornenko
>
> Instead of starting the {{minWaitInInstanceRunningMs}} (aka {{watch_secs}}) 
> countdown when an instance reaches RUNNING state, the updater should rely on 
> the first successful health check instead. This will potentially speed up 
> updates as the {{minWaitInInstanceRunningMs}} will no longer have to be 
> chosen based on the worst observed instance startup/warmup delay but rather 
> as a desired health check duration according to the following formula:
> {noformat}
> minWaitInInstanceRunningMs = interval_secs x num_desired_healthchecks x 1000
> {noformat}
> where:
>   {{interval_secs}} - 
> https://github.com/apache/incubator-aurora/blob/master/docs/configuration-reference.md#healthcheckconfig-objects
>   {{num_desired_healthchecks}} - the desired number of OK health checks to 
> observe before declaring an instance updated successfully
>   
> The above would allow every instance to start watching interval depending on 
> the individual instance performance and potentially exit updater earlier. 
> This feature requires AURORA-279.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to