[ 
https://issues.apache.org/jira/browse/AURORA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14380908#comment-14380908
 ] 

Maxim Khutornenko commented on AURORA-1220:
-------------------------------------------

In addition to the above, it would be great to track a 
{{median_instance_update_time}} metric. This would be the time between 
INSTANCE_UPDATING and INSTANCE_UPDATED instance events and would help job 
owners track their instance downtime during updates.

> Add SLA stats tracking job updates
> ----------------------------------
>
>                 Key: AURORA-1220
>                 URL: https://issues.apache.org/jira/browse/AURORA-1220
>             Project: Aurora
>          Issue Type: Task
>          Components: Scheduler
>            Reporter: Maxim Khutornenko
>
> With the scheduler updater graduating from beta, we should add job update 
> metrics to help job owners track update history/performance, e.g.:
> *Per Job and Per cluster:*
> - updates_created
> - updates_succeeded
> - updates_failed
> - updates_aborted
> - updates_rolled_back
> - time_spent_in_active
> - instances_added
> - instances_removed



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to