[
https://issues.apache.org/jira/browse/MESOS-8609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16399215#comment-16399215
]
Zhitao Li edited comment on MESOS-8609 at 3/14/18 8:27 PM:
-----------------------------------------------------------
{noformat}
commit 82c50c0e00284c131354499f74176b19d89bd21d (HEAD -> master, origin/master,
origin/HEAD)
Author: Zhitao Li <[email protected]>
Date: Wed Mar 14 09:25:01 2018 -0700
Document new `slave/recovery_time_secs` gauge.
Review: https://reviews.apache.org/r/66070
commit b8526c61403214aaa67fa941b4e8b0fd8e3328f2
Author: Zhitao Li <[email protected]>
Date: Wed Mar 7 15:18:53 2018 -0800
Added a test to make sure `slave/recovery_time_secs` is reported.
Review: https://reviews.apache.org/r/65959
commit 026dafd33cd23d41818e18e31ec271fa2c13abd2
Author: Zhitao Li <[email protected]>
Date: Tue Mar 6 17:43:48 2018 -0800
Added a gauge for how long agent recovery takes.
The new metric `slave/recover_time_secs` can be used to tell us how long
Mesos agent needed to finish its recovery cycle. This is an important
metric on agent machines which have a lot of completed executor
sandboxes.
Note that the metric 1) will only be available after recovery succeeded
and 2) never change its value across agent process lifecycle afterwards.
Review: https://reviews.apache.org/r/65954
{noformat}
was (Author: zhitao):
commit 82c50c0e00284c131354499f74176b19d89bd21d (HEAD -> master, origin/master,
origin/HEAD)
Author: Zhitao Li <[email protected]>
Date: Wed Mar 14 09:25:01 2018 -0700
Document new `slave/recovery_time_secs` gauge.
Review: https://reviews.apache.org/r/66070
commit b8526c61403214aaa67fa941b4e8b0fd8e3328f2
Author: Zhitao Li <[email protected]>
Date: Wed Mar 7 15:18:53 2018 -0800
Added a test to make sure `slave/recovery_time_secs` is reported.
Review: https://reviews.apache.org/r/65959
commit 026dafd33cd23d41818e18e31ec271fa2c13abd2
Author: Zhitao Li <[email protected]>
Date: Tue Mar 6 17:43:48 2018 -0800
Added a gauge for how long agent recovery takes.
The new metric `slave/recover_time_secs` can be used to tell us how long
Mesos agent needed to finish its recovery cycle. This is an important
metric on agent machines which have a lot of completed executor
sandboxes.
Note that the metric 1) will only be available after recovery succeeded
and 2) never change its value across agent process lifecycle afterwards.
Review: https://reviews.apache.org/r/65954
> Create a metric to indicate how long agent takes to recover executors
> ---------------------------------------------------------------------
>
> Key: MESOS-8609
> URL: https://issues.apache.org/jira/browse/MESOS-8609
> Project: Mesos
> Issue Type: Improvement
> Components: agent
> Reporter: Zhitao Li
> Assignee: Zhitao Li
> Priority: Minor
> Labels: Metrics, agent
> Fix For: 1.6.0
>
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)