[ https://issues.apache.org/jira/browse/MESOS-7815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Benjamin Mahler updated MESOS-7815: ----------------------------------- Labels: mesosphere metrics observability (was: mesosphere metrics reliability) > Add gauge for master event processing time > ------------------------------------------ > > Key: MESOS-7815 > URL: https://issues.apache.org/jira/browse/MESOS-7815 > Project: Mesos > Issue Type: Improvement > Components: master > Reporter: Benjamin Bannier > Labels: mesosphere, metrics, observability > > To diagnose cases where e.g., the master is backlogged, looking at just > {{event_queue_messages}} will only tell about the size of the queue, but > diagnosing whether this is due to higher message arrival rate or slower > processing requires complicated interference with other metrics. > We should provide metrics to characterize the time it takes to process > messages in the queue, optimally with statistics over some window. This would > allow better identification of slow requests. > We should also consider ways to characterizing the arrival rate via some > metric with statistics. -- This message was sent by Atlassian JIRA (v6.4.14#64029)