[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16345529#comment-16345529 ] Greg Mann commented on MESOS-3022: -- Since this is a long outstanding issue related exclusively to the SchedulerDriver, I'm going to close the ticket. If we add more metrics for the scheduler API, we should do so with the intention of accommodating both the unversioned and the V1 scheduler APIs. > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: James Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14958379#comment-14958379 ] Yong Qiao Wang commented on MESOS-3022: --- [~bmahler], Per my investigation, following messages by type should be added into the scheduler metrics: {code} process::metrics::Counter messages_registered_framework; process::metrics::Counter messages_reregistered_framework; process::metrics::Counter messages_resource_offers; process::metrics::Counter messages_rescind_offer; process::metrics::Counter messages_status_update; process::metrics::Counter messages_executor_to_framework; process::metrics::Counter messages_slave_lost; process::metrics::Counter messages_framework_error_messages; {code} If something is missed, could you let me know? > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14958207#comment-14958207 ] Yong Qiao Wang commented on MESOS-3022: --- Thanks [~bmahler], I have understood your comments, I will update my patch to add the metrics of messages by type. In addition, the current patch is to added the other events (exiteds, https, terminates) in the metrics, I think we need to keep that fix. Do you think so? > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14936056#comment-14936056 ] Benjamin Mahler commented on MESOS-3022: We want to have metrics for each message that gets sent to the driver, the current patch exposes which events. Here's an example of where the master keeps metrics for the different kinds of messages coming from the slaves: https://github.com/apache/mesos/blob/0.25.0-rc1/src/master/metrics.hpp#L140 > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14905872#comment-14905872 ] Yong Qiao Wang commented on MESOS-3022: --- Hi [~bmahler], the messages by type have been added in my patch. Could you give ma a further review and let me know your concern. Thanks! > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14805145#comment-14805145 ] Yong Qiao Wang commented on MESOS-3022: --- Thanks [~bmahler], I know version information will be available through MESOS-1841, so I only add messages by type in this patch. I am sorry to can not follow you, can you give me a detailed comments based on the code changes in RR? > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14744556#comment-14744556 ] Benjamin Mahler commented on MESOS-3022: This is about adding version information, which will be available through MESOS-1841, as for messages by type, the current patch you've posted does not add them? > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734827#comment-14734827 ] Yong Qiao Wang commented on MESOS-3022: --- Hi [~benjaminhindman], [~jieyu] and [~vinodkone], Cloud you help to review this patch? > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14732139#comment-14732139 ] Yong Qiao Wang commented on MESOS-3022: --- Hi [~benjaminhindman], [~jieyu] and [~vinodkone], Cloud you help to review this patch? Thanks in advance! > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14731964#comment-14731964 ] Yong Qiao Wang commented on MESOS-3022: --- Append the related review request: https://reviews.apache.org/r/38145/ > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14731932#comment-14731932 ] Yong Qiao Wang commented on MESOS-3022: --- [~klausma1982] and [~haosd...@gmail.com], Thank you so much for your important comments. So in this ticket, it only needs to add a count of messages by message type, and the version information can be get from http endpoint after MESOS-1841 is applied. > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14730831#comment-14730831 ] haosdent commented on MESOS-3022: - Oh, current could get version informations from http endpoint before after MESOS-1841 is applied. > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14730825#comment-14730825 ] Klaus Ma commented on MESOS-3022: - Scheduler version is necessary according to the following description; but we did not need to build it from scratch, MESOS-1841 has a patch for master; we can add scheduler's. CC [~haosd...@gmail.com], [~marco-mesos] to align the solution. {quote} The scheduler driver version could be used during troubleshooting to identify frameworks that are using an old, potentially backwards incompatible, scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, so it still links against an old incompatible libmesos). {quote} > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729023#comment-14729023 ] Yong Qiao Wang commented on MESOS-3022: --- In addition, I think the scheduler version does not be as a metrics, because it is a static value, and maybe we should provide another way to get the version for framework if it is needed. [~benjaminhindman] any thoughts for this? > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14728908#comment-14728908 ] Yong Qiao Wang commented on MESOS-3022: --- According to my investigation, there are five message types for scheduler, they are: MessageEvent; DispatchEvent; HttpEvent; ExitedEvent; TerminateEvent; So the updated metrics exported by scheduler should be: $ curl -s localhost:20902/metrics/snapshot | python -m json.tool { "scheduler/version": 0.25.0, "scheduler/event_queue_dispatches": 0, "scheduler/event_queue_messages": 0, "scheduler/event_queue_https": 0, "scheduler/event_queue_exiteds": 0, "scheduler/event_queue_terminates": 0, "system/cpus_total": 24, "system/load_15min": 0.49, "system/load_1min": 0.36, "system/load_5min": 0.46, "system/mem_free_bytes": 269713408, "system/mem_total_bytes": 33529266176 } [~benjaminhindman] and [~davidtrobinson] , Are above exported metrics expected for us? Welcome any of your comments. Thanks! > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Assignee: Yong Qiao Wang >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver
[ https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14727254#comment-14727254 ] Yong Qiao Wang commented on MESOS-3022: --- If no one working on this, I will have a try. Thanks! > export additional metrics from scheduler driver > --- > > Key: MESOS-3022 > URL: https://issues.apache.org/jira/browse/MESOS-3022 > Project: Mesos > Issue Type: Improvement >Reporter: David Robinson >Priority: Minor > > The scheduler driver only exports the metrics below, but ideally it would > export its version and a count of messages by message type. > {code} > $ curl -s localhost:20902/metrics/snapshot | python -m json.tool > { > "scheduler/event_queue_dispatches": 0, > "scheduler/event_queue_messages": 0, > "system/cpus_total": 24, > "system/load_15min": 0.49, > "system/load_1min": 0.36, > "system/load_5min": 0.46, > "system/mem_free_bytes": 269713408, > "system/mem_total_bytes": 33529266176 > } > {code} > The scheduler driver version could be used during troubleshooting to identify > frameworks that are using an old, potentially backwards incompatible, > scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, > so it still links against an old incompatible libmesos). > A count of messages by message type would help identify a problem w/ a > specific feature, eg task reconciliation. -- This message was sent by Atlassian JIRA (v6.3.4#6332)