[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2018-01-30 Thread Greg Mann (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16345529#comment-16345529
 ] 

Greg Mann commented on MESOS-3022:
--

Since this is a long outstanding issue related exclusively to the 
SchedulerDriver, I'm going to close the ticket. If we add more metrics for the 
scheduler API, we should do so with the intention of accommodating both the 
unversioned and the V1 scheduler APIs.

> export additional metrics from scheduler driver
> ---
>
> Key: MESOS-3022
> URL: https://issues.apache.org/jira/browse/MESOS-3022
> Project: Mesos
>  Issue Type: Improvement
>Reporter: David Robinson
>Assignee: James Wang
>Priority: Minor
>
> The scheduler driver only exports the metrics below, but ideally it would 
> export its version and a count of messages by message type.
> {code}
> $ curl -s localhost:20902/metrics/snapshot | python -m json.tool
> {
> "scheduler/event_queue_dispatches": 0,
> "scheduler/event_queue_messages": 0,
> "system/cpus_total": 24,
> "system/load_15min": 0.49,
> "system/load_1min": 0.36,
> "system/load_5min": 0.46,
> "system/mem_free_bytes": 269713408,
> "system/mem_total_bytes": 33529266176
> }
> {code}
> The scheduler driver version could be used during troubleshooting to identify 
> frameworks that are using an old, potentially backwards incompatible, 
> scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, 
> so it still links against an old incompatible libmesos).
> A count of messages by message type would help identify a problem w/ a 
> specific feature, eg task reconciliation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-10-14 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14958379#comment-14958379
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

[~bmahler], based on my investigation, the following per-type message counters 
should be added to the scheduler metrics:
{code}
process::metrics::Counter messages_registered_framework;
process::metrics::Counter messages_reregistered_framework;
process::metrics::Counter messages_resource_offers;
process::metrics::Counter messages_rescind_offer;
process::metrics::Counter messages_status_update;
process::metrics::Counter messages_executor_to_framework;
process::metrics::Counter messages_slave_lost;
process::metrics::Counter messages_framework_error_messages;
{code}

If I have missed anything, could you let me know?
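
One way to picture the per-type counters listed above is the Python sketch below. It is not the libprocess `process::metrics::Counter` code from the patch: the `MessageCounters` class and the `scheduler/` key prefix are assumptions made for illustration, and only the counter names come from the list in this comment.

```python
from collections import Counter

# Counter names taken from the list above; everything else is hypothetical.
MESSAGE_TYPES = (
    "messages_registered_framework",
    "messages_reregistered_framework",
    "messages_resource_offers",
    "messages_rescind_offer",
    "messages_status_update",
    "messages_executor_to_framework",
    "messages_slave_lost",
    "messages_framework_error_messages",
)


class MessageCounters:
    """Stand-in for a set of monotonically increasing per-type counters."""

    def __init__(self):
        # Start every known counter at zero so it always appears in snapshots.
        self._counts = Counter({name: 0 for name in MESSAGE_TYPES})

    def record(self, message_type):
        # Reject unknown types so a typo does not silently create a metric.
        if message_type not in MESSAGE_TYPES:
            raise KeyError(message_type)
        self._counts[message_type] += 1

    def snapshot(self):
        # Assumed naming: prefix each counter with "scheduler/".
        return {
            "scheduler/" + name: count
            for name, count in sorted(self._counts.items())
        }


counters = MessageCounters()
counters.record("messages_status_update")
counters.record("messages_status_update")
counters.record("messages_resource_offers")
snapshot = counters.snapshot()
```

Keeping one counter per message type means that a spike or a flat line in a single key, for example the status update counter, points at the feature that is misbehaving.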

> export additional metrics from scheduler driver
> ---
>
> Key: MESOS-3022
> URL: https://issues.apache.org/jira/browse/MESOS-3022
> Project: Mesos
>  Issue Type: Improvement
>Reporter: David Robinson
>Assignee: Yong Qiao Wang
>Priority: Minor
>
> The scheduler driver only exports the metrics below, but ideally it would 
> export its version and a count of messages by message type.
> {code}
> $ curl -s localhost:20902/metrics/snapshot | python -m json.tool
> {
> "scheduler/event_queue_dispatches": 0,
> "scheduler/event_queue_messages": 0,
> "system/cpus_total": 24,
> "system/load_15min": 0.49,
> "system/load_1min": 0.36,
> "system/load_5min": 0.46,
> "system/mem_free_bytes": 269713408,
> "system/mem_total_bytes": 33529266176
> }
> {code}
> The scheduler driver version could be used during troubleshooting to identify 
> frameworks that are using an old, potentially backwards incompatible, 
> scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, 
> so it still links against an old incompatible libmesos).
> A count of messages by message type would help identify a problem w/ a 
> specific feature, eg task reconciliation.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-10-14 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14958207#comment-14958207
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

Thanks [~bmahler], I understand your comments and will update my patch to add 
metrics for messages by type. 

In addition, the current patch adds the other events (exiteds, https, 
terminates) to the metrics. I think we should keep that change. Do you agree?



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-29 Thread Benjamin Mahler (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14936056#comment-14936056
 ] 

Benjamin Mahler commented on MESOS-3022:


We want to have metrics for each message that gets sent to the driver; the 
current patch exposes only event queue counts. Here's an example of where the 
master keeps metrics for the different kinds of messages coming from the slaves:

https://github.com/apache/mesos/blob/0.25.0-rc1/src/master/metrics.hpp#L140



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-23 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14905872#comment-14905872
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

Hi [~bmahler], the messages by type have been added to my patch. Could you give 
it another review and let me know of any concerns? Thanks!



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-18 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14805145#comment-14805145
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

Thanks [~bmahler]. I know version information will be available through 
MESOS-1841, so I only add messages by type in this patch. I'm sorry, but I don't 
quite follow you; could you give me detailed comments based on the code changes 
in the RR? 



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-14 Thread Benjamin Mahler (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14744556#comment-14744556
 ] 

Benjamin Mahler commented on MESOS-3022:


This is about adding version information, which will be available through 
MESOS-1841. As for messages by type, the current patch you've posted does not 
seem to add them?



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-08 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734827#comment-14734827
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

Hi [~benjaminhindman], [~jieyu] and [~vinodkone], could you help review this 
patch? 



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-05 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14732139#comment-14732139
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

Hi [~benjaminhindman], [~jieyu] and [~vinodkone], could you help review this 
patch? Thanks in advance!



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-05 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14731964#comment-14731964
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

Adding the related review request: https://reviews.apache.org/r/38145/



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-05 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14731932#comment-14731932
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

[~klausma1982] and [~haosd...@gmail.com],  Thank you so much for your important 
comments.

So this ticket only needs to add a count of messages by message type; the 
version information can be obtained from the HTTP endpoint once MESOS-1841 is 
applied. 



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-04 Thread haosdent (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14730831#comment-14730831
 ] 

haosdent commented on MESOS-3022:
-

Oh, the version information can be obtained from the HTTP endpoint once 
MESOS-1841 is applied.



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-04 Thread Klaus Ma (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14730825#comment-14730825
 ] 

Klaus Ma commented on MESOS-3022:
-

The scheduler version is necessary according to the description quoted below, 
but we do not need to build it from scratch: MESOS-1841 has a patch for the 
master, and we can add the scheduler's version as well. 

CC [~haosd...@gmail.com], [~marco-mesos] to align the solution.

{quote}
The scheduler driver version could be used during troubleshooting to identify 
frameworks that are using an old, potentially backwards incompatible, scheduler 
driver (eg, a framework hasn't been restarted after a Mesos deploy, so it still 
links against an old incompatible libmesos).
{quote}
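
The troubleshooting use case quoted above amounts to comparing the driver version each framework reports against the version that is currently deployed. A minimal Python sketch of that comparison, assuming versions are plain dotted strings such as `0.25.0`; the framework names and version values are made up for illustration and are not taken from any real cluster:

```python
def parse_version(version):
    """Turn a dotted version such as "0.25.0" into a tuple of ints."""
    return tuple(int(part) for part in version.split("."))


def outdated_frameworks(driver_versions, deployed_version):
    """Return the names of frameworks whose driver is older than the deploy."""
    deployed = parse_version(deployed_version)
    return sorted(
        name
        for name, version in driver_versions.items()
        if parse_version(version) < deployed
    )


# Hypothetical data: framework name -> scheduler driver version it reports.
reported = {"framework-a": "0.25.0", "framework-b": "0.23.1"}
stale = outdated_frameworks(reported, "0.25.0")
```

Comparing integer tuples rather than raw strings avoids ordering `0.9.0` after `0.25.0`. Versions with suffixes such as `-rc1` would need extra handling that this sketch does not attempt.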



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-03 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14729023#comment-14729023
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

In addition, I think the scheduler version should not be exposed as a metric, 
because it is a static value; maybe we should provide another way for a 
framework to get the version if it is needed.
[~benjaminhindman], any thoughts on this?



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-03 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14728908#comment-14728908
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

According to my investigation, the scheduler has five message types. They 
are:
MessageEvent;
DispatchEvent;
HttpEvent;
ExitedEvent;
TerminateEvent;

So the updated metrics exported by scheduler should be:

$ curl -s localhost:20902/metrics/snapshot | python -m json.tool
{
"scheduler/version": "0.25.0",
"scheduler/event_queue_dispatches": 0,
"scheduler/event_queue_messages": 0,
"scheduler/event_queue_https": 0,
"scheduler/event_queue_exiteds": 0,
"scheduler/event_queue_terminates": 0,
"system/cpus_total": 24,
"system/load_15min": 0.49,
"system/load_1min": 0.36,
"system/load_5min": 0.46,
"system/mem_free_bytes": 269713408,
"system/mem_total_bytes": 33529266176
}

[~benjaminhindman] and [~davidtrobinson], are the exported metrics above what 
you expect? Any comments are welcome. Thanks!
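
To check a snapshot like the one proposed above from a script, the `scheduler/` entries can be separated from the `system/` ones. A small Python sketch, assuming the snapshot has already been fetched as JSON text; the `metrics_with_prefix` helper is hypothetical, and the sample string reuses a few keys from this comment rather than real output:

```python
import json


def metrics_with_prefix(snapshot_text, prefix):
    """Parse a metrics snapshot and keep only keys under the given prefix."""
    snapshot = json.loads(snapshot_text)
    return {
        key: value
        for key, value in snapshot.items()
        if key.startswith(prefix)
    }


# Sample text for illustration only, built from keys shown in this comment.
sample = """
{
  "scheduler/event_queue_dispatches": 0,
  "scheduler/event_queue_messages": 0,
  "system/cpus_total": 24,
  "system/load_1min": 0.36
}
"""
scheduler_metrics = metrics_with_prefix(sample, "scheduler/")
```

Filtering by prefix keeps the check independent of which scheduler keys a given driver version happens to export.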



[jira] [Commented] (MESOS-3022) export additional metrics from scheduler driver

2015-09-02 Thread Yong Qiao Wang (JIRA)

[ 
https://issues.apache.org/jira/browse/MESOS-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14727254#comment-14727254
 ] 

Yong Qiao Wang commented on MESOS-3022:
---

If no one is working on this, I will give it a try. Thanks!

> export additional metrics from scheduler driver
> ---
>
> Key: MESOS-3022
> URL: https://issues.apache.org/jira/browse/MESOS-3022
> Project: Mesos
>  Issue Type: Improvement
>Reporter: David Robinson
>Priority: Minor
>
> The scheduler driver only exports the metrics below, but ideally it would 
> export its version and a count of messages by message type.
> {code}
> $ curl -s localhost:20902/metrics/snapshot | python -m json.tool
> {
> "scheduler/event_queue_dispatches": 0,
> "scheduler/event_queue_messages": 0,
> "system/cpus_total": 24,
> "system/load_15min": 0.49,
> "system/load_1min": 0.36,
> "system/load_5min": 0.46,
> "system/mem_free_bytes": 269713408,
> "system/mem_total_bytes": 33529266176
> }
> {code}
> The scheduler driver version could be used during troubleshooting to identify 
> frameworks that are using an old, potentially backwards incompatible, 
> scheduler driver (eg, a framework hasn't been restarted after a Mesos deploy, 
> so it still links against an old incompatible libmesos).
> A count of messages by message type would help identify a problem w/ a 
> specific feature, eg task reconciliation.


