[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13996002#comment-13996002 ] Otis Gospodnetic commented on MESOS-780: I think if these stats/health APIs are well documented and relatively stable or backwards compatible, then that's all you really need to expose to support 3rd party performance and health monitoring. I poked around http://mesos.apache.org/documentation/latest/ but could not find any mentions of these end points. Could anyone please point me to them? TIA! Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio Assignee: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814190#comment-13814190 ] Dave Lester commented on MESOS-780: --- [~berngp] Are you currently working on this? Would be interesting to gather requirements. Maybe you could ping the user and dev lists to see if anyone else has other tools they'd like to integrate with. Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814199#comment-13814199 ] Bernardo Gomez Palacio commented on MESOS-780: -- [~davelester] yes I am. I think starting with the above four is more than enough for now but will ping both lists. [~t.st.clair] yes it should be pluggable. As of now we are thinking on a push model where the mesos-master, mesos-slave and mesos-local will offer a linux socket server where they will make the metrics available every sec in a JSON format. This could easily enable a pluggable interface to the above four monitoring frameworks. Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814215#comment-13814215 ] Bernardo Gomez Palacio commented on MESOS-780: -- [~joe42] I think this is enough to bootstrap this effort but I think it will be interesting/good if the slaves also reported, without the need of polling them, tasks there are running and the resources each task is consuming. Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814225#comment-13814225 ] Benjamin Mahler commented on MESOS-780: --- What would be the difference between pushing every 3 seconds and polling every 3 seconds? I ask because we have existing endpoints that can be polled for statistics / health / etc. Were the tools you listed above built for push-based models? I do see the advantage of push with respect to only being notified when changes occur, but this may prove quite complicated to implement and so a good starting point is to simply build on top of the existing endpoints before we explore a push-based approach. Let me know your thoughts! Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814227#comment-13814227 ] Tobias Weingartner commented on MESOS-780: -- I don't think that plug-able support is a must. Having an endpoint that you can query/scrape should be enough. There is nothing preventing the running of an agent that scrapes these endpoints and then pushed the data (if push is wanted) or offers the data up in a manner that is required for whatever health monitoring that is present within the infrastructure. In many ways, I think that the support for 3rd part performance and health monitoring is already there. Certainly there are improvements that can be done (exporting more information, etc), but I think that the basic framework is present and usable. Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814234#comment-13814234 ] Timothy St. Clair commented on MESOS-780: - Examples: https://issues.apache.org/jira/browse/HADOOP-4594 https://htcondor-wiki.cs.wisc.edu/index.cgi/tktview?tn=3515 Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814307#comment-13814307 ] David Robinson commented on MESOS-780: -- You can already solve the push problem quite easily, with no changes to Mesos. eg: https://collectd.org/wiki/index.php/Plugin:cURL-JSON https://collectd.org/wiki/index.php/Plugin:Write_Graphite Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (MESOS-780) Adding support for 3rd party performance and health monitoring.
[ https://issues.apache.org/jira/browse/MESOS-780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13814316#comment-13814316 ] Bernardo Gomez Palacio commented on MESOS-780: -- [~tweingartner] agree, there is nothing preventing anyone to build such integration. This ticket, which was spawn from the 1st Mesos Hackathon at Mesosphere, is meant to facilitate the integration of Mesos with the tools mentioned in the ticket. [~bmahler] some tools, like Ganglia, work on a pull model. [~davidtrobinson] I never mentioned that changing the Mesos Source Code is a requirement nor the main intention. The goal should be, I think, to facilitate the integration of Mesos to Monitoring Tools. I think that the reference to CollectD and should be covered but not everyone wants it or has it deployed. Adding support for 3rd party performance and health monitoring. --- Key: MESOS-780 URL: https://issues.apache.org/jira/browse/MESOS-780 Project: Mesos Issue Type: Improvement Components: framework Reporter: Bernardo Gomez Palacio User Story: As a SysAdmin I should be able to monitor Mesos (Masters and Slaves) with 3rd party tools such as: * [Ganglia|http://ganglia.sourceforge.net/] * [Graphite|http://graphite.wikidot.com/] * [Nagios|http://www.nagios.org/] * [Zabbix|http://www.zabbix.com/] -- This message was sent by Atlassian JIRA (v6.1#6144)