Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Billy Bones
Well, since we build our business on the "Failure is a feature" credo, we don't have a very advanced notification system, but we use this instead: https://github.com/AcalephStorage/consul-alerts. You can run it in a Docker container, and it has fine-grained configuration settings, event handlers (built-in

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Leigh Martell
Hey Gary, I just finished setting up Consul; you need to set up handlers. In my case I used a project called consul-alerts. The advantage here is that it holds the alert state, so if an alert is not cleared in x seconds it will then alert your endpoint (e.g. PagerDuty). Here is the link https://github
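The "hold the alert state" behaviour described above can be sketched roughly as follows. This is not consul-alerts' actual code or API, only a minimal illustration of the idea: a failing check triggers a notification only once it has stayed failing for a threshold number of seconds. The class name `AlertHolder`, its `update` method, and the `hold_seconds` parameter are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class AlertHolder:
    """Hypothetical helper: notify only if a check stays failing long enough."""

    hold_seconds: float
    _failing_since: Dict[str, float] = field(default_factory=dict)
    _notified: Set[str] = field(default_factory=set)

    def update(self, check: str, healthy: bool, now: float) -> bool:
        """Record a check result; return True when a notification should fire."""
        if healthy:
            # The check recovered: clear the held state so a later failure
            # starts a fresh hold window.
            self._failing_since.pop(check, None)
            self._notified.discard(check)
            return False
        # Remember when this failure streak started (first failure wins).
        started = self._failing_since.setdefault(check, now)
        if now - started >= self.hold_seconds and check not in self._notified:
            # Failing for at least hold_seconds and not yet reported.
            self._notified.add(check)
            return True
        return False
```

In use, a caller would feed each health-check result into `update` with the current time (for example `time.time()`) and forward to the paging service only when it returns True, so short blips that clear within the hold window never page anyone.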

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Gary Malouf
Billy, thanks for the link. It was not easy to tell from the website, but do you get email/text alerts if something goes wrong overnight? On Wed, Dec 10, 2014 at 3:54 AM, Billy Bones wrote: > Here we use the wonderful consul tool as our monitoring and health check > dashboard plus some other t

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Billy Bones
Here we use the wonderful Consul tool as our monitoring and health check dashboard, plus some other things. Check it out at consul.io; it's made by HashiCorp. I kinda like it because it's fast, reliable, and built with huge distributed systems in mind from the ground up. 2014-12-10 1:11 GMT

Re: Monitoring Mesos slave/master processes

2014-12-09 Thread Steven Schlansker
On Dec 9, 2014, at 3:45 PM, Gary Malouf wrote: > We did this in the past with Nagios, but I was wondering if there was a > recommended way from others using in production. I wrote a Nagios plugin for it: https://github.com/opentable/nagios-mesos
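For readers unfamiliar with how a Nagios check reports its result, here is a rough sketch of the general plugin convention: print one status line and exit with 0 (OK), 1 (WARNING), 2 (CRITICAL) or 3 (UNKNOWN). It is not the code of the plugin linked above. The functions `probe` and `classify` are hypothetical names, and the URL passed to `probe` is a placeholder for whatever HTTP endpoint your Mesos master or slave exposes.

```python
import urllib.error
import urllib.request
from typing import Optional, Tuple

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def probe(url: str, timeout: float = 5.0) -> Optional[int]:
    """Return the HTTP status for url, or None if no response was received."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status (4xx/5xx).
        return err.code
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, timeout, and similar.
        return None


def classify(http_status: Optional[int]) -> Tuple[int, str]:
    """Map an HTTP status to a (Nagios exit code, status line) pair."""
    if http_status is None:
        return CRITICAL, "CRITICAL - no response"
    if http_status == 200:
        return OK, "OK - HTTP 200"
    return CRITICAL, f"CRITICAL - HTTP {http_status}"
```

A wrapper script would call `classify(probe(url))`, print the message, and pass the code to `sys.exit`, which is how Nagios picks up the state of the check.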

Re: Monitoring Mesos slave/master processes

2014-12-09 Thread Tom Arnfeld
We're monitoring the processes with M/Monit on each machine and pumping all the metrics to Graphite with https://github.com/rayrod2030/collectd-mesos. -- Tom Arnfeld Developer // DueDil On Tue, Dec 9, 2014 at 11:46 PM, Gary Malouf wrote: > We did this in the past with Nagios, but I was won

Monitoring Mesos slave/master processes

2014-12-09 Thread Gary Malouf
We did this in the past with Nagios, but I was wondering whether others running Mesos in production have a recommended approach.