Well, as we build our business on the "Failure is a feature" credo we don't really have a really advanced notification system, but we use this instead: https://github.com/AcalephStorage/consul-alerts
you can run it on a docker container, have fine configuration settings, events handlers (built-in and customs) and it rely on consul KV Store for the configuration. Here we just have a docker container hosting a consul agent in client mode and consul-alert on top of this consul agent to use the distributed KV and don't rely on the master one. 2014-12-10 15:47 GMT+01:00 Gary Malouf <malouf.g...@gmail.com>: > Billy, thanks for the link. It was not easy to tell from the website, but > do you get email/text alerts if something goes wrong overnight? > > On Wed, Dec 10, 2014 at 3:54 AM, Billy Bones <gael.ther...@gmail.com> > wrote: > >> Here we use the wonderfull consul tool as our monitoring and health check >> dashboard plus some other things. >> >> Check it out at consul.io, it's made by Hashicorp. >> I kinda like it because it's fast, realiable and it is build with huge >> distributed systems in mind from the ground up. >> >> 2014-12-10 1:11 GMT+01:00 Steven Schlansker <sschlans...@opentable.com>: >> >>> >>> On Dec 9, 2014, at 3:45 PM, Gary Malouf <malouf.g...@gmail.com> wrote: >>> >>> > We did this in the past with Nagios, but I was wondering if there was >>> a recommended way from others using in production. >>> >>> I wrote a Nagios plugin for it >>> >>> https://github.com/opentable/nagios-mesos >>> >>> >>> >>> >> >