subject:"\"Monitoring\""

Re: Reservation status monitoring

2018-01-18 Thread Vinod Kone

The agents tab in the Mesos WebUI should have a table for per-role reservations. This is a new feature, so you might need to upgrade to the latest version to get it. On Tue, Jan 16, 2018 at 9:23 PM, 박도형 wrote: > Hi Folks, > Is there an easy way to see the current overall status of reservation o

Reservation status monitoring

2018-01-16 Thread 박도형

Hi Folks, Is there an easy way to see the current overall status of reservation of resources in Mesos Master? I want to monitor status such as reserved resources per agent or per role with the Mesos Web UI. DH Park. Dohyeong Park, Engineer Cloud Platform Group, Mobile R&D Office

Re: Mesos Cluster Monitoring

2017-07-27 Thread Dharama Rao

Thursday, July 27, 2017 at 9:07 AM To: "user@mesos.apache.org" <user@mesos.apache.org> Subject: RE: Mesos Cluster Monitoring Hi there, I am using plugins from this repo for monitoring Master servers https://github.com/HariSekhon/na

RE: Mesos Cluster Monitoring

2017-07-27 Thread Dino Lokmić

Hi there, I am using plugins from this repo for monitoring Master servers: https://github.com/HariSekhon/nagios-plugins For slaves I use standard Nagios plugins (memory, disk, etc.). Best regards From: Dharama Rao Sent: Thursday, 27 July 2017 15:03 To: user@mesos.apache.org Subject: Mesos

Mesos Cluster Monitoring

2017-07-27 Thread Dharama Rao

Hi Team, We have a request to monitor the Mesos master, slaves, and ZooKeeper using Nagios. Do you have any suggestions for us? I have searched the web but cannot find an appropriate guide for setting up Nagios on DC/OS Mesos. Could you please guide me here? Thanks & Regards, Dharama Rao, 470-589-8123.

RE: Cadvisor and Mesos: cgroup monitoring

2016-08-01 Thread Aurélien DEHAY

My bad. I forgot to restart the tasks. It works fine. Thanks again. From: haosdent [mailto:haosd...@gmail.com] Sent: Monday, 1 August 2016 16:10 To: user Subject: Re: Cadvisor and Mesos: cgroup monitoring Hi, which Mesos version did you use? cgroups/devices is available since 1.0.0 On Mon, Aug 1

Re: Cadvisor and Mesos: cgroup monitoring

2016-08-01 Thread haosdent

nap with the plugin from Roger. > > > Thanks. > -- > *From:* haosdent > *Sent:* Friday, 29 July 2016 17:30:20 > > *To:* user > *Subject:* Re: Cadvisor and Mesos: cgroup monitoring > > Hi, you could add `cgroups/devices` into your

RE: Cadvisor and Mesos: cgroup monitoring

2016-08-01 Thread Aurélien DEHAY

Mesos: cgroup monitoring Hi, you could add `cgroups/devices` into your Mesos Agent isolation flags and restart it. On Fri, Jul 29, 2016 at 10:30 PM, aurelien.de...@gmail.com <aurelien.de...@gmail.com> wrote: Hello. For example: proc

RE: Cadvisor and Mesos: cgroup monitoring

2016-07-29 Thread aurelien.de...@gmail.com

and Mesos: cgroup monitoring Hey Aurélien, I'm not sure if using cadvisor is a hard requirement for you, but might I suggest checking out Snap[0]? I recently wrote a Mesos metrics collector[1] that enumerates all running containers on an agent, in addition to other higher level res

Re: Cadvisor and Mesos: cgroup monitoring

2016-07-29 Thread Roger Ignazio

nto cgroups. `ps` uses `devices` > cgroups to identify the cgroups of a process. In your case, devices cgroup > is still set to root `/`. > > Thanks for the answer. > -- > *From:* haosdent > *Sent:* Friday, 29 July 2016 16:21:07 > *To:* user &

Re: Cadvisor and Mesos: cgroup monitoring

2016-07-29 Thread haosdent

to root `/`. > > Thanks for the answer. > -- > *From:* haosdent > *Sent:* Friday, 29 July 2016 16:21:07 > *To:* user > *Subject:* Re: Cadvisor and Mesos: cgroup monitoring > > >- is there a way to "link" the cgroup id to the m

RE: Cadvisor and Mesos: cgroup monitoring

2016-07-29 Thread aurelien.de...@gmail.com

to root `/`. Thanks for the answer. From: haosdent Sent: Friday, 29 July 2016 16:21:07 To: user Subject: Re: Cadvisor and Mesos: cgroup monitoring >- is there a way to "link" the cgroup id to the mesos task id? The cgroup id you saw is the Cont

Re: Cadvisor and Mesos: cgroup monitoring

2016-07-29 Thread haosdent

>- is there a way to "link" the cgroup id to the mesos task id? The cgroup id you saw is the ContainerId actually. You could get the mapping relations from querying the state endpoint of Mesos agent. >In mesos, this information is set to /, so all my process are shown in the / cgroup. May you mind
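The mapping described in this reply can be sketched in Python as below. This is a minimal sketch, not the agent's documented contract: it assumes the agent state document has already been fetched and parsed into a dict, and that it nests as `frameworks` → `executors`, with each executor carrying a `container` field (the ContainerId) and a `tasks` list whose entries have an `id`. Those field names and the sample values are assumptions to verify against the state output of your own Mesos version.

```python
from typing import Dict, List


def container_to_tasks(agent_state: dict) -> Dict[str, List[str]]:
    """Map each ContainerId to the task IDs running in it.

    Field names ("frameworks", "executors", "container", "tasks", "id")
    are assumptions about the agent state layout, not a guaranteed schema.
    """
    mapping: Dict[str, List[str]] = {}
    for framework in agent_state.get("frameworks", []):
        for executor in framework.get("executors", []):
            container_id = executor.get("container")
            if not container_id:
                # Skip executors that do not report a container.
                continue
            task_ids = [t["id"] for t in executor.get("tasks", []) if "id" in t]
            mapping.setdefault(container_id, []).extend(task_ids)
    return mapping


# Hypothetical sample shaped like the assumed state document.
sample_state = {
    "frameworks": [
        {
            "executors": [
                {"container": "c0ffee-0001", "tasks": [{"id": "web.1"}]},
                {"container": "c0ffee-0002", "tasks": [{"id": "db.1"}, {"id": "db.2"}]},
            ]
        }
    ]
}

mapping = container_to_tasks(sample_state)
```

With such a mapping, a cgroup directory name seen by a monitoring tool can be looked up to find the task or tasks it belongs to.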

Cadvisor and Mesos: cgroup monitoring

2016-07-29 Thread aurelien.de...@gmail.com

Hello. I'm trying to find a solution to monitor the real usage of my mesos tasks. I don't use docker at all, but I took a look at cadvisor. Unfortunately, cadvisor uses ps and the device information to determine the cgroup ownership of a process. In mesos, this information is set to /, so a

Re: Monitoring at container level

2016-07-07 Thread Michał Łowicki

For now the easiest solution not requiring changing monitoring infrastructure would be the one proposed by Steven Schlansker so will try to get some information about it from Marathon team. Thanks! On Fri, Jul 8, 2016 at 5:20 AM, wrote: > Small plug for snap (https://github.com/intelsdi-x/s

Re: Monitoring at container level

2016-07-07 Thread connor . p . d

wrote: >> We use a combination of new relic for application level monitoring and a >> custom python script that scrapes a bunch of stats from the docker socket >> file and throws them into elastic so we can use kibana to make graphs. >> >> >> >>

Re: Monitoring at container level

2016-07-07 Thread Guangya Liu

Have you ever tried Prometheus + Grafana? Please take a look at https://prometheus.io/docs/visualization/grafana/ to see if it helps. On Fri, Jul 8, 2016 at 5:51 AM, David Kesler wrote: > We use a combination of new relic for application level monitoring and a > custom python scrip

RE: Monitoring at container level

2016-07-07 Thread David Kesler

We use a combination of New Relic for application-level monitoring and a custom Python script that scrapes a bunch of stats from the Docker socket file and throws them into Elastic so we can use Kibana to make graphs. From: Gregory Durham [mailto:gregory.dur...@gmail.com] Sent: Thursday, July

Re: Monitoring at container level

2016-07-07 Thread Gregory Durham

I have been using datadog to monitor my infrastructure. The integration into service discovery has been really helpful for these environments. On Thu, Jul 7, 2016 at 1:37 PM, Steven Schlansker wrote: > We use Graphite and ran into similar problems with huge metric namespaces. > We use the Singul

Re: Monitoring at container level

2016-07-07 Thread Steven Schlansker

We use Graphite and ran into similar problems with huge metric namespaces. We use the Singularity framework which provides both the task "request id" (name) and "instance number" (0..N) to the task. So we set our Graphite namespace to be "request-number" e.g. "myservice-3" This has the downside o

Re: Monitoring at container level

2016-07-07 Thread Krish

I have had a good experience so far with bosun and scollector with cadvisor. Check it out at bosun.org. On Friday 8 July 2016, Pradeep Chhetri wrote: > Hi Michal, > > Do have a look at sysdig (http://www.sysdig.org). It is basically an > open-source tool which provides container insights. Maybe

Re: Monitoring at container level

2016-07-07 Thread Pradeep Chhetri

Hi Michal, Do have a look at sysdig (http://www.sysdig.org). It is basically an open-source tool which provides container insights. Maybe you will find something helpful over there. To tackle the case of new metrics for new containers, maybe you should tag metrics by service-name instead of cont

Monitoring at container level

2016-07-07 Thread Michał Łowicki

Hi, Before introducing Mesos we were mainly using Graphite / Grafana. Ideally we would like to have metrics per container as an easy way to detect whether a problem affects only a single container, a subset of containers, or is global. Unfortunately using Graphite for that is far from perfect. Having container i

Re: Monitoring

2016-01-19 Thread Erb, Stephan

of the individual services running on your cluster (http://prometheus.io/docs/operating/configuration/#marathon-sd-configurations-marathon_sd_configs) From: Tom Arnfeld Sent: Tuesday, January 19, 2016 10:27 PM To: user@mesos.apache.org Subject: Re: Monitoring

Re: Monitoring

2016-01-19 Thread Tom Arnfeld

y use it to monitor both Apache > Mesos and Apache Aurora. > > Twitter has an internal monitoring system, and we have an agent which is > installed via RPM/puppet on each host that scrapes the metrics pages and > pushes data to our time series database. If you wanted to setup an agent &

Re: Monitoring

2016-01-19 Thread Joe Smith

TellApart also has a rather active fork of Diamond (they're working to merge it back upstream ~soonish) that you can take a look at https://github.com/tellapart/Diamond. They use it to monitor both Apache Mesos and Apache Aurora. Twitter has an internal monitoring system, and we have an

Re: Monitoring

2016-01-19 Thread Tomek Janiszewski

://www.youtube.com/watch?v=zlgAT_xFNzU Tomek On Tue, 19.01.2016 at 21:04, Michał Łowicki wrote: > Hi, > > I've read Mesos Observability Metrics > <http://mesos.apache.org/documentation/latest/monitoring/> which gives > nice overview of cluster's health. What about other pa

Monitoring

2016-01-19 Thread Michał Łowicki

Hi, I've read Mesos Observability Metrics <http://mesos.apache.org/documentation/latest/monitoring/> which gives nice overview of cluster's health. What about other parameters like I/O usage (disk, network), number of processes etc. Maybe there are some tools or their configuratio

Re: [Announce] Satellite: Two Sigma's Monitoring, Alerting, and Self-Healing Application for Apache Mesos

2015-06-09 Thread Niklas Nielsen

Amazing! Thanks for sharing, Sunil! On 9 June 2015 at 11:22, Sunil Abraham wrote: > We are pleased to announce that Two Sigma is open-sourcing Satellite, a > Mesos monitoring and alerting application with self-healing capabilities. > Satellite makes it easy to cont

[Announce] Satellite: Two Sigma's Monitoring, Alerting, and Self-Healing Application for Apache Mesos

2015-06-09 Thread Sunil Abraham

We are pleased to announce that Two Sigma is open-sourcing Satellite, a Mesos monitoring and alerting application with self-healing capabilities. Satellite makes it easy to continuously monitor your cluster, automatically add and remove slaves, and alert you via email or PagerDuty if your cluster

Resource Monitoring-run time optimization

2015-04-09 Thread Pradeep Kiruvale

Hi All, Is there a way in Mesos to monitor the scheduled tasks and their resource access patterns, and to reschedule the tasks onto better-suited resources? For example, like the NUMA balancer in Linux on a NUMA system. Regards, Pradeep

Re: Question on Monitoring a Mesos Cluster

2015-03-11 Thread Alex Rukletsov

eport the node load. This, however, includes all the processes running on a node, not only mesos and its tasks. Hope this helps. On Mon, Mar 9, 2015 at 8:16 AM, Andras Kerekes < andras.kere...@ishisystems.com> wrote: > We use the same monitoring script from rayrod2030. However

RE: Question on Monitoring a Mesos Cluster

2015-03-09 Thread Andras Kerekes

We use the same monitoring script from rayrod2030. However, instead of master_cpus_percent, we use master_cpus_used and master_cpus_total to calculate a percentage. This gives the allocated percentage of CPUs in the cluster; the actual utilization is measured by collectd
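The calculation described here can be sketched in Python as follows. This is a minimal illustration, assuming the metrics snapshot has already been fetched and parsed into a dict, and that the two values appear under the keys `master/cpus_used` and `master/cpus_total` (the collectd names in the message use underscores). The key names and the sample numbers are assumptions, not values taken from the thread.

```python
def allocated_cpu_percent(snapshot: dict) -> float:
    """Return allocated CPUs as a percentage of total CPUs in the cluster.

    This is allocation (what Mesos has handed out), not actual utilization.
    The key names are assumed; adjust them to what your snapshot contains.
    """
    total = snapshot.get("master/cpus_total", 0.0)
    used = snapshot.get("master/cpus_used", 0.0)
    if total <= 0:
        # Avoid dividing by zero when no agents are registered.
        return 0.0
    return 100.0 * used / total


# Hypothetical snapshot values for illustration only.
example_snapshot = {"master/cpus_total": 64.0, "master/cpus_used": 16.0}
percent = allocated_cpu_percent(example_snapshot)
```

Because this number reflects allocation only, a cluster can show a high percentage while the hosts are mostly idle, which is why the message pairs it with host-level utilization from collectd.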

Re: Question on Monitoring a Mesos Cluster

2015-03-07 Thread Dick Davies

age metrics above go up >>> considerably. I did notice that cpus_percent went to around 0.94. >>> >>> What is the correct way to measure overall cluster utilization for >>> capacity planning? We can have the NOC watch this and simply add more >>> har

Re: Question on Monitoring a Mesos Cluster

2015-03-07 Thread Jeff Schroeder

measure overall cluster utilization for >> capacity planning? We can have the NOC watch this and simply add more >> hardware when the number starts getting low. >> > > Boy, I cannot wait to read the tidbits of wisdom here. Maybe the > development group has more accurate informa

Re: Question on Monitoring a Mesos Cluster

2015-03-07 Thread CCAAT

, where a single problem spans large numbers of processors and resources, as well as Clusters/Clouds where a myriad of small to large tasks are processed concurrently. Robustly monitoring both scenarios does require different tools, greatly dependent on the granularity of the monitoring needs

Question on Monitoring a Mesos Cluster

2015-03-07 Thread Jeff Schroeder

I wrote a Python collectd plugin which pulls both master (only if master/elected == 1) and slave stats from the REST API under /metrics/snapshot and /slave(1)/stats.json respectively and throws those into Graphite. After getting everything working, I built a few dashboards, one of which displays t
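The elected-master check and the hand-off to Graphite might look roughly like the Python sketch below. It is not the plugin from this message: it assumes the snapshot is already parsed into a dict, emits lines in Graphite's plaintext format (`path value timestamp`), and uses a hypothetical `prefix` argument and helper name for illustration.

```python
from typing import List


def graphite_lines(snapshot: dict, prefix: str, timestamp: int) -> List[str]:
    """Turn a master metrics snapshot into Graphite plaintext lines.

    Returns an empty list unless this master reports master/elected == 1,
    so only the leading master publishes cluster-wide stats.
    """
    if snapshot.get("master/elected") != 1:
        return []
    lines = []
    for key in sorted(snapshot):
        # Graphite uses dots as path separators, so "a/b" becomes "a.b".
        path = prefix + "." + key.replace("/", ".")
        lines.append(f"{path} {snapshot[key]} {timestamp}")
    return lines


# Hypothetical snapshots: one from the elected master, one from a standby.
leader = {"master/elected": 1.0, "master/cpus_used": 2.0}
standby = {"master/elected": 0.0, "master/cpus_used": 0.0}

leader_lines = graphite_lines(leader, "mesos", 100)
standby_lines = graphite_lines(standby, "mesos", 100)
```

Filtering on the elected flag keeps standby masters from writing duplicate or zeroed series into the same Graphite namespace.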

Re: Monitoring mesos processes and application in New Relic

2015-02-11 Thread Gary Malouf

no Mesos plugin for New Relic at the moment. You can > probably build one fairly easily though, using their API. > > On Wed, Feb 11, 2015 at 10:18 PM, Gary Malouf > wrote: > >> Our team is using New Relic for all of our standard monitoring and would >> like to ad

Re: Monitoring mesos processes and application in New Relic

2015-02-11 Thread Steve Domin

Afaik there is no Mesos plugin for New Relic at the moment. You can probably build one fairly easily though, using their API. On Wed, Feb 11, 2015 at 10:18 PM, Gary Malouf wrote: > Our team is using New Relic for all of our standard monitoring and would > like to add Mesos/Chronos to the

Monitoring mesos processes and application in New Relic

2015-02-11 Thread Gary Malouf

Our team is using New Relic for all of our standard monitoring and would like to add Mesos/Chronos to the stack. Just wondering if anyone knows of ways to monitor Mesos processes or even performance through the New Relic interface.

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Billy Bones

ry Malouf : > Billy, thanks for the link. It was not easy to tell from the website, but > do you get email/text alerts if something goes wrong overnight? > > On Wed, Dec 10, 2014 at 3:54 AM, Billy Bones > wrote: > >> Here we use the wonderful consul tool as our monitoring a

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Leigh Martell

4 at 3:54 AM, Billy Bones > wrote: > >> Here we use the wonderful consul tool as our monitoring and health check >> dashboard plus some other things. >> >> Check it out at consul.io, it's made by Hashicorp. >> I kinda like it because it's fast, reliabl

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Gary Malouf

Billy, thanks for the link. It was not easy to tell from the website, but do you get email/text alerts if something goes wrong overnight? On Wed, Dec 10, 2014 at 3:54 AM, Billy Bones wrote: > Here we use the wonderful consul tool as our monitoring and health check > dashboard plus some

Re: Monitoring Mesos slave/master processes

2014-12-10 Thread Billy Bones

Here we use the wonderful consul tool as our monitoring and health check dashboard plus some other things. Check it out at consul.io, it's made by Hashicorp. I kinda like it because it's fast, reliable, and built with huge distributed systems in mind from the ground up. 2014-

Re: Monitoring Mesos slave/master processes

2014-12-09 Thread Steven Schlansker

On Dec 9, 2014, at 3:45 PM, Gary Malouf wrote: > We did this in the past with Nagios, but I was wondering if there was a > recommended way from others using it in production. I wrote a Nagios plugin for it: https://github.com/opentable/nagios-mesos

Re: Monitoring Mesos slave/master processes

2014-12-09 Thread Tom Arnfeld

We're monitoring the processes with M/Monit on each machine and pumping all the metrics to Graphite with https://github.com/rayrod2030/collectd-mesos. -- Tom Arnfeld Developer // DueDil On Tue, Dec 9, 2014 at 11:46 PM, Gary Malouf wrote: > We did this in the past with Nagios, b

Monitoring Mesos slave/master processes

2014-12-09 Thread Gary Malouf

We did this in the past with Nagios, but I was wondering if there was a recommended way from others using it in production.

47 matches
