The agents tab in Mesos WebUI should have a table for per role
reservations. This is a new feature, so you might need to upgrade to the
latest version to get it.
On Tue, Jan 16, 2018 at 9:23 PM, 박도형 wrote:
> Hi Folks,
>
>
>
> Is there a easy way to see the current overall status of reservation o
Hi Folks,
Is there a easy way to see the current overall status of reservation of resources in Mesos Master?
I want to monitor status such as reserved resources per agent or per role with the Mesos Web UI.
DH Park.
Dohyeong Park, Engineer
Cloud Platform Group, Mobile R&D Office
Thursday, July 27, 2017 at 9:07 AM
To: "user@mesos.apache.org<mailto:user@mesos.apache.org>"
mailto:user@mesos.apache.org>>
Subject: RE: Mesos Cluster Monitoring
Hi there
I am using plugins from this repo for monitoring Master servers
https://github.com/HariSekhon/na
Hi there
I am using plugins from this repo for monitoring Master servers
https://github.com/HariSekhon/nagios-plugins
For slaves I use standard Nagios plugins (Memory, disk, etc.)
Best regards
From: Dharama Rao
Sent: četvrtak, 27. juli 2017. 15:03
To: user@mesos.apache.org
Subject: Mesos
Hi Team,
We have a ask to monitor Mesos master , Slaves and Zookeeper using nagios.
Do you have any suggestions for us?
I have gone through web, but not finding appropriate guide for Setting up
nagios on DCOS mesos.
Could you please guide me here?
Thanks & Regards,
Dharama Rao,
470-589-8123.
:
My bad.
I forgot to restart the tasks. It works fine.
Thanks again.
De : haosdent [mailto:haosd...@gmail.com]
Envoyé : lundi 1 août 2016 16:10
À : user
Objet : Re: Cadvisor and Mesos: cgroup monitoring
Hi, which mesos version you used? cgroups/devices is available since 1.0.0
On Mon, Aug 1
nap with the plugin from Roger.
>
>
> Thanks.
> --
> *De :* haosdent
> *Envoyé :* vendredi 29 juillet 2016 17:30:20
>
> *À :* user
> *Objet :* Re: Cadvisor and Mesos: cgroup monitoring
>
> Hi, you could add `cgroups/devices` into your
Mesos: cgroup monitoring
Hi, you could add `cgroups/devices` into your Mesos Agent isolation flags and
restart it.
On Fri, Jul 29, 2016 at 10:30 PM,
aurelien.de...@gmail.com<mailto:aurelien.de...@gmail.com>
mailto:aurelien.de...@gmail.com>> wrote:
Hello.
For example:
proc
and Mesos: cgroup monitoring
Hey Aurélien,
I'm not sure if using cadvisor is a hard requirement for you, but might I
suggest checking out Snap[0]? I recently wrote a Mesos metrics collector[1]
that enumerates all running containers on an agent, in addition to other higher
level res
nto cgroups. `ps` uses `devices`
> cgroups to identify the cgroups of a process. In your case, devices cgroup
> is still set to root `/`.
>
> Thanks for the answer.
> --
> *De :* haosdent
> *Envoyé :* vendredi 29 juillet 2016 16:21:07
> *À :* user
&
to root `/`.
>
> Thanks for the answer.
> --
> *De :* haosdent
> *Envoyé :* vendredi 29 juillet 2016 16:21:07
> *À :* user
> *Objet :* Re: Cadvisor and Mesos: cgroup monitoring
>
> >- is there a way to "link" the cgroup id to the m
to root `/`.
Thanks for the answer.
De : haosdent
Envoyé : vendredi 29 juillet 2016 16:21:07
À : user
Objet : Re: Cadvisor and Mesos: cgroup monitoring
>- is there a way to "link" the cgroup id to the mesos task id?
The cgroup id you saw is the Cont
>- is there a way to "link" the cgroup id to the mesos task id?
The cgroup id you saw is the ContainerId actually. You could get the
mapping relations from querying the state endpoint of Mesos agent.
>In mesos, this information is set to /, so all my process are shown in the
/ cgroup.
May you mind
Hello.
I'm trying to find a solution to monitor the real usage of my mesos tasks. I
don't use docker at all, but I gave a look to cadvisor.
Unfortunatly, cadvisor uses ps and the device information to determine the
cgroup ownership of a process. In mesos, this information is set to /, so a
For now the easiest solution not requiring changing monitoring
infrastructure would be the one proposed by Steven Schlansker so will try
to get some information about it from Marathon team. Thanks!
On Fri, Jul 8, 2016 at 5:20 AM, wrote:
> Small plug for snap (https://github.com/intelsdi-x/s
wrote:
>> We use a combination of new relic for application level monitoring and a
>> custom python script that scrapes a bunch of stats from the docker socket
>> file and throws them into elastic so we can use kibana to make graphs.
>>
>>
>>
>>
Have you ever tried prometheus + Grafana? Please take a look at
https://prometheus.io/docs/visualization/grafana/ to see if it helps.
On Fri, Jul 8, 2016 at 5:51 AM, David Kesler wrote:
> We use a combination of new relic for application level monitoring and a
> custom python scrip
We use a combination of new relic for application level monitoring and a custom
python script that scrapes a bunch of stats from the docker socket file and
throws them into elastic so we can use kibana to make graphs.
From: Gregory Durham [mailto:gregory.dur...@gmail.com]
Sent: Thursday, July
I have been using datadog to monitor my infrastructure. The integration
into service discovery has been really helpful for these environments.
On Thu, Jul 7, 2016 at 1:37 PM, Steven Schlansker wrote:
> We use Graphite and ran into similar problems with huge metric namespaces.
> We use the Singul
We use Graphite and ran into similar problems with huge metric namespaces.
We use the Singularity framework which provides both the task "request id"
(name)
and "instance number" (0..N) to the task.
So we set our Graphite namespace to be "request-number" e.g. "myservice-3"
This has the downside o
I have had a good experience so far with bosun and scollector with cadvisor.
Check it out at bosun.org.
On Friday 8 July 2016, Pradeep Chhetri wrote:
> Hi Michal,
>
> Do have a look at sysdig (http://www.sysdig.org). It is basically an
> open-source tool which provides container insights. Maybe
Hi Michal,
Do have a look at sysdig (http://www.sysdig.org). It is basically an
open-source tool which provides container insights. Maybe your will find
something helpful over there.
To tackle the case of new metrics for new containers, maybe you should tag
metrics by service-name instead of cont
Hi,
Before introducing Mesos we're using mainly Graphite / Grafana. Ideally we
would like to have metrics per container as an easy way to detect if
problem touches only single, subset of containers or it's global.
Unfortunately using Graphite for that is far from being perfect. Having
container i
of the individual services running on your cluster
(http://prometheus.io/docs/operating/configuration/#marathon-sd-configurations-marathon_sd_configs)
From: Tom Arnfeld
Sent: Tuesday, January 19, 2016 10:27 PM
To: user@mesos.apache.org
Subject: Re: Monitoring
y use it to monitor both Apache
> Mesos and Apache Aurora.
>
> Twitter has an internal monitoring system, and we have an agent which is
> installed via RPM/puppet on each host that scrapes the metrics pages and
> pushes data to our time series database. If you wanted to setup an agent
&
TellApart also has a rather active fork of Diamond (they're working to
merge it back upstream ~soonish) that you can take a look at
https://github.com/tellapart/Diamond. They use it to monitor both Apache
Mesos and Apache Aurora.
Twitter has an internal monitoring system, and we have an
://www.youtube.com/watch?v=zlgAT_xFNzU
Tomek
wt., 19.01.2016 o 21:04 użytkownik Michał Łowicki
napisał:
> Hi,
>
> I've read Mesos Observability Metrics
> <http://mesos.apache.org/documentation/latest/monitoring/> which gives
> nice overview of cluster's health. What about other pa
Hi,
I've read Mesos Observability Metrics
<http://mesos.apache.org/documentation/latest/monitoring/> which gives nice
overview of cluster's health. What about other parameters like I/O usage
(disk, network), number of processes etc. Maybe there are some tools or
their configuratio
Amazing! Thanks for sharing Sunil!
On 9 June 2015 at 11:22, Sunil Abraham wrote:
> We are pleased to announce that Two Sigma is open-sourcing Satellite, a
> Mesos
>
> monitoring and alerting application with self-healing capabilities.
> Satellite
>
> makes it easy to cont
We are pleased to announce that Two Sigma is open-sourcing Satellite, a Mesos
monitoring and alerting application with self-healing capabilities. Satellite
makes it easy to continuously monitor your cluster, automatically add and remove
slaves, and alert you via email or PagerDuty if your cluster
Hi All,
Is there a with which in mesos we can monitor the scheduled tasks and their
resource access pattern and reschedule the tasks on the optimized
resources? For example like NUMA balancer in case of Linux on a NUMA system.
Regards,
Pradeep
eport the node load. This however includes all the
process running on a node, not only mesos and its tasks. Hope this
helps.
On Mon, Mar 9, 2015 at 8:16 AM, Andras Kerekes <
andras.kere...@ishisystems.com> wrote:
> We use the same monitoring script from rayrod2030. However
We use the same monitoring script from rayrod2030. However instead of the
master_cpus_percent, we use the master_cpus_used and master_cpus_total to
calculate a percentage. And this will give the allocated percentage of CPUs in
the cluster, the actual utilization is measured by collectd
age metrics above go up
>>> considerably. I did notice that cpus_percent went to around 0.94.
>>>
>>> What is the correct way to measure overall cluster utilization for
>>> capacity planning? We can have the NOC watch this and simply add more
>>> har
measure overall cluster utilization for
>> capacity planning? We can have the NOC watch this and simply add more
>> hardware when the number starts getting low.
>>
>
> Boy, I cannot wait to read the tidbits of wisdom here. Maybe the
> development group has more accurate informa
, where a single problem spans
large numbers of processors and resources, as well as Clusters/Clouds
where a myriad of small to large tasks are processed concurrently.
Robustly monitoring both scenarios does require different tools, greatly
dependent on the granularity of the monitoring needs
I wrote a python collectd plugin which pulls both master (only if
master/elected == 1) and slave stats from the rest api under
/metrics/snapshot and /slave(1)/stats.json respectively and throws those
into graphite.
After getting everything working, I built a few dashboards, one of which
displays t
no Mesos plugin for New Relic at the moment. You can
> probably build one fairly easily though, using their API.
>
> On Wed, Feb 11, 2015 at 10:18 PM, Gary Malouf
> wrote:
>
>> Our team is using New Relic for all of our standard monitoring and would
>> like to ad
Afaik there is no Mesos plugin for New Relic at the moment. You can
probably build one fairly easily though, using their API.
On Wed, Feb 11, 2015 at 10:18 PM, Gary Malouf wrote:
> Our team is using New Relic for all of our standard monitoring and would
> like to add Mesos/Chronos to the
Our team is using New Relic for all of our standard monitoring and would
like to add Mesos/Chronos to the stack. Just wondering if anyone knows of
ways to monitor Mesos processes or even performance through the New Relic
interface.
ry Malouf :
> Billy, thanks for the link. It was not easy to tell from the website, but
> do you get email/text alerts if something goes wrong overnight?
>
> On Wed, Dec 10, 2014 at 3:54 AM, Billy Bones
> wrote:
>
>> Here we use the wonderfull consul tool as our monitoring a
4 at 3:54 AM, Billy Bones
> wrote:
>
>> Here we use the wonderfull consul tool as our monitoring and health check
>> dashboard plus some other things.
>>
>> Check it out at consul.io, it's made by Hashicorp.
>> I kinda like it because it's fast, realiabl
Billy, thanks for the link. It was not easy to tell from the website, but
do you get email/text alerts if something goes wrong overnight?
On Wed, Dec 10, 2014 at 3:54 AM, Billy Bones wrote:
> Here we use the wonderfull consul tool as our monitoring and health check
> dashboard plus some
Here we use the wonderfull consul tool as our monitoring and health check
dashboard plus some other things.
Check it out at consul.io, it's made by Hashicorp.
I kinda like it because it's fast, realiable and it is build with huge
distributed systems in mind from the ground up.
2014-
On Dec 9, 2014, at 3:45 PM, Gary Malouf wrote:
> We did this in the past with Nagios, but I was wondering if there was a
> recommended way from others using in production.
I wrote a Nagios plugin for it
https://github.com/opentable/nagios-mesos
We're monitoring the processes with M/Monit on each machine and pumping all the
metrics to Graphite with https://github.com/rayrod2030/collectd-mesos.
--
Tom Arnfeld
Developer // DueDil
On Tue, Dec 9, 2014 at 11:46 PM, Gary Malouf
wrote:
> We did this in the past with Nagios, b
We did this in the past with Nagios, but I was wondering if there was a
recommended way from others using in production.
47 matches
Mail list logo