Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2019-02-15 Thread Andreas Dilger
On Feb 14, 2019, at 08:09, Laifer, Roland (SCC)  wrote:
> 
> Hello,
> 
> since I usually just want to do a short check what is going on on
> clients and servers the hint for lltop seemed to be the best for my needs.
> 
> I tried it but there was an error message on the MDS. However, I found
> an easy fix and just had to replace
>  "/proc/fs/lustre/mds",
> with
>  "/proc/fs/lustre/mdt",
> in lltop-master/serv.c
> 
> It seems like many tools need to be adapted for the structures of new
> Lustre versions.
> 
> lltop is a nice tool to show which clients are currently doing most
> reads, writes or general requests.

Feel free to make a fork of the lltop repo and push a patch?  I don't think
that John is maintaining it anymore.  You could change the link on the wiki
page to point at your fork.

> Am 21.12.18 um 11:43 schrieb Martin Hecht:
>> Hello Roland,
>> 
>> there is a nice collection of lustre monitoring tools on the lustre wiki:
>> 
>> http://wiki.lustre.org/Lustre_Monitoring_and_Statistics_Guide
>> 
>> which also contains a couple of references. One of them is lltop, which
>> has already been mentioned a couple of times and that's what came to my
>> mind as well when I read your question.
>> 
>> best regards,
>> Martin
> 
> ___
> lustre-discuss mailing list
> lustre-discuss@lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Cheers, Andreas
---
Andreas Dilger
Principal Lustre Architect
Whamcloud







___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2019-02-14 Thread Laifer, Roland (SCC)
Hello,

since I usually just want to do a short check what is going on on
clients and servers the hint for lltop seemed to be the best for my needs.

I tried it but there was an error message on the MDS. However, I found
an easy fix and just had to replace
  "/proc/fs/lustre/mds",
with
  "/proc/fs/lustre/mdt",
in lltop-master/serv.c

It seems like many tools need to be adapted for the structures of new
Lustre versions.

lltop is a nice tool to show which clients are currently doing most
reads, writes or general requests.

Regards,
  Roland


Am 21.12.18 um 11:43 schrieb Martin Hecht:
> Hello Roland,
> 
> there is a nice collection of lustre monitoring tools on the lustre wiki:
> 
> http://wiki.lustre.org/Lustre_Monitoring_and_Statistics_Guide
> 
> which also contains a couple of references. One of them is lltop, which
> has already been mentioned a couple of times and that's what came to my
> mind as well when I read your question.
> 
> best regards,
> Martin

___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2018-12-21 Thread Christopher Johnston
I use the exact same setup along with Kapacitor for alerting (alerta as the
dash).  We have created have dozens of panels in Grafana that are very
useful for troubleshooting bottlenecks with the OSS nodes, disks, as well
as the clients.

Can't go wrong with it I feel, easy to setup and fun to make graphs :-)



On Thu, Dec 20, 2018 at 12:15 PM Alexander I Kulyavtsev 
wrote:

> 1) cerebro + ltop still work.
>
>
> 2) telegraf + inflixdb (collector, time series DB ). Telegraf has
> input plugins for lustre ("lustre2"), zfs,  and many others. Grafana to
> plot live data from DB. Also, influxDB integrates with Prometheus.
>
> Basically, each component can feed data to different output types through
> plugins; or take data from multiple type of sources so you can use
> different combination for your monitoring stack.
>
>
> For the simplest tool you may take a look if telegraf from influxdb stack
> has proper output plugin (see influxdata on github).
>
>
> Alex.
> --
> *From:* lustre-discuss  on
> behalf of Laifer, Roland (SCC) 
> *Sent:* Thursday, December 20, 2018 8:04:55 AM
> *To:* lustre-discuss@lists.lustre.org
> *Subject:* [lustre-discuss] Command line tool to monitor Lustre I/O ?
>
> Dear Lustre administrators,
>
> what is a good command line tool to monitor current Lustre metadata and
> throughput operations on the local client or server? Up to now we had
> used collectl but this no longer works for Lustre 2.10.
>
> Some background about collectl: The Lustre support of collectl was
> removed many years ago but up to Lustre 2.7 it was still possible to
> monitor metadata and throughput operations on clients. In addition,
> there were plugins which also worked for the server side, see
>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.lustre.org_Collectl=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=23V5nhLj03jeTboyg6QveA=RpMjhssRJoiP3ANRP6Ze3_nBrliMMPOgQaewqEwRTn4=QmdmoNcRR5A0sOgiJimMo0KtZnc-ne44A4YY8aSWbuI=
> However, it seems that there was no update for these plugins to adapt
> them for Lustre 2.10.
>
> Regards,
>   Roland
> ___
> lustre-discuss mailing list
> lustre-discuss@lists.lustre.org
>
> https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.lustre.org_listinfo.cgi_lustre-2Ddiscuss-2Dlustre.org=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=23V5nhLj03jeTboyg6QveA=RpMjhssRJoiP3ANRP6Ze3_nBrliMMPOgQaewqEwRTn4=SXbueuHkxyBAq95D_-bLmBayRVDMtR-l7t0XZfNXEXk=
> ___
> lustre-discuss mailing list
> lustre-discuss@lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
>
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2018-12-21 Thread Andrew Elwell
On Fri., 21 Dec. 2018, 01:05 Laifer, Roland (SCC)  Dear Lustre administrators,
>
> what is a good command line tool to monitor current Lustre metadata and
> throughput operations on the local client or server?
>

I wrote a small python script to parse lctl get_param and inject it
straight into our influxdb server - As I was dropping this onto a sonnexion
(as well as our newer systems which had collectd installed) I didn't want
to require any software not already installed on the system.

My plan [one of these days in my spare time] is to wrap it properly as a
collectd python plugin - would people be interested and I'll probably see
if I can find some time to work on it over xmas?

Once it's in influx, we can then just plot it with our normal tooling
(grafana) - some pictures in the pptx at
https://www.dropbox.com/s/rck1lm73wlwlg6v/monitoring.pptx?dl=0 (near end)

Andrew

>
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2018-12-21 Thread Martin Hecht
Hello Roland,

there is a nice collection of lustre monitoring tools on the lustre wiki:

http://wiki.lustre.org/Lustre_Monitoring_and_Statistics_Guide

which also contains a couple of references. One of them is lltop, which
has already been mentioned a couple of times and that's what came to my
mind as well when I read your question.

best regards,
Martin



___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2018-12-20 Thread Harr, Cameron
I use ltop heavily:

https://github.com/LLNL/lmt


On 12/20/18 9:15 AM, Alexander I Kulyavtsev wrote:

1) cerebro + ltop still work.


2) telegraf + inflixdb (collector, time series DB ). Telegraf has input plugins 
for lustre ("lustre2"), zfs,  and many others. Grafana to  plot live data from 
DB. Also, influxDB integrates with Prometheus.

Basically, each component can feed data to different output types through 
plugins; or take data from multiple type of sources so you can use different 
combination for your monitoring stack.


For the simplest tool you may take a look if telegraf from influxdb stack has 
proper output plugin (see influxdata on github).


Alex.


From: lustre-discuss 
<mailto:lustre-discuss-boun...@lists.lustre.org>
 on behalf of Laifer, Roland (SCC) 
<mailto:roland.lai...@kit.edu>
Sent: Thursday, December 20, 2018 8:04:55 AM
To: lustre-discuss@lists.lustre.org<mailto:lustre-discuss@lists.lustre.org>
Subject: [lustre-discuss] Command line tool to monitor Lustre I/O ?

Dear Lustre administrators,

what is a good command line tool to monitor current Lustre metadata and
throughput operations on the local client or server? Up to now we had
used collectl but this no longer works for Lustre 2.10.

Some background about collectl: The Lustre support of collectl was
removed many years ago but up to Lustre 2.7 it was still possible to
monitor metadata and throughput operations on clients. In addition,
there were plugins which also worked for the server side, see
https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.lustre.org_Collectl=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=23V5nhLj03jeTboyg6QveA=RpMjhssRJoiP3ANRP6Ze3_nBrliMMPOgQaewqEwRTn4=QmdmoNcRR5A0sOgiJimMo0KtZnc-ne44A4YY8aSWbuI=
However, it seems that there was no update for these plugins to adapt
them for Lustre 2.10.

Regards,
  Roland
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org<mailto:lustre-discuss@lists.lustre.org>
https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.lustre.org_listinfo.cgi_lustre-2Ddiscuss-2Dlustre.org=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=23V5nhLj03jeTboyg6QveA=RpMjhssRJoiP3ANRP6Ze3_nBrliMMPOgQaewqEwRTn4=SXbueuHkxyBAq95D_-bLmBayRVDMtR-l7t0XZfNXEXk=



___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org<mailto:lustre-discuss@lists.lustre.org>
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2018-12-20 Thread Daniel Kobras
Hi Roland!

> Am 20.12.2018 um 15:04 schrieb Laifer, Roland (SCC) :
> 
> what is a good command line tool to monitor current Lustre metadata and
> throughput operations on the local client or server? Up to now we had
> used collectl but this no longer works for Lustre 2.10.

The Lustre exporter (https://github.com/HewlettPackard/lustre_exporter) for 
Prometheus copes well with 2.10. Calling it a command-line tool is a bit of a 
stretch (hey, there’s curl after all!), but it can certainly step in for 
collect’s non-interactive mode of operation.

Kind regards,

Daniel
-- 
Daniel Kobras
Principal Architect
Puzzle ITC Deutschland
https://www.puzzle-itc.de

-- 
Puzzle ITC Deutschland GmbH
Sitz der Gesellschaft:  Jurastr. 27/1, 72072 
Tübingen
Eingetragen am Amtsgericht Stuttgart HRB 765802
Geschäftsführer: 
Lukas Kallies
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


Re: [lustre-discuss] Command line tool to monitor Lustre I/O ?

2018-12-20 Thread Alexander I Kulyavtsev
1) cerebro + ltop still work.


2) telegraf + inflixdb (collector, time series DB ). Telegraf has input plugins 
for lustre ("lustre2"), zfs,  and many others. Grafana to  plot live data from 
DB. Also, influxDB integrates with Prometheus.

Basically, each component can feed data to different output types through 
plugins; or take data from multiple type of sources so you can use different 
combination for your monitoring stack.


For the simplest tool you may take a look if telegraf from influxdb stack has 
proper output plugin (see influxdata on github).


Alex.


From: lustre-discuss  on behalf of 
Laifer, Roland (SCC) 
Sent: Thursday, December 20, 2018 8:04:55 AM
To: lustre-discuss@lists.lustre.org
Subject: [lustre-discuss] Command line tool to monitor Lustre I/O ?

Dear Lustre administrators,

what is a good command line tool to monitor current Lustre metadata and
throughput operations on the local client or server? Up to now we had
used collectl but this no longer works for Lustre 2.10.

Some background about collectl: The Lustre support of collectl was
removed many years ago but up to Lustre 2.7 it was still possible to
monitor metadata and throughput operations on clients. In addition,
there were plugins which also worked for the server side, see
https://urldefense.proofpoint.com/v2/url?u=http-3A__wiki.lustre.org_Collectl=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=23V5nhLj03jeTboyg6QveA=RpMjhssRJoiP3ANRP6Ze3_nBrliMMPOgQaewqEwRTn4=QmdmoNcRR5A0sOgiJimMo0KtZnc-ne44A4YY8aSWbuI=
However, it seems that there was no update for these plugins to adapt
them for Lustre 2.10.

Regards,
  Roland
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
https://urldefense.proofpoint.com/v2/url?u=http-3A__lists.lustre.org_listinfo.cgi_lustre-2Ddiscuss-2Dlustre.org=DwICAg=gRgGjJ3BkIsb5y6s49QqsA=23V5nhLj03jeTboyg6QveA=RpMjhssRJoiP3ANRP6Ze3_nBrliMMPOgQaewqEwRTn4=SXbueuHkxyBAq95D_-bLmBayRVDMtR-l7t0XZfNXEXk=
___
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org