[ceph-users] Re: Status of diskprediction MGR module?

2023-08-28 Thread Anthony D'Atri


>> The module don't have new commits for more than two year
> 
> So diskprediction_local is unmaintained. Will it be removed?
> It looks like a nice feature but when you try to use it it's useless.

IIRC it has only a specific set of drive models, and the binary blob from 
ProphetStor.

>> I suggest to use smartctl_exporter [1] for monitoring drives health
> 
> I tried to deploy that with cephadm as a custom container.

I would deploy it outside of cephadm.

This exporter is promising, but note that if you have drives hidden under RoC 
HBA VDs, it does not have the ability to jump through the requisite hoops to 
extract metrics from them.


> 
> Follow-up questions:
> 
> How do I tell cephadm that smartctl_exporter has to run in a priviledged 
> container as root with all the devices?
> 
> How do I tell the cephadm managed Prometheus that it can scrape these new 
> exporters?
> 
> How do I add a dashboard in cephadm managed Grafana that shows the values 
> from smartctl_exporter? Where do I get such a dashboard?
> 
> How do I add alerts to the cephadm managed Alert-Manager? Where do I get 
> useful alert definitions for smartctl_exporter metrics?
> 
> Regards
> -- 
> Robert Sander
> Heinlein Consulting GmbH
> Schwedter Str. 8/9b, 10119 Berlin
> 
> https://www.heinlein-support.de
> 
> Tel: 030 / 405051-43
> Fax: 030 / 405051-19
> 
> Amtsgericht Berlin-Charlottenburg - HRB 220009 B
> Geschäftsführer: Peer Heinlein - Sitz: Berlin
> ___
> ceph-users mailing list -- ceph-users@ceph.io
> To unsubscribe send an email to ceph-users-le...@ceph.io
___
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io


[ceph-users] Re: Status of diskprediction MGR module?

2023-08-28 Thread Robert Sander

On 8/28/23 13:26, Konstantin Shalygin wrote:


The module don't have new commits for more than two year


So diskprediction_local is unmaintained. Will it be removed?
It looks like a nice feature but when you try to use it it's useless.


I suggest to use smartctl_exporter [1] for monitoring drives health


I tried to deploy that with cephadm as a custom container.

Follow-up questions:

How do I tell cephadm that smartctl_exporter has to run in a priviledged 
container as root with all the devices?


How do I tell the cephadm managed Prometheus that it can scrape these 
new exporters?


How do I add a dashboard in cephadm managed Grafana that shows the 
values from smartctl_exporter? Where do I get such a dashboard?


How do I add alerts to the cephadm managed Alert-Manager? Where do I get 
useful alert definitions for smartctl_exporter metrics?


Regards
--
Robert Sander
Heinlein Consulting GmbH
Schwedter Str. 8/9b, 10119 Berlin

https://www.heinlein-support.de

Tel: 030 / 405051-43
Fax: 030 / 405051-19

Amtsgericht Berlin-Charlottenburg - HRB 220009 B
Geschäftsführer: Peer Heinlein - Sitz: Berlin
___
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io


[ceph-users] Re: Status of diskprediction MGR module?

2023-08-28 Thread Konstantin Shalygin
Hi,

> On 28 Aug 2023, at 12:45, Robert Sander  wrote:
> 
> Several years ago the diskprediction module was added to the MGR collecting 
> SMART data from the OSDs.
> 
> There were local and cloud modes available claiming different accuracies. Now 
> only the local mode remains.
> 
> What is the current status of that MGR module (diskprediction_local)?
> 
> We have a cluster where SMART data is available from the disks (tested with 
> smartctl and visible in the Ceph dashboard), but even with an enabled 
> diskprediction_local module no health and lifetime info is shown.

The module don't have new commits for more than two year
I suggest to use smartctl_exporter [1] for monitoring drives health


[1] https://github.com/prometheus-community/smartctl_exporter
k
___
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io