[ceph-users] Re: iDRAC 9 version 6.10 shows 0% for write endurance on non-dell drives, work around?

2023-02-14 Thread Konstantin Shalygin

Hi,

You can use smartctl_exporter [1] for all your media, not only the SSD


k
[1] https://github.com/prometheus-community/smartctl_exporter

Sent from my iPhone

> On 14 Feb 2023, at 23:01, Drew Weaver  wrote:
> Hello,
> 
> After upgrading a lot of iDRAC9 modules to version 6.10 in servers that are 
> involved in a Ceph cluster we noticed that the iDRAC9 shows the write 
> endurance as 0% on any non-certified disk.
> 
> OMSA still shows the correct remaining write endurance but I am assuming that 
> they are working feverishly to eliminate that too.
> 
> I opened a support ticket with Dell once this was brought to my attention and 
> they basically told me that I was lucky that it ever worked at all, which I 
> thought was an odd response given that the iDRAC enterprise licenses cost 
> several hundred dollars each.
> 
> I know that the old Intel Datacenter Tool used to be able to reach through a 
> MegaRAID controller and read the remaining write endurance but that tool is 
> essentially defunct now.
> 
> What are you folks using to monitor your write endurance on your SSDs that 
> you couldn't buy from Dell because they had a 16 week lead time while the MFG 
> could deliver the drives in 3 days?
> 
> Thanks,
> -Drew
> 
> ___
> ceph-users mailing list -- ceph-users@ceph.io
> To unsubscribe send an email to ceph-users-le...@ceph.io
___
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io


[ceph-users] Re: iDRAC 9 version 6.10 shows 0% for write endurance on non-dell drives, work around? [EXT]

2023-02-14 Thread Drew Weaver
That is pretty awesome, I will look into doing it that way. All of our 
monitoring is integrated to use the very very expensive DRAC enterprise license 
we pay for (my fault for trusting Dell).

We are looking for a new hardware vendor but this will likely work for the 
mistake we already made.

Thanks,
-Drew





-Original Message-
From: Dave Holland  
Sent: Tuesday, February 14, 2023 11:39 AM
To: Drew Weaver 
Cc: 'ceph-users@ceph.io' 
Subject: Re: [ceph-users] iDRAC 9 version 6.10 shows 0% for write endurance on 
non-dell drives, work around? [EXT]

On Tue, Feb 14, 2023 at 04:00:30PM +, Drew Weaver wrote:
> What are you folks using to monitor your write endurance on your SSDs that 
> you couldn't buy from Dell because they had a 16 week lead time while the MFG 
> could deliver the drives in 3 days?

Our Ceph servers are SuperMicro not Dell but this approach is portable. We 
wrote a little shell script to parse the output of "nvme"
and/or "smartctl" every hour and send the data to a Graphite server.
We have a Grafana dashboard to display the all-important graphs. After
~5 years life, our most worn NVMe (used for journal/db only -- data is on HDD) 
is showing 89% life remaining.

Dave
-- 
**   Dave Holland   ** Systems Support -- Informatics Systems Group **
** d...@sanger.ac.uk **Wellcome Sanger Institute, Hinxton, UK**


--
 The Wellcome Sanger Institute is operated by Genome Research  Limited, a 
charity registered in England with number 1021457 and a  company registered in 
England with number 2742969, whose registered  office is 215 Euston Road, 
London, NW1 2BE. 
___
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io


[ceph-users] Re: iDRAC 9 version 6.10 shows 0% for write endurance on non-dell drives, work around? [EXT]

2023-02-14 Thread Dave Holland
On Tue, Feb 14, 2023 at 04:00:30PM +, Drew Weaver wrote:
> What are you folks using to monitor your write endurance on your SSDs that 
> you couldn't buy from Dell because they had a 16 week lead time while the MFG 
> could deliver the drives in 3 days?

Our Ceph servers are SuperMicro not Dell but this approach is
portable. We wrote a little shell script to parse the output of "nvme"
and/or "smartctl" every hour and send the data to a Graphite server.
We have a Grafana dashboard to display the all-important graphs. After
~5 years life, our most worn NVMe (used for journal/db only -- data is
on HDD) is showing 89% life remaining.

Dave
-- 
**   Dave Holland   ** Systems Support -- Informatics Systems Group **
** d...@sanger.ac.uk **Wellcome Sanger Institute, Hinxton, UK**


-- 
 The Wellcome Sanger Institute is operated by Genome Research 
 Limited, a charity registered in England with number 1021457 and a 
 company registered in England with number 2742969, whose registered 
 office is 215 Euston Road, London, NW1 2BE. 
___
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io