Re: [gentoo-user] Is my SSD dying?

2017-09-06 Thread R0b0t1
On Wednesday, September 6, 2017, Peter Humphrey 
wrote:
> On Sunday, 3 September 2017 21:56:43 BST R0b0t1 wrote:
>> On Sun, Sep 3, 2017 at 2:39 AM, Peter Humphrey 
> wrote:
>> > On Sunday, 3 September 2017 03:34:06 BST R0b0t1 wrote:
>> >> On Sat, Sep 2, 2017 at 8:49 AM, Peter Humphrey
> 
>> >
>> > wrote:
>> >> > A week or two ago I was investigating some other weirdnesses and at
>> >> > one
>> >> > point I zeroed out the first partition: the unformatted one
>> >> > containing
>> >> > the UEFI data. It took longer than I expected, having only 2MB to
>> >> > fill.
>> >> > I wonder if it strayed outside the partition...
>> >>
>> >> Are you trimming your drive?
>> >
>> > Yes; this is root's crontab:
>> >
>> > 9 3,15 * * */sbin/fstrim -a
>>
>> I think a reduction in drive performance (when you are maintaining it
>> properly) is the best argument for being ready to replace the drive,
>> as this seems unlikely to happen to me unless the drive is actually
>> wearing out.
>
> I haven't noticed any degradation of performance, though I haven't run any
> tests.
>

I interpreted the slow zeroing as a performance decrease. If you can
benchmark to check you may want to.

If that situation doesn't correspond to a general decrease in performance I
will be very surprised.

>> At the same time I have seen this exact situation fixed by a firmware
>> upgrade. Still, this seems more alarming than the other issues you've
>> described.
>
> Do you mean the firmware of the NVMe drive? How would I go about that? I
> don't see any mention of firmware on Samsung's site.
>

Typically a closed source Windows program which may bundle the firmware in
it.


Re: [gentoo-user] Is my SSD dying?

2017-09-06 Thread Daniel Frey
On 09/06/2017 03:01 AM, Peter Humphrey wrote:
>> At the same time I have seen this exact situation fixed by a firmware
>> upgrade. Still, this seems more alarming than the other issues you've
>> described.
> 
> Do you mean the firmware of the NVMe drive? How would I go about that? I 
> don't see any mention of firmware on Samsung's site.
> 

I mentioned the same in an earlier post, but unfortunately it seems
Samsung's Magician software is required to do this (Windows application.)

Dan



Re: [gentoo-user] Is my SSD dying?

2017-09-06 Thread Peter Humphrey
On Sunday, 3 September 2017 21:56:43 BST R0b0t1 wrote:
> On Sun, Sep 3, 2017 at 2:39 AM, Peter Humphrey  
wrote:
> > On Sunday, 3 September 2017 03:34:06 BST R0b0t1 wrote:
> >> On Sat, Sep 2, 2017 at 8:49 AM, Peter Humphrey 

> > 
> > wrote:
> >> > A week or two ago I was investigating some other weirdnesses and at
> >> > one
> >> > point I zeroed out the first partition: the unformatted one
> >> > containing
> >> > the UEFI data. It took longer than I expected, having only 2MB to
> >> > fill.
> >> > I wonder if it strayed outside the partition...
> >> 
> >> Are you trimming your drive?
> > 
> > Yes; this is root's crontab:
> > 
> > 9 3,15 * * */sbin/fstrim -a
> 
> I think a reduction in drive performance (when you are maintaining it
> properly) is the best argument for being ready to replace the drive,
> as this seems unlikely to happen to me unless the drive is actually
> wearing out.

I haven't noticed any degradation of performance, though I haven't run any 
tests.

> At the same time I have seen this exact situation fixed by a firmware
> upgrade. Still, this seems more alarming than the other issues you've
> described.

Do you mean the firmware of the NVMe drive? How would I go about that? I 
don't see any mention of firmware on Samsung's site.

-- 
Regards
Peter




Re: [gentoo-user] Is my SSD dying?

2017-09-03 Thread R0b0t1
On Sun, Sep 3, 2017 at 2:39 AM, Peter Humphrey  wrote:
> On Sunday, 3 September 2017 03:34:06 BST R0b0t1 wrote:
>> On Sat, Sep 2, 2017 at 8:49 AM, Peter Humphrey 
> wrote:
>> > A week or two ago I was investigating some other weirdnesses and at one
>> > point I zeroed out the first partition: the unformatted one containing
>> > the UEFI data. It took longer than I expected, having only 2MB to fill.
>> > I wonder if it strayed outside the partition...
>>
>> Are you trimming your drive?
>
> Yes; this is root's crontab:
>
> 9 3,15 * * */sbin/fstrim -a
>

I think a reduction in drive performance (when you are maintaining it
properly) is the best argument for being ready to replace the drive,
as this seems unlikely to happen to me unless the drive is actually
wearing out.

At the same time I have seen this exact situation fixed by a firmware
upgrade. Still, this seems more alarming than the other issues you've
described.



Re: [gentoo-user] Is my SSD dying?

2017-09-03 Thread Peter Humphrey
On Sunday, 3 September 2017 03:34:06 BST R0b0t1 wrote:
> On Sat, Sep 2, 2017 at 8:49 AM, Peter Humphrey  
wrote:
> > A week or two ago I was investigating some other weirdnesses and at one
> > point I zeroed out the first partition: the unformatted one containing
> > the UEFI data. It took longer than I expected, having only 2MB to fill.
> > I wonder if it strayed outside the partition...
> 
> Are you trimming your drive?

Yes; this is root's crontab:

9 3,15 * * */sbin/fstrim -a

-- 
Regards,
Peter.



Re: [gentoo-user] Is my SSD dying?

2017-09-02 Thread R0b0t1
On Sat, Sep 2, 2017 at 8:49 AM, Peter Humphrey  wrote:
> A week or two ago I was investigating some other weirdnesses and at one
> point I zeroed out the first partition: the unformatted one containing the
> UEFI data. It took longer than I expected, having only 2MB to fill. I wonder
> if it strayed outside the partition...
>

Are you trimming your drive?



Re: [gentoo-user] Is my SSD dying?

2017-09-02 Thread Peter Humphrey
On Saturday, 2 September 2017 13:28:44 BST Jacques Montier wrote:

> I once encountered the problem with my Crucial SSD.
> I found a procedure to make the SSD detected which worked for me.
> http://forums.crucial.com/t5/Crucial-SSDs/Why-did-my-SSD-quot-disappear-qu
> ot-from-my-system/ta-p/65215 Hope this will help.

Mine's by Samsung:

# lspci -v -s 05:00.0
05:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD 
Controller SM951/PM951 (rev 01) (prog-if 02 [NVM Express])
Subsystem: Samsung Electronics Co Ltd NVMe SSD Controller SM951/
PM951
Flags: bus master, fast devsel, latency 0, IRQ 35, NUMA node 0
Memory at fbd0 (64-bit, non-prefetchable) [size=16K]
I/O ports at d000 [size=256]
Capabilities: [40] Power Management version 3
Capabilities: [50] MSI: Enable- Count=1/8 Maskable- 64bit+
Capabilities: [70] Express Endpoint, MSI 00
Capabilities: [b0] MSI-X: Enable+ Count=9 Masked-
Capabilities: [100] Advanced Error Reporting
Capabilities: [148] Device Serial Number 00-00-00-00-00-00-00-00
Capabilities: [158] Power Budgeting 
Capabilities: [168] #19
Capabilities: [188] Latency Tolerance Reporting
Capabilities: [190] L1 PM Substates
Kernel driver in use: nvme

It's clearly visible to the kernel, and smartd finds it too if I tell it 
what to look for.

I don't want to start unplugging it unless I have to, as it's in a PCI slot 
and I'd probably make things worse. And it is only 18 months old, as I said.

A week or two ago I was investigating some other weirdnesses and at one 
point I zeroed out the first partition: the unformatted one containing the 
UEFI data. It took longer than I expected, having only 2MB to fill. I wonder 
if it strayed outside the partition...

-- 
Regards,
Peter.



Re: [gentoo-user] Is my SSD dying?

2017-09-02 Thread Jacques Montier
Hello,

I once encountered the problem with my Crucial SSD.
I found a procedure to make the SSD detected which worked for me.
http://forums.crucial.com/t5/Crucial-SSDs/Why-did-my-SSD-quot-disappear-quot-from-my-system/ta-p/65215
Hope this will help.

Cheers,



*--*
*Jacques*

2017-09-02 11:51 GMT+02:00 Peter Humphrey :

> On Saturday, 2 September 2017 10:32:23 BST I wrote:
>
> > ... Now smartmon appears to run ok - provided that I remove DEVICESCAN
> > from /etc/smartd.conf and give it a specific device to monitor ...
>
> Some months ago someone here mentioned a test suite for SSDs, but I can't
> remember what it was called and now I can't find it. Can someone point me
> in
> the right direction, please?
>
> --
> Regards,
> Peter.
>
>
>


Re: [gentoo-user] Is my SSD dying?

2017-09-02 Thread Peter Humphrey
On Saturday, 2 September 2017 10:32:23 BST I wrote:

> ... Now smartmon appears to run ok - provided that I remove DEVICESCAN
> from /etc/smartd.conf and give it a specific device to monitor ...

Some months ago someone here mentioned a test suite for SSDs, but I can't 
remember what it was called and now I can't find it. Can someone point me in 
the right direction, please?

-- 
Regards,
Peter.




Re: [gentoo-user] Is my SSD dying?

2017-09-02 Thread Peter Humphrey
On Saturday, 2 September 2017 02:24:57 BST Adam Carter wrote:
> On Fri, Sep 1, 2017 at 7:46 PM, Peter Humphrey 
> wrote:

> > For the last week or two my NVMe SSD isn't being detected on startup. I
> > get this error on manual invocation:
> > 
> > # smartctl -a /dev/nvme0n1
> > smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.12.5-gentoo] (local build)
> 
> Probably also worth updating to 4.12.10, there's some important sounding
> security fixes in it, and the Changelogs for 4.12.6 and 4.12.8 mention
> nvme.

I went to version 6.5. Now smartmon appears to run ok - provided that I 
remove DEVICESCAN from /etc/smartd.conf and give it a specific device to 
monitor, like this:

/dev/nvme0n1 -a -o on -S on -s (S/../.././02|L/../../6/03)

(following an example in the file). So I'm still feeling somewhat edgy.

-- 
Regards,
Peter.



Re: [gentoo-user] Is my SSD dying?

2017-09-02 Thread Peter Humphrey
On Friday, 1 September 2017 10:54:45 BST Arthur Țițeică wrote:
> În 1 septembrie 2017 12:46:39 EEST, Peter Humphrey  
a scris:
> >Hello list,
> >
> >For the last week or two my NVMe SSD isn't being detected on startup. I
> >get
> >this error on manual invocation:
> >
> ># smartctl -a /dev/nvme0n1
> >smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.12.5-gentoo] (local
> >build)
> >Copyright (C) 2002-15, Bruce Allen, Christian Franke,
> >www.smartmontools.org
> >
> >/dev/nvme0n1: Unable to detect device type
> >Please specify device type with the -d option.
> 
> Smartmontools supports NVMe starting from version 6.5.

That was it - thanks. It's odd, though, that I hadn't noticed this before.

> >Most things still seem to be working, but do I need to rush out and buy
> >
> >another drive? This one's only 18 months old. I don't really want to
> >box up
> >the machine and send it to Watford under warranty.
> >
> >Two things that aren't working properly are KMail (surprise!), and
> >BOINC,
> >which insists that VirtualBox isn't installed, when of course it (more
> >or
> >less) always has been.
> 
> Is the boinc user in the vboxusers group?

Yes:

# groups boinc
vboxusers boinc

-- 
Regards,
Peter.




Re: [gentoo-user] Is my SSD dying?

2017-09-01 Thread Adam Carter
On Fri, Sep 1, 2017 at 7:46 PM, Peter Humphrey 
wrote:

> Hello list,
>
> For the last week or two my NVMe SSD isn't being detected on startup. I get
> this error on manual invocation:
>
> # smartctl -a /dev/nvme0n1
> smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.12.5-gentoo] (local build)
>
>
Probably also worth updating to 4.12.10, there's some important sounding
security fixes in it, and the Changelogs for 4.12.6 and 4.12.8 mention nvme.


Re: [gentoo-user] Is my SSD dying?

2017-09-01 Thread Daniel Frey
On 09/01/2017 02:46 AM, Peter Humphrey wrote:
> Hello list,
> 
> For the last week or two my NVMe SSD isn't being detected on startup. I get 
> this error on manual invocation:
> 
> # smartctl -a /dev/nvme0n1
> smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.12.5-gentoo] (local build)
> Copyright (C) 2002-15, Bruce Allen, Christian Franke, www.smartmontools.org
> 
> /dev/nvme0n1: Unable to detect device type
> Please specify device type with the -d option.
> 
> 
> Most things still seem to be working, but do I need to rush out and buy 
> another drive? This one's only 18 months old. I don't really want to box up 
> the machine and send it to Watford under warranty.
> 
> Two things that aren't working properly are KMail (surprise!), and BOINC, 
> which insists that VirtualBox isn't installed, when of course it (more or 
> less) always has been.
> 

If your BIOS isn't detecting it, it's probably on its way out. Before
dismissing it to that though, I'd see if there's a firmware update for it.

I had a Crucial (regular SSD, non-NVME) that was doing this and after a
firmware update it's still going 3 years later.

Make sure you back up your data on it (if you can.)

Dan



Re: [gentoo-user] Is my SSD dying?

2017-09-01 Thread Arthur Țițeică


În 1 septembrie 2017 12:46:39 EEST, Peter Humphrey  a 
scris:
>Hello list,
>
>For the last week or two my NVMe SSD isn't being detected on startup. I
>get 
>this error on manual invocation:
>
># smartctl -a /dev/nvme0n1
>smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.12.5-gentoo] (local
>build)
>Copyright (C) 2002-15, Bruce Allen, Christian Franke,
>www.smartmontools.org
>
>/dev/nvme0n1: Unable to detect device type
>Please specify device type with the -d option.
>

Smartmontools supports NVMe starting from version 6.5.

>
>Most things still seem to be working, but do I need to rush out and buy
>
>another drive? This one's only 18 months old. I don't really want to
>box up 
>the machine and send it to Watford under warranty.
>
>Two things that aren't working properly are KMail (surprise!), and
>BOINC, 
>which insists that VirtualBox isn't installed, when of course it (more
>or 
>less) always has been.

Is the boinc user in the vboxusers group?