-----BEGIN PGP SIGNED MESSAGE-----
Hash: RIPEMD160

Hi Ian,


On 05/20/2014 09:51 PM, Ian Collins via smartos-discuss wrote:
> Has anyone ever seen erroneous overheat errors from a 4U
> Supermicro chassis?
> 
> I'm currently testing a new system with pretty standard parts: a 
> Supermicro X9DRD-7LN4F-JBOD motherboard in a CSE-846BE16-R1K28B 4U 
> chassis with 22 3TB drives.  Almost as soon as SmartOS (or Solaris
> to double check) boots, most of the disk overheat LEDs illuminate
> and FMA takes them out of service.  The system is well cooled and
> the drives are cool to touch.  Odd.

Yes, we see this on Solaris 11.1 too. There seems to be something
seriously wrong with the topology information in fmadm for these
systems. Our workaround is to disable these tests altogether (there
are numerous other temperature sensors in these systems and disk
errors tend to be noticed by SMART reporting, so we do not see much
harm in this).

Disable this through "/usr/lib/fm/fmd/plugins/disk-transport.conf"

In our configuration management we have the following as default:

[origin /export/itree/info]# cat files//usr/lib/fm/fmd/plugins/disk-transport.conf/_DEFAULT
#
# Copyright (c) 2007, 2011, Oracle and/or its affiliates. All rights reserved.
#

# This file allows configuration of the disk-transport functionality.  All
# parameters are optional.
# Parameters are set using the syntax:
# setprop <parameter> <value>
#
# Supported parameters are as follows:
#
# The "interval" parameter governs how frequently the transport runs.  Default
# value is one hour (60m).
# setprop interval <time interval>
#
# The "temp-multiple", "selftest-multiple" and "smart-multiple" parameters
# indicate on which multiple of the interval parameter that this particular test
# runs. Zero (0) prevents the test from executing.  Default value is one (1).
#
# WARNING: Setting multiples to zero (0) is not recommended especially for the
# smart test as this will prevent the system from detecting and reporting disk
# errors.
#
# setprop temp-multiple     <integer>
# setprop selftest-multiple <integer>
# setprop smart-multiple    <integer>
setprop temp-multiple     0
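
To confirm that a given copy of the file really has the temperature test switched off, something like the following might do. This is only a sketch: the function name temp_check_disabled is made up for illustration, and it assumes a grep that understands -E and -q.

```shell
#!/bin/sh
# Sketch: succeed if the given disk-transport.conf-style file contains an
# uncommented "setprop temp-multiple 0" line.
# The function name is hypothetical, not part of any FMA tooling.
temp_check_disabled() {
    # Allow any amount of whitespace between the three fields; lines that
    # start with "#" do not match because of the leading anchor.
    grep -Eq '^[[:space:]]*setprop[[:space:]]+temp-multiple[[:space:]]+0[[:space:]]*$' "$1"
}

# Usage, with the path mentioned above:
# temp_check_disabled /usr/lib/fm/fmd/plugins/disk-transport.conf \
#     && echo "temperature test disabled"
```

The commented "# setprop temp-multiple <integer>" template lines in the file are ignored, so only the active setting counts.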





In case fmadm already disabled your disks, just run:

# for i in `fmadm faulty | grep Major | awk '{ print $4 }'`; do fmadm acquit $i; done
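
If you would rather see what the loop is going to acquit before running it, a dry-run variant could look like this. It is a sketch only: the helper name major_fault_ids is made up, and it relies on the same assumption as the one-liner above, namely that the ID to acquit is the fourth field of each "Major" line in the fmadm faulty output.

```shell
#!/bin/sh
# Hypothetical helper: read `fmadm faulty` output on stdin and print the
# fourth field of every line containing "Major", as the loop above does.
major_fault_ids() {
    awk '/Major/ { print $4 }'
}

# Dry run: print the acquit commands instead of executing them.
# fmadm faulty | major_fault_ids | while read -r id; do
#     echo fmadm acquit "$id"
# done
```

Dropping the "echo" turns the dry run into the real thing.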

Good luck!

Jeroen

- -- 
Jeroen Roodhart              University of Amsterdam
Strategic IT Consultant      Science faculty
                             Researchers IT support
[email protected]          [email protected]
Tel. 020 525 7203
- --
See http://www.uva.nl/profile/j.r.roodhart for openPGP public key
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iEYEAREDAAYFAlN7wOEACgkQ37AP1zFtDU2cWwCg1qnFgSMaEqOptvCYwZx12DHl
hC8An0lmL2bk20h5fKuoDn1iYzksLMHM
=5F34
-----END PGP SIGNATURE-----