Re: IDE Hard Drive maintenance

2003-05-06 Thread Michel Lanners

This electronic message is not binding on its sender nor on Cetrel S.C.
Any use of information of this mail except the use by the addressee
within his or her business relation with Cetrel is strictly forbidden
CETREL S.C. L-2956 Luxembourg; Tel: 00352 35566-1; http://www.cetrel.lu
=== 

Sorry for replying to such old mails, but I'm cleaning my mailboxes
:-)

Lauchlin Wilkinson wrote:
 I was wondering what most people on the list did when it came to keeping
 tabs on the health of IDE hard drives?  I have a server in  a remote
 location that I fear has one HD that is going flaky. Is there a way of
 doing a bad block scan on a mounted partition safely or am I asking the
 impossible.

For monitoring, use SMART as in smartmontools (it's in unstable, but
you can recompile easily for stable). It's a more advanced version of
smartsuite in stable.

Some things to watch out for:

- enable SMART on your drives. Some may have it disabled by default.
- enable automatic offline tests. These are non-destructive and
non-captive, i.e. they can run in the background. On not-too-busy
servers, this load on the disk should not be a problem. YMMV. Don't
know whether it's the same for all disks, but I have one at home that
does tests every 4 hours.
- configure smartd to send email on problems.
- have a watch on the SMART error log on the drives.

If something apears in the logs, you will also see the block address
in there, but it might not be obvious to associate that to a
filesystem or devic block as you see it 'from the outisde'. You can
run badblocks to find that info.

Also be carefull when translating block numbers. badblocks will report
filesystem blocks (thei size is in the superblock), the kernel log
will show device blocks (i.e. 512-byte blocks), and something else I
forget will show 1K blocks. Just be sure you get it right

A trick to make disks with isolated read errors behave again (although
you shouldn't trust them too much important data...) is to _write_ to
those blocks. This will make the drive controller remap those bad
blocks to good spare blocks. Bingo, errors disappeared. Until the next
one appears :-)

Cheers

Michel

-- 

Michel Lanners  | Being able to break security
PRO-SSC |  doesn't make you a hacker
[EMAIL PROTECTED]|  more than being able to hotwire cars
Cetrel S.C. |  makes you an automotive engineer.
10, Parc d'Activite Syrdall |
L-5365 Munsbach |   Eric S. Raymond




Re: IDE Hard Drive maintenance

2003-01-10 Thread Christian Hammers
On Tue, Jan 07, 2003 at 02:31:33PM +1100, Lauchlin Wilkinson wrote:
 I was wondering what most people on the list did when it came to keeping
 tabs on the health of IDE hard drives?  I have a server in  a remote
Apart from that you should install sensors to monitor your systems
temperature, logcheck to let it mail you any anomalies and the usual 
bigbrother/netsaint/mon to watch services.

bye,

-christian-

-- 
Christian HammersWESTEND GmbH - Aachen und Dueren Tel 0241/701333-0
[EMAIL PROTECTED] Internet  Security for ProfessionalsFax 0241/911879
  WESTEND ist CISCO Systems Partner - Authorized Reseller


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]




IDE Hard Drive maintenance

2003-01-06 Thread Lauchlin Wilkinson
Hi,

I was wondering what most people on the list did when it came to keeping
tabs on the health of IDE hard drives?  I have a server in  a remote
location that I fear has one HD that is going flaky. Is there a way of
doing a bad block scan on a mounted partition safely or am I asking the
impossible.

Cheers,

Lauchlin


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]




Re: IDE Hard Drive maintenance

2003-01-06 Thread Fraser Campbell
On January 6, 2003 10:31 pm, the fabulous Lauchlin Wilkinson wrote:

 I was wondering what most people on the list did when it came to keeping
 tabs on the health of IDE hard drives?  I have a server in  a remote
 location that I fear has one HD that is going flaky. Is there a way of
 doing a bad block scan on a mounted partition safely or am I asking the
 impossible.

I have recently (past 6 months) begun using ide-smart and hddtemp to monitor 
health of my IDE drives.  I don't have it on a lot of systems yet so they 
have yet to aid me in detecting (hopefully pre-detecting) failures.

I'd be very interested in hearing if anyone has successfully used these tools 
(or others) for predicting drive failures.

This is from the ide-smart man page:

   ide-smart performs and queries the results of various  non  destructive
   tests on a SMART capable IDE DEVICE.  You must have a BIOS and hardware
   that supports it.

   SMART stands for Self-Monitoring, Analysis  and  Reporting  Technology,
   which  provides  near  future  failure  prediction monitoring different
   attributes (listed as Id) of the device.  If a value of a particular Id
   (or  attribute) is under a certain threshold, the device most probabily
   will be about to fail.

This is from the hddtemp man page:

   hddtemp will give you the temperature of your IDE hard drive by reading
   Self-Monitoring Analysis and Reporting Technology (S.M.A.R.T)  informa-
   tion  (on  drives  that  support this feature). Only modern hard drives
   have a temperature sensor. hddtemp does not support  reading  S.M.A.R.T
   information from SCSI drives.



-- 
Fraser
real sig coming RSN


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]