Hello,

I think I may have a faild device on my linux
raid.  As I am not sure what sorts of status
messages go in /proc/mdstat, I am not sure if the
device is failed or not.

I have a DPT raid tower with hot swap drive bays.
I conducted a test to try to generate an error by
removing one of the drives.

Here is the before and after status.

The status should be:
md0 : active raid5 sdf1[5] sde1[4] sdd1[3] sdc1[2]
sdb1[1] sda1[0] 
71681024
+blocks level 5, 256k chunk, algorithm 0 [5/5]
[UUUUU]

The status is:
md0 : active raid5 sdf1[4] sde1[4](F) sdd1[3]
sdc1[2] sdb1[1] sda1[0] 
71681024
+blocks level 5, 256k chunk, algorithm 0 [5/5]
[UUUUU]

I noted the (F) by sde1.  Does this stand for
failed?  Is there any references to the types of
errors that will be reported in the syslog or
/proc/mdstat?

I am concerned, because a production system has
the same (F) by one of its drives.

Personalities : [raid5] 
read_ahead 1024 sectors
md0 : active raid5 sdg1[6] sdf1[5] sde1[4] sdd1[3]
sdc1[2] sdb1[1](F) sda1[0] 106653696 blocks level
5, 256k chunk, algorithm 0 [7/6] [U_UUUUU]
unused devices: <none>

Reading this status from /proc/mdstat, I am
thinking that the raid is running in degraded mode
with "sdb1" as the failed drive.  The [7/6],  does
that mean that there are 7 devices and only 6 are
currently running?

Thanks,
Kirk



-- 
Kirk Patton
System Administrator
CA Microprocessor Division
Advanced Micro Devices
Phone (408) 774-7883
Pgr. (408) 870-5461
Fax (408)774-7606

Reply via email to