Hello,
I think I may have a faild device on my linux
raid. As I am not sure what sorts of status
messages go in /proc/mdstat, I am not sure if the
device is failed or not.
I have a DPT raid tower with hot swap drive bays.
I conducted a test to try to generate an error by
removing one of the drives.
Here is the before and after status.
The status should be:
md0 : active raid5 sdf1[5] sde1[4] sdd1[3] sdc1[2]
sdb1[1] sda1[0]
71681024
+blocks level 5, 256k chunk, algorithm 0 [5/5]
[UUUUU]
The status is:
md0 : active raid5 sdf1[4] sde1[4](F) sdd1[3]
sdc1[2] sdb1[1] sda1[0]
71681024
+blocks level 5, 256k chunk, algorithm 0 [5/5]
[UUUUU]
I noted the (F) by sde1. Does this stand for
failed? Is there any references to the types of
errors that will be reported in the syslog or
/proc/mdstat?
I am concerned, because a production system has
the same (F) by one of its drives.
Personalities : [raid5]
read_ahead 1024 sectors
md0 : active raid5 sdg1[6] sdf1[5] sde1[4] sdd1[3]
sdc1[2] sdb1[1](F) sda1[0] 106653696 blocks level
5, 256k chunk, algorithm 0 [7/6] [U_UUUUU]
unused devices: <none>
Reading this status from /proc/mdstat, I am
thinking that the raid is running in degraded mode
with "sdb1" as the failed drive. The [7/6], does
that mean that there are 7 devices and only 6 are
currently running?
Thanks,
Kirk
--
Kirk Patton
System Administrator
CA Microprocessor Division
Advanced Micro Devices
Phone (408) 774-7883
Pgr. (408) 870-5461
Fax (408)774-7606