Hi all,
        I have a pernennial problem with a raid array (which contains a 400GB 
filesystem 72% full.
Since its raid 1 I am bemused that I keep getting file-system errors every 2 or 
threee days on reboot (requiring a manual fsck).
The kernel log shows (e.g):
___________________________
....
....
....
[ 7551.160178] ata10.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6
[ 7551.160182] ata10.00: BMDMA stat 0x4
[ 7551.160186] ata10.00: failed command: WRITE DMA
[ 7551.160194] ata10.00: cmd ca/00:18:39:50:a1/00:00:00:00:00/e9 tag 0 dma 
12288 out
[ 7551.160196]          res 51/84:00:50:50:a1/00:00:00:00:00/e9 Emask 0x10 
(ATA bus error)
[ 7551.160200] ata10.00: status: { DRDY ERR }
[ 7551.160203] ata10.00: error: { ICRC ABRT }
[ 7551.160236] ata10: soft resetting link
[ 7551.323572] ata10.00: configured for UDMA/100
[ 7551.329343] ata10.01: configured for UDMA/66
[ 7551.329595] ata10: EH complete
....
....
....
________________________

Is anyone able to reasonably decipher this?
I recently upgraded the filesystem from ext3 to ext4 (via "appropriate 
incantation"  of tune2fs).
This made not the slightest difference.
Strangley, after the fscks has fixed things (mainly dup allocated blocks) I 
never appear to have lost anything (!).

Any (reasonably polite) suggestions gratefully received. In particuylar - how 
to match the ata no. given with one of my actual disks 
filesystems mounted as follows:

/dev/mapper/vg_waverley-lv_root on / type ext4 (rw)
proc on /proc type proc (rw)
sysfs on /sys type sysfs (rw)
devpts on /dev/pts type devpts (rw,gid=5,mode=620)
tmpfs on /dev/shm type tmpfs (rw,rootcontext="system_u:object_r:tmpfs_t:s0")
/dev/sda1 on /boot type ext4 (rw)
/dev/mapper/vg_waverley-lv_home on /opt type ext4 (rw)
/dev/md1 on /home type ext4 (rw)
/dev/md0 on /multimedia type ext3 (rw)
none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw)
fusectl on /sys/fs/fuse/connections type fusectl (rw)
gvfs-fuse-daemon on /home/andyp/.gvfs type fuse.gvfs-fuse-daemon 
(rw,nosuid,nodev,user=andyp)


/proc/diskstats:
_______________
...
...
...
   8       0 sda 72495 13319 3310860 320940 21615 267129 2184958 674670 0 
250084 995628
   8       1 sda1 644 250 9656 919 170 17603 35558 3187 0 1309 4106
   8       2 sda2 71683 13065 3299828 319382 18924 249526 2149400 637608 0 
215408 957010
  11       0 sr0 0 0 0 0 0 0 0 0 0 0 0
   8      16 sdb 27076 13275 1084119 146144 10482 18556 231792 92334 0 218764 
238432
   8      17 sdb1 615 1460 3079 547 0 0 0 0 0 469 546
   8      18 sdb2 2083 1991 14586 2839 2 0 16 3 0 2738 2842
   8      19 sdb3 22345 9518 909948 134862 10385 18551 231528 91810 0 209747 
226630
   8      20 sdb4 1985 290 155994 7715 26 5 248 210 0 6991 7922
   8      32 sdc 23652 12092 690676 137121 10457 18579 231776 93315 0 210435 
230407
   8      33 sdc1 691 1384 3079 495 0 0 0 0 0 437 494
   8      34 sdc2 412 1523 1959 549 0 0 0 0 0 518 548
   8      35 sdc3 20659 8873 550762 129205 10362 18574 231528 92761 0 203760 
221944
   8      36 sdc4 1842 296 134364 6601 26 5 248 217 0 6235 6813
  11       1 sr1 0 0 0 0 0 0 0 0 0 0 0
 253       0 dm-0 83866 0 3290154 502689 268450 0 2143624 13403047 0 248238 
13905734
 253       1 dm-1 394 0 3152 1919 719 0 5752 4035 0 994 5954
   9       1 md1 61401 0 1460282 0 25500 0 203448 0 0 0 0
   9       0 md0 4348 0 289946 0 19 0 152 0 0 0 0
 253       2 dm-2 722 0 5770 1378 5 0 24 78 0 554 1456
...
...
_______________

/proc/mdstat:
_____________
Personalities : [raid1] 
md0 : active raid1 sdc4[1] sdb4[0]
      58605056 blocks [2/2] [UU]
      
md1 : active raid1 sdc3[1] sdb3[0]
      58605056 blocks [2/2] [UU]
      
unused devices: <none>
_______________


md1 is the offending mirror.

System is FC14: Fedora release 14 (Laughlin)
(But I was getting this with FC11 prior to an upgrade)

TIA
Andy







--
Next meeting:  Blandford Forum, Wednesday 2011-03-02 20:00
Meets, Mailing list, IRC, LinkedIn, ...  http://dorset.lug.org.uk/
How to Report Bugs Effectively:  http://goo.gl/4Xue

Reply via email to