What model hard drives? The only time I can think of having seen repeated scrubs doing that is when the hard drives were e.g. Samsung HD204UI drives with the hilarious firmware bug [1].
- Rich [1] - http://knowledge.seagate.com/articles/en_US/FAQ/223571en On Tue, Jan 21, 2014 at 1:01 PM, Swâmi Petaramesh <sw...@petaramesh.org> wrote: > Hi there, > > I have met a dramatic issue on my Linux (Ubuntu 13.10) box running ZFS as its > root filesystem (zfsonlinux), and I'm afraid all my data is lost (but I would > do about anything for getting it back...) > > I have a ZFS pool made out of a disk mirror (sda3, sdb3) plus a L2 cache out > of an SSD (sdd4). > > I the past, when scrubbing the pool, I happened to get some errors, mostly on > sda, which zfs "fixed". > > I was puzzled by that because the disks SMART says "no errors whatsoever", > disks SMART tests pass OK , syslog doesn't record interface errors, system > memory (non-ECC) passes Memtest86+, system never malfunctions... so there's no > visible issue except for ZFS recording (and fixing ?) errors while scrubbing. > > I decided to "live whith this for a while because no money for replacing a > working HD"... (Yes, people before you tell me go get other disks, another > mobo, another PSU... I'm really straight out of cash... It's not an option.) > > At some point in time my box PSU died, and as I needed my system I just > dropped the 2 disks in another box, and it kept on working (Linux magic). > > I took advantage of that to perform another couple scrubs with the new box, > and it gave about the same results (so the issue lies either with the disks or > ZFS software ?) > > I eventually got another PSU for my initial system to repair, and dropped the > disks back in. > > I messed a bit with the SATA cables and drives order, and as Linux doesn't > seem to be able to use drives IDs, but devices names, for a root pool (too > bad...) my system happened to come up with a degraded mirror on a single disk > (sda3, missing sdb3). But OK. > > I turned the system off, fiddled with the cables, restarted, I reinserted > sdb3, > and then it became to resilver. > > After a day it eventually finished, but resilvering had noticed about 120 > "Checksum errors" on sda, and about 10 on sdb. It said that the system had > found an uncorrectable error, identifying it something like <metadata>: <00x> > > Still, it was working but I didn't know how to clear this seemingly minor > error. > > Turned the system off. > > The next day the system booted OK, but still started immediately to "resilver" > again, still showing quite the same amount of errors as usual. > > But at some point the system completely hanged, leaving me no other choice > than pulling the power cord. > > > Since, my system won't boot at all. Trying to mount the root pool ends in the > following kernel rude words you can see here: > > https://www.dropbox.com/s/ggrl2148t9brehh/P1030505.JPG > https://www.dropbox.com/s/sm0hfmpjy63emj4/P1030506.JPG > > I tried to boot an Ubuntu live USB, then install ZFS and import the pool, with > the same result. > > I got the same system crashes trying to import in FreeBSD : > https://www.dropbox.com/s/f2jtg864jzut6o5/P1030508.JPG > > And even it crashed OpenIndiana (so fast that I could only take a blurry pic): > https://www.dropbox.com/s/03w81p1xshekb79/P1030511.JPG > > I'm currently getting some help and the issue is being worked on here: > https://github.com/zfsonlinux/spl/issues/329 > > ....Where you can find links to other pics with debugging output, the last one > so far ending in error...: > https://www.dropbox.com/s/jdxfn6zq9ffv02q/P1030517.JPG > > I'd love to be able to rescue data from this pool, as there's about 900GB > there, part of it being "not easily replaceable"... (Read: I don't backup > things I can live without, but would love to get them back ;-) > > Any help will be warmly welcomed and highly appreciated. It's a real bad > situation :-( > > TIA > > -- > Swâmi Petaramesh <sw...@petaramesh.org> http://petaramesh.org PGP 9076E32E > _______________________________________________ > developer mailing list > developer@open-zfs.org > http://lists.open-zfs.org/mailman/listinfo/developer _______________________________________________ developer mailing list developer@open-zfs.org http://lists.open-zfs.org/mailman/listinfo/developer