I have a 9-drive system (four two-disk mirrors and one hot spare) with a 
10th drive, an SSD, for the ZIL.

The ZIL is corrupt.

I've been unable to recover the pool using FreeBSD 8, OpenSolaris x86, or 
logfix (http://github.com/pjjw/logfix).

In FreeBSD 8.0RC3 and below (uses v13 ZFS):
1) Boot Single User (both i386 and amd64)
2) /etc/rc.d/hostid start
3) "zpool import" locks up the system (indefinitely, or for at least 3 days)

In FreeBSD 8.0 Release:
1) Do #1 & #2 above, then "zpool import -f" results in being told there are 
missing elements (namely the log disk ad4p2)
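
The FreeBSD steps above can be sketched as a small script.  This is only a 
dry-run sketch: the run helper, the import_attempt function and the DRY_RUN 
variable are hypothetical names introduced for illustration, and by default 
the commands are printed rather than executed.

```shell
#!/bin/sh
# Dry-run sketch of the FreeBSD single-user sequence described above.
# DRY_RUN=1 (the default here) only prints each command; DRY_RUN=0 runs it.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: print the command in dry-run mode, otherwise run it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# Hypothetical wrapper around the steps listed in the post.
import_attempt() {
    # Step 2: set the hostid after booting single user.
    run /etc/rc.d/hostid start
    # Step 3: the import that locked up on 8.0RC3 and below.
    run zpool import
    # 8.0 Release: the forced form, which reported the log disk as missing.
    run zpool import -f
}

import_attempt
```

The commands are exactly those quoted in the steps; no pool name is passed 
because none is given above.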

In OpenSolaris x86:
1) "zpool import -f" reports log disk is missing.

Using logfix under OpenSolaris:
1) make new pool junkpool
2) Run logfix using a disk from the pool, the new log disk, and the guid of 
the old corrupt ZIL log taken from the FreeBSD box.
3) "zpool import -f" now behaves differently: it shows the new log but 
reports a disk pair as missing (the mirror of da4p5 & da5p5, using the 
FreeBSD names since I don't understand OpenSolaris names).  That pair shows 
up before the log disk is changed, but not afterwards.
4) If I remove the log disk, they reappear.
5) Of note, 8 of the disks (the four mirrors) are on one SAS HBA.  The spare 
is on another SATA controller with the SSD disk.
6) Could the problem be that the disks span controllers?  For example, 
c8t[1-8]d0s4 are the 8 disks, c7d0 is the spare, and c8d1 is the SSD.
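
One way to check whether the controller split matters would be to dump the 
ZFS label on each of the eight pool disks and compare the pool and vdev guids 
they report.  A minimal dry-run sketch, assuming the OpenSolaris device names 
above live under /dev/rdsk and that "zdb -l" is available; pool_disks is a 
hypothetical helper, and the commands are only printed, not run:

```shell
#!/bin/sh
# Expand c8t[1-8]d0s4 into the eight device paths named in the post.
# Assumption: the devices are reachable under /dev/rdsk.
pool_disks() {
    i=1
    while [ "$i" -le 8 ]; do
        echo "/dev/rdsk/c8t${i}d0s4"
        i=$((i + 1))
    done
}

# Print (not run) a label dump command for each pool disk.
for dev in $(pool_disks); do
    echo "+ zdb -l $dev"
done
```

The spare (c7d0) and the SSD (c8d1) are left out because their slices are 
not given above.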

I've spent 2 weeks trying to recover this pool and have been unable to do so 
in FreeBSD or OpenSolaris.  Is there anyone who could help, or suggest things 
I have not tried?  I'd be fine with just copying the data off, even if I can 
only mount the pool read-only.
-- 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
