On 01/05/2019 15:53, Michelle Sullivan wrote:
Paul Mather wrote:
On Apr 30, 2019, at 11:17 PM, Michelle Sullivan <miche...@sorbs.net>
wrote:
Been there done that though with ext2 rather than UFS.. still got
all my data back... even though it was a nightmare..
Is that an implication that had all your data been on UFS (or ext2:)
this time around you would have got it all back? (I've got that
impression through this thread from things you've written.) That sort
of makes it sound like UFS is bulletproof to me.
It's definitely not (and far from it) bulletproof - however, when the
data on disk is not corrupt I have managed to recover it - even if it
has been a nightmare - no structure, all files in lost+found, etc...
or even resorting to R-Studio in the event of lost RAID information etc...
Yes, but you seem to have done this with ZFS too, just not in this
particularly bad case.
If you imagine that the in-memory update for the metadata was corrupted
and then written out to disk, which is what you seem to have experienced
with your ZFS pool, then you'd be in much the same position.
This case - from what my limited knowledge has managed to fathom - is a
spacemap that has become corrupt due to a partial write during the hard
power failure. This was the second hard outage during the resilver
process following a drive platter failure (on a RAIDZ2 - so a single
platter failure should be completely recoverable in all cases, except
HBA failure or other corruption, which does not appear to be the case)...
The spacemap fails checksum (no surprises there, being that it was
partially written), however it cannot be repaired (for whatever reason)...

I get that this is an interesting case... one cannot just assume
anything about the corrupt spacemap... it could be complete and just
the checksum is wrong, or it could be completely corrupt and ignorable...
but from what I understand of ZFS (and please, watchers, chime in if I'm
wrong) the spacemap is just the free-space map. If it is corrupt or
missing one cannot just 'fix it', because there is a very good chance
that the fix would corrupt something that is actually allocated, and
therefore the best way to "fix it" would be to consider it 100%
full and therefore 'dead space'... but ZFS doesn't do that - probably
a good thing - the result being that a drive that is supposed to be
good (and zdb reports some 36M+ objects there) becomes completely
unreadable...

My thought (desire/want) on a 'walk' tool would be a last-resort tool
that could walk the datasets and send them elsewhere (like zfs send),
so that I could create a new pool elsewhere, send the data it knows
about to that pool, and then blow away the original. If there are
corruptions or data missing, that's my problem - it's a last resort -
but in the case where the critical structures become corrupt it means a
local recovery option exists: if the data is all there and the
corruption is just a spacemap, one can transfer the entire drive/data
to a new pool whilst the original host is rebuilt. This would
*significantly* help most people with large pools who have to blow them
away and re-create the pools because of errors/corruptions etc... and
with the addition of rsync-style checksumming of files it would be
trivial to just 'fix' the data corrupted or missing from a mirror host,
rather than transferring the entire pool from (possibly) offsite...
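To make the "consider it 100% full" argument concrete, here is a toy Python sketch - not real ZFS code; the record format, checksum, and metaslab size are all invented for illustration - of why a space map that fails its checksum cannot be safely repaired, and why the only guess that never clobbers allocated data is "no free space at all":

```python
# Toy model of a ZFS-style space map: an append-only log of
# alloc/free records for one metaslab, protected by a checksum.
# Everything here is a simplification; real space maps live on disk
# as SM_ALLOC/SM_FREE entries and are checksummed by the pool.

import zlib

METASLAB_SIZE = 1024  # toy metaslab of 1024 "blocks"

def checksum(records):
    return zlib.crc32(repr(records).encode())

def replay(records):
    """Replay the log to reconstruct the current free-space set."""
    free = set(range(METASLAB_SIZE))
    for op, start, length in records:
        blocks = range(start, start + length)
        if op == "alloc":
            free.difference_update(blocks)
        else:  # "free"
            free.update(blocks)
    return free

# Intact log: allocate two extents, then free part of one.
log = [("alloc", 0, 100), ("alloc", 200, 50), ("free", 0, 10)]
stored_sum = checksum(log)
assert len(replay(log)) == METASLAB_SIZE - 140

# A torn write corrupts the tail of the log: the checksum no longer
# matches, and we cannot tell WHICH records are still trustworthy.
torn = log[:-1] + [("free", 999999, 1)]
assert checksum(torn) != stored_sum

# Any attempt to "repair" the log risks marking allocated blocks as
# free, which would let new writes destroy live data.  The only safe
# local fix is the conservative one: treat the metaslab as 100% full.
recovered_free = set()  # dead space, but no allocated data clobbered
```

The point of the sketch is the asymmetry: guessing "free" wrongly destroys data on the next write, while guessing "allocated" wrongly only wastes space, which is why treating the region as full is the safe degradation.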
From what I've read that's not a partial-write issue, as in that case
the pool would have just rolled back to the last consistent transaction
group. It sounds more like the write was successful, but the data in
that write was trashed by your power incident, and that was then
replicated across ALL drives.
To be clear, this may or may not be what you're seeing, as you don't
seem to have covered the details of the issues you're seeing, or the
detailed steps you have tried in order to recover.
I'm not saying this is the case, but all may not be lost, depending on
the exact nature of the corruption.
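For what it's worth, the usual first steps for a pool in this state are the rewind and read-only imports, which ask ZFS to discard the last few transaction groups rather than trust the damaged ones. A sketch (pool names "tank" and "backup" are placeholders; these commands obviously need the actual pool and must be run with the pool exported):

```shell
# Dry run: ask whether a recovery/rewind import could succeed
# without actually modifying anything (-F = recovery mode, -n = no-op).
zpool import -F -n tank

# Import read-only so nothing new is written while data is copied off.
zpool import -o readonly=on tank

# If the read-only import works, stream what is recoverable to a
# new pool on separate disks, much like the "walk" tool described above.
zfs snapshot -r tank@rescue
zfs send -R tank@rescue | zfs receive -F backup/tank
```

Whether the rewind import helps depends on whether an uncorrupted uberblock/txg still exists on disk, which ties back to the exact nature of the corruption.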
For more information on space maps see:
https://www.delphix.com/blog/delphix-engineering/openzfs-code-walk-metaslabs-and-space-maps
https://sdimitro.github.io/post/zfs-lsm-flushing/
Similar behaviour turned out to be a bug in one reported case:
https://www.reddit.com/r/zfs/comments/97czae/zfs_zdb_space_map_errors_on_unmountable_zpool/
Regards
Steve
_______________________________________________
freebsd-stable@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"