Re: [PATCH RFC] btrfs: csum: Introduce partial csum for tree block.

Facebook Thu, 18 Jun 2015 08:59:09 -0700

On Wed, Jun 17, 2015 at 9:34 PM, Qu Wenruo <quwen...@cn.fujitsu.com>wrote:

Ping?

New new comments?

As our block sizes get bigger, it makes sense to think about more finegrained checksums. We're using crcs for:

1) memory corruption on the way down to the storage. We could be verysmall (bitflips) or smaller chunks (dma corrupting the whole bio). Theplaces I've seen this in production, the partial crcs might help save apercentage of the blocks, but overall the corruptions were just toopervasive to get back the data.

2) incomplete writes. We're sending down up to 64K btree blocks, thestorage might only write some of them.

3) IO errors from the drive. These are likely to fail in much biggerchunks and the partial csums probably won't help at all.

I think the best way to repair all of these is with replication, eitherRAID5/6 or some number of mirrored copies. It's more reliable thantrying to stitch together streams from multiple copies, and the codecomplexity is much lower.

But, where I do find the partial crcs interesting is the ability tomore accurately detect those three failure modes with our larger blocksizes. That's pure statistics based on the crc we've chosen and thesize of the block. The right answer might just be a different crc, butI'm more than open to data here.


-chris


--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH RFC] btrfs: csum: Introduce partial csum for tree block.

Reply via email to