Re: [ANNOUNCE] Btrfs: a copy on write, snapshotting FS

Vladislav Bolkhovitin Wed, 20 Jun 2007 01:42:01 -0700

[EMAIL PROTECTED] wrote:

> 3. De-de-duplicate blocks on disk, i.e. copy them on write
> > I suppose that de-duplication itself would be done by some userspace
> process that would scan files, determine blocks with the same data and
> then de-duplicate them by using syscall or IOCTL (2).
> > That would be very usable feature, which in most cases wouldallow to
> shrink occupied disk space on 50-90%.
 Have you references for this number?
No, I've seen it somewhere and it well confirms with my own observations.
 In my experience one gets a lot of benefit from
 the much simpler process of "de-duplication" of files.
Yes, sure, de-duplication on files level brings its benefits, but onFS blocks level it would bring ever more benefits, because there aremany more or less big files, which are different as a whole, but witha lot of the same blocks. Simple example of such files is UNIX-stylemail boxes on a mail server.
unix style mail boxes would not be a good example of wins forsector-based de-duplication since the duplicate mail is not going to besector aligned.

Yes, I realized that after I sent the e-mail. Handling of the same, butnot aligned, data in different files would need more complex logic.Maybe too complex.


Vlad
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Re: [ANNOUNCE] Btrfs: a copy on write, snapshotting FS

Reply via email to