I don't know if this strikes me as a 1.0 feature, but it doesn't seem like
something that wouldn't need changes to the disk format. :)

Some ioctls occured to me that might be helpful here, without unduly
loading up the kernel with dedup logic:

-A speculative dedup that you could open, do whatever verification you need
(it would be up to the application), and then commit. I believe there's
already an ioctl to retrieve checksums, or your app could compare the bytes
if you were paranoid; this approach would be agnostic to that. Your dedup
handle would be automatically invalidated if any changes happened in the
specified ranges of bytes in the specified files, so it would be safe.

-A way to subscribe to writes in a subvolume, something similar to the
inotify API. You'd get structs that would tell you the
file/offset/size/csum of writes, allowing you to dedup on the fly without
having to crawl the whole filesystem. I'm not sure about the feasibility of
dealing with the volume of traffic you'd get from something like that, but
if it's something where you can keep a hash table in memory it might be
doable.

-Anthony

On Tue, 28 Apr 2009 06:02:51 -0400, Chris Mason <chris.ma...@oracle.com>
wrote:
> On Tue, 2009-04-28 at 07:22 +0200, Thomas Glanzmann wrote:
>> Hello Chris,
>> 
>> > There is a btrfs ioctl to clone individual files, and this could be
>> > used
>> > to implement an online dedup.  But, since it is happening from
>> > userland,
>> > you can't lock out all of the other users of a given file.
>> 
>> > So, the dedup application would be responsible for making sure a given
>> > file was not being changed while the dedup scan was running.
>> 
>> I see, does that mean that I can not do ,,dedup'' for files that are
>> currently opened by a userland program?
> 
> No, but it does mean the dedup done from userland is racey.  Picture
> this:
> 
> process A:
>     create some_file # some_file matches the contents of another file
> 
> dedup proc:
>     check some_file
>     decide to start block dedup
> 
> process A:
>      modify some_file
> 
> dedup proc:
>      progress through block dedup
> 
> So, this will happily replace blocks in some_file with the dedup blocks.
> But there's no way to atomically swap them.
> 
> We could create new ioctls for this, basically a variant of the clone
> file ioctl that makes sure a given set of pages has a given sum (or
> strict memory contents) before doing the swap.
> 
> But they don't exist yet.
> 
> -chris
> 
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to