[PATCH] btrfs-progs: dduper - BTRFS offline deduplication tool

2018-08-23 Thread Lakshmipathi Ganapathi
dduper is an offline dedupe tool. It works by fetching checksums from the BTRFS csum tree instead of reading whole file blocks and computing the checksums itself. The tool relies on the output of the 'btrfs inspect-internal dump-csum' command. Signed-off-by: Lakshmipathi.G --- dduper | 310
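
The idea in the patch description above, comparing checksums that the filesystem already stores rather than re-reading and re-hashing file data, can be sketched roughly as follows. This is a minimal illustration and not the code from the patch: the function name find_duplicate_blocks is hypothetical, and it assumes the per-block checksums of each file have already been parsed into a list of integers (the output format of 'dump-csum' is not shown in this snippet, so no parsing is attempted here).

```python
from typing import Dict, List, Sequence, Tuple


def find_duplicate_blocks(
    src_csums: Sequence[int], dst_csums: Sequence[int]
) -> List[Tuple[int, int]]:
    """Return (src_block, dst_block) index pairs whose checksums match.

    Hypothetical helper: both arguments are assumed to be per-block
    checksums in file order, already extracted from the csum tree.
    """
    # Remember the first source block seen for each checksum value.
    first_block_for: Dict[int, int] = {}
    for idx, csum in enumerate(src_csums):
        first_block_for.setdefault(csum, idx)

    # Every destination block whose checksum also occurs in the source
    # is a dedupe candidate, paired with that first source block.
    matches: List[Tuple[int, int]] = []
    for dst_idx, csum in enumerate(dst_csums):
        src_idx = first_block_for.get(csum)
        if src_idx is not None:
            matches.append((src_idx, dst_idx))
    return matches


if __name__ == "__main__":
    # Made-up checksum values, for illustration only.
    src = [0xA, 0xB, 0xC, 0xB]
    dst = [0xB, 0xD, 0xA]
    print(find_duplicate_blocks(src, dst))
```

A checksum match only marks a candidate, since distinct blocks can share a checksum. A real tool would presumably still need the block contents compared before any extents are shared.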

Re: Btrfs offline deduplication

2014-08-01 Thread Austin S Hemmelgarn
On 07/31/2014 07:54 PM, Timofey Titovets wrote: Good time of day. I have several questions about data deduplication on btrfs. Sorry if I ask stupid questions or waste your time %) What about an implementation of offline data deduplication? I don't see any activity in this area, maybe I need

Re: Btrfs offline deduplication

2014-08-01 Thread David Sterba
On Fri, Aug 01, 2014 at 06:17:44AM -0400, Austin S Hemmelgarn wrote: I do think however that having the option of a background thread doing deduplication asynchronously is a good idea, but then you would have to have some way to trigger it on individual files/trees, and triggering on writes

Re: Btrfs offline deduplication

2014-08-01 Thread Mark Fasheh
On Fri, Aug 01, 2014 at 10:16:08AM -0400, Austin S Hemmelgarn wrote: On 2014-08-01 09:23, David Sterba wrote: On Fri, Aug 01, 2014 at 06:17:44AM -0400, Austin S Hemmelgarn wrote: I do think however that having the option of a background thread doing deduplication asynchronously is a good

Re: Btrfs offline deduplication

2014-08-01 Thread Austin S Hemmelgarn
On 08/01/2014 02:55 PM, Mark Fasheh wrote: On Fri, Aug 01, 2014 at 10:16:08AM -0400, Austin S Hemmelgarn wrote: On 2014-08-01 09:23, David Sterba wrote: On Fri, Aug 01, 2014 at 06:17:44AM -0400, Austin S Hemmelgarn wrote: I do think however that having the option of a background thread doing

Re: Btrfs offline deduplication

2014-08-01 Thread Mark Fasheh
On Fri, Aug 01, 2014 at 03:18:46PM -0400, Austin S Hemmelgarn wrote: Why does this have to be kernel side? There's already userspace software to dedupe that can be run on a regular basis. Exporting checksums is a different story (you can do that via ioctl) but running the dedupe software

Btrfs offline deduplication

2014-07-31 Thread Timofey Titovets
Good time of day. I have several questions about data deduplication on btrfs. Sorry if I ask stupid questions or waste your time %) What about an implementation of offline data deduplication? I don't see any activity in this area, maybe I need to ask a particular person? Where is the problem? Maybe a