Quoting "Austin S. Hemmelgarn" <ahferro...@gmail.com>:
On 2019-09-12 15:18, webmas...@zedlx.com wrote:
Quoting "Austin S. Hemmelgarn" <ahferro...@gmail.com>:
On 2019-09-11 17:37, webmas...@zedlx.com wrote:
Quoting "Austin S. Hemmelgarn" <ahferro...@gmail.com>:
On 2019-09-11 13:20, webmas...@zedlx.com wrote:
Quoting "Austin S. Hemmelgarn" <ahferro...@gmail.com>:
On 2019-09-10 19:32, webmas...@zedlx.com wrote:
Quoting "Austin S. Hemmelgarn" <ahferro...@gmail.com>:
* Reflinks can reference partial extents. This means,
ultimately, that you may end up having to split extents in odd
ways during defrag if you want to preserve reflinks, and might
have to split extents _elsewhere_ that are only tangentially
related to the region being defragmented. See the example in my
previous email for a case like this: maintaining the shared
regions as shared when you defragment either file to a
single extent will require splitting extents in the other file
(in either case, whichever file you don't defragment to a single
extent will end up with 7 extents if you try to force the
defragmented one to be the canonical version). Once you
consider that a given extent can have multiple ranges reflinked
from multiple other locations, it gets even more complicated.
I think that this problem can be solved, and solved
perfectly (the result being a perfectly defragmented file).
But, if it is that hard to do, just skip those problematic extents
in the initial version of defrag.
Ultimately, in the super-duper defrag, those partially-referenced
extents should be split up by defrag.
* If you choose to just not handle the above point by not
letting defrag split extents, you put a hard lower limit on the
amount of fragmentation present in a file if you want to
preserve reflinks. IOW, you can't defragment files past a
certain point. If we go this way, neither of the two files in
the example from my previous email could be defragmented any
further than they already are, because doing so would require
splitting extents.
Oh, you're reading my thoughts. That's good.
The initial implementation of defrag might not be perfect. It
would still be better than the current defrag.
This is not a one-way street. Handling of partially-used extents
can be improved in later versions.
* Determining all the reflinks to a given region of a given
extent is not a cheap operation, and the information may
immediately be stale (because an operation right after you fetch
the info might change things). We could work around this by
locking the extent somehow, but doing so would be expensive
because you would have to hold the lock for the entire defrag
operation.
No. DO NOT LOCK TO RETRIEVE REFLINKS.
Instead, you have to create a hook in every function that updates
the reflink structure or extents (for example, the write-to-file
operation). So, when a reflink gets changed, the defrag is
immediately notified. That way the defrag can keep its
data about reflinks in sync with the filesystem.
This doesn't get around the fact that it's still an expensive
operation to enumerate all the reflinks for a given region of a
file or extent.
No, you are wrong.
In order to enumerate all the reflinks in a region, the defrag
needs to have another array, also kept in memory and in
sync with the filesystem. It is easiest to divide the disk into
regions of equal size, where each region is a few MB large. Let's
call this array the "regions-to-extents" array. This array doesn't
need to be associative; it is a plain array.
This in-memory array links regions of the disk to the extents that
lie in each region. The array is initialized when defrag starts.
This array makes the operation of finding all extents in a region
extremely fast.
That has two issues:
* That's going to be a _lot_ of memory. You still need to be able
to defragment big (dozens-plus TB) arrays without needing multiple
GB of RAM just for the defrag operation, otherwise it's not
realistically useful (remember, it was big arrays that had issues
with the old reflink-aware defrag too).
* You still have to populate the array in the first place. A sane
implementation wouldn't keep it in memory when defrag is
not running (no way is anybody going to tolerate even dozens of MB
of memory overhead for this), so you're not going to get around the
need to enumerate all the reflinks for a file at least once (during
startup, or when starting to process that file), so you're just
moving the overhead around instead of eliminating it.
Nope, I'm not just "moving the overhead around instead of eliminating
it"; I am eliminating it.
The only overhead is at defrag startup, when the entire b-tree
structure has to be loaded and examined. That takes a few seconds.
After this point, there is no more "overhead", because the running
defrag is always notified of any changes to the b-trees (by hooks in
the b-tree update routines). Whenever there is such a change, the
region-extents array gets updated. Since this region-extents array is
in-memory, the update is so fast that it can be considered zero
overhead.