Well, then you could have more "logical space" than "physical space".

Reconsidering my own question, it seems to me that space management is more fundamental than I had initially thought, and I assume members of the core team have already thought through much of it.

I will try to share my thoughts and I would very much appreciate any corrections or additional explanations.

For dedup, my understanding at this point is that, first of all, every reference to dedup'ed data must be accounted to the respective dataset.

Obviously, a decision has been made to account that space as "used" rather than "referenced". I am trying to understand why.

At first sight, given the definition of "used" space as space unique to the respective dataset, it would seem natural to account all de-duped space as "referenced". But that could leave a lot of space never accounted as "used" anywhere except at the pool level. This would differ from the observed behavior of non-deduped datasets, where, to my understanding, all "referenced" space is "used" by some other dataset. Despite being a little counter-intuitive, I initially found this simple solution quite attractive, because it wouldn't alter the semantics of used vs. referenced space (assuming my understanding is correct).
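
To make this concrete for myself, here is a toy Python sketch of that "referenced-only" accounting. The dataset names and sizes (tank/a, tank/b, 1 GB) are invented for illustration; this only models my reading of the semantics, not how ZFS actually does the bookkeeping:

    GB = 1024 ** 3

    # Two datasets hold the same 1 GB of data; dedup stores the block once.
    pool_allocated = 1 * GB

    # The option above: charge de-duped blocks as "referenced" in every
    # dataset, but as "used" in none of them.
    referenced = {"tank/a": 1 * GB, "tank/b": 1 * GB}
    used       = {"tank/a": 0,      "tank/b": 0}

    # The de-duped gigabyte is never "used" anywhere except at the pool level.
    assert sum(used.values()) == 0 and pool_allocated == 1 * GB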

My understanding from Eric's explanation is that an alternative route was chosen: all de-duped space is accounted as "used" by every dataset referencing it, because, in contrast to snapshots/clones, it is impossible (?) to differentiate between used and referenced space with de-dup. At first sight, this also seems to be a way to keep the current semantics for (ref)reservations.
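
Under that route, the same toy example would, as far as I understand it, come out like this (again just an illustrative sketch with invented numbers, not the real code):

    GB = 1024 ** 3
    pool_allocated = 1 * GB          # the shared block still exists only once

    # Every dataset referencing the de-duped block is charged its full
    # size as "used".
    used = {"tank/a": 1 * GB, "tank/b": 1 * GB}

    # The per-dataset "used" values now sum to more than is actually
    # allocated in the pool.
    assert sum(used.values()) == 2 * GB > pool_allocated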

But while, without de-dup, the usedsnap and usedds values should roughly sum up to the pool's used space, they can't with this concept, which is why I thought a solution could be to compensate for the multiply accounted "used" space by artificially increasing the pool size.
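
The compensation I had in mind would look roughly like this; "logical pool size" is my own term, and the 10 GB pool is hypothetical:

    GB = 1024 ** 3
    physical_pool_size = 10 * GB
    pool_allocated     = 1 * GB      # what is really on disk
    sum_of_used        = 2 * GB      # tank/a + tank/b from the sketch above

    # Grow the pool size by the amount that was counted more than once,
    # so that "used" plus "free" still adds up.
    logical_pool_size = physical_pool_size + (sum_of_used - pool_allocated)
    logical_free      = logical_pool_size - sum_of_used

    # The free space reported this way still matches what is physically free.
    assert logical_free == physical_pool_size - pool_allocated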

Instead, judging from the examples given here, what seems to have been implemented with de-dup is simply to maintain the pool's space statistics on the basis of the space actually allocated.

While one may find it counter-intuitive that the used sizes of all datasets/snapshots can exceed the pool's used size with de-dup, this design seems consistent, if my understanding is correct.

I am very interested in the reasons why this particular approach has been chosen and why others have been dropped.


Now to the more general question: if all datasets of a pool contained the same data and got de-duped, the sum of their "used" space still seems to be limited by the "logical" pool size, as we've seen in the examples given by Jürgen and others. To get a real benefit from de-dup, this implementation obviously needs to be changed.
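
A small worked example of why that limit defeats the purpose, with a hypothetical 10 GB pool and my own arithmetic based on the behaviour shown in those examples:

    GB = 1024 ** 3
    pool_size = 10 * GB

    # Ten datasets, each written with the same 1 GB of data.
    n_datasets     = 10
    used_per_ds    = 1 * GB
    pool_allocated = 1 * GB          # dedup stores the data only once

    # If the sum of "used" is capped at the pool size, the pool looks full
    # after ten copies, although 9 GB are physically unused.
    assert n_datasets * used_per_ds == pool_size
    assert pool_size - pool_allocated == 9 * GB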

But: isn't there an implicit expectation of a space guarantee associated with a dataset? In other words, if a dataset holds 1 GB of data, isn't it natural to expect to be able to overwrite that space with other data? One might want to define explicit space guarantees (as with (ref)reservation), but I don't see how those would work with the currently implemented concept.

Do we need something like a de-dup reservation, which is subtracted from the pool's free space?
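
What I am imagining is something along these lines; a "dedupreservation" property does not exist today, this is only a sketch of the semantics I mean:

    GB = 1024 ** 3
    pool_free = 9 * GB               # physically free space, as above

    # Hypothetical per-dataset "dedupreservation": space set aside so a
    # dataset could later overwrite its de-duped data with unique data,
    # even though its blocks are currently shared.
    dedup_reservations = {"tank/a": 1 * GB, "tank/b": 1 * GB}

    # Like an ordinary reservation, it would be subtracted from the pool's
    # free space up front.
    effective_free = pool_free - sum(dedup_reservations.values())
    assert effective_free == 7 * GB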


Thank you for reading,

Nils