Re: thoughts on ETags and mod_dav

Michael Clark Fri, 28 Dec 2007 21:19:59 -0800

Henrik Nordström wrote:

Modifications made using direct filesystem access, in the same second as
the last WebDAV update and only rewriting the file in-place without
changing the length.

That would be a reasonable limitation - hence the suggesion for aDavETagIsolation dav-only (the default being a more conservative dav+fs)

A correct implementation would have to regenerate the strong ETag forall of these sub second occurring requests (which is counter intuitivefor a cached property and also very inefficient).
No, but if direct filesystem access is allowed it would need to be able
to detect that there has been a such access in the same second and
invalidate the strong ETag.

AFAICT, we are in agreement here. My point was related to the currentinability to detect the direct filesystem access i.e. with theDavETagIsolation dav+fs you would have to invalidate the ETag unless youhad some sort of mechanism to detect sub second direct filesystemaccesses (or mandatory locking as you propose) - i.e. invalidate strongETag == regenerate strong ETag.

Assuming direct filesystem access is not allowed then a stored ETag withmtime+inode+length+modification counter would be guaranteed to be enough(yes, it would need to be stored due to the additional counter) asmod_dav would be the only one changing anything.

Alternatively, a PUT without Content-Range (the common case) could bechanged to mktemp / open / write / close / rename - this may remove theneed for storing anything as a normal PUT would always change the inode(it would catch most of the common cases - excluding direct filesystemmodification within the same second or mmap changes which don't updatemtime and PUT with Content-Range which I think is a pretty special case?).

The other proposals I've been reading seem to require saying that thisETag is strong when we know that this it not guaranteed to be true - butperhaps this should be up to the admin (DavETagIsolation dav-mods-only).
filesystem metadata based on just size + inode + mtime is never a
guaranteed change identifier. Nothing stops an in-place edit where the
timestamp is kept.

Yes but in the first paragraph you are excluding these sub second filesystem access anyway :) . Excluding sub second direct file system accessand with the addition of a modification counter it would be enough? (orby making PUT without Content-Range change the inode).

The isolation levels should perhaps be discussed independently as thisis leading to some of the confusion. Here is my take on it:


dav+fs (assumptions of current code)

* needs to generate a weak ETag for subsecond mtime unless it coulddetect sub second modifications or use a mandatory lock for 1 second.* it can't detect sub second modifications and doesn't use a mandatorylock for 1 second.* storing ETag will not improve your ability to generate a strong ETag(unless you could detect the direct filesystem modifications or use amandatory lock).* if you could detect the direct filesystem modifications or use amandatory lock then mtime+inode+length (+ modification counter) would beenough (although this would exclude mmap modifications on unix)* sub-second direct modifications can't be detected easily (requiresnon portable interfaces like inotify) and mmap changes can probablynever be detected.


dav-only

* could generate strong ETags always if sub-second mod_davmodifications were tracked somehow.* mtime+inode+length (+ modification counter) would be enough (whatmore is needed? no changes are happening outside mod_dav)

* probably needs to store modification/generation counter, although...

* if PUT without Content-Range was made to change the inode then mostof the common cases could probably be covered without storing anything(although this would not cover the case where the ETag would need tochange on sub second PROPPATCH?? and PUT with Content-Range - how commonis this?).


Perhaps there is a third isolation level?:

dav+loose-fs

* only differs from dav-only in that it makes a best effort at handlingFS changes.* could incorrectly return a strong ETag in cases where a direct filesystem modification was made in the same second as a mod_dav modification

* mtime+inode+length (+ modification counter) would be enough?
* Is this any different from dav-only? Is this just an alias for dav-only?

Re: thoughts on ETags and mod_dav

Reply via email to