Re: Call for comments on new dirstate format contents

Simon Sapin Tue, 29 Jun 2021 13:28:46 -0700

On 29/06/2021 20:48, Kyle Lippincott wrote:

Can you elaborate a bit on what this append-only tree looks like (and why that's preferred)

It’s a tree in that there are nodes for files and directories. We can quickly find root nodes, and from a given node we can quickly find its direct child nodes, all without parsing the entire file.

It’s "append-mostly" because changes are made by adding new nodes at the end of the file and reusing nodes for unchanged sub-trees. Nodes that have been replaced become unreachable but still take up space. Occasionally, based on some heuristic, the whole file is rewritten without unreachable nodes. This makes most writes cheaper than re-serializing and writing the entire file.
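To make the bookkeeping concrete, here is a minimal sketch of the append-and-compact idea. It is deliberately simplified: records are flat rather than a tree, and the in-memory `live` index stands in for reachability from the root nodes. The class and method names are hypothetical and this is not the actual dirstate-v2 layout.

```python
class AppendMostlyStore:
    """Toy append-mostly byte store (hypothetical, not the dirstate-v2 layout)."""

    def __init__(self):
        self.data = bytearray()   # stands in for the data file
        self.live = {}            # key -> (offset, length) of the reachable record
        self.unreachable = 0      # bytes belonging to replaced records

    def put(self, key, payload):
        old = self.live.get(key)
        if old is not None:
            # The old record stays in the file but is now dead weight.
            self.unreachable += old[1]
        self.live[key] = (len(self.data), len(payload))
        self.data += payload      # new records always go at the end

    def get(self, key):
        offset, length = self.live[key]
        return bytes(self.data[offset:offset + length])

    def compact(self):
        """Rewrite everything, keeping only reachable records."""
        fresh = bytearray()
        for key, (offset, length) in self.live.items():
            self.live[key] = (len(fresh), length)
            fresh += self.data[offset:offset + length]
        self.data = fresh
        self.unreachable = 0
```

Replacing a record costs one append, and the full rewrite only happens when `compact()` is called.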

and why stem compression would cause performance issues?

Each node contains its full path from the repository root. This allows status code to pass around a slice (pointer + length) to the middle of the mmap’ed file. If a node only had its basename we’d have to allocate a string to reconstitute a path by concatenating the names of ancestor directories. The cost of many memory allocations can add up.
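As a rough illustration of the difference (in Python rather than the Rust code this is really about, and with hypothetical function names), borrowing a full path is just an offset and a length into the existing buffer, while rebuilding it from basenames creates a new object each time:

```python
def path_slice(buf, start, length):
    """Borrow a full path from the buffer: no copy, just offset + length."""
    return memoryview(buf)[start:start + length]


def path_from_basenames(basenames):
    """What storing only basenames would require: a fresh allocation."""
    return b"/".join(basenames)
```

Here `buf` would be the mmap’ed file; any bytes-like object works for the sketch.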

When loading this new dirstate, would it require loading the entire thing from the beginning and replacing entries with the newer ones?

No, that’s the point of making it a tree of fixed-size nodes that contain data at fixed offsets, with pseudo-pointers for variable-size data (paths and child nodes).
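A small sketch of what "fixed-size nodes with pseudo-pointers" buys us. The field layout below is invented for illustration (it is not the proposed on-disk format), and the linear scan over siblings is an assumption made to keep the example short. The point is that a lookup only decodes the nodes on the way down.

```python
import struct

# Hypothetical layouts, little-endian, no padding.
NODE = struct.Struct("<IHII")   # path_start, path_len, children_start, children_count
ROOT = struct.Struct("<II")     # root_nodes_start, root_nodes_count


def read_children(buf, start, count):
    """Yield (full_path, children_start, children_count) for sibling nodes."""
    for i in range(count):
        p_start, p_len, c_start, c_count = NODE.unpack_from(
            buf, start + i * NODE.size
        )
        yield bytes(buf[p_start:p_start + p_len]), c_start, c_count


def lookup(buf, path):
    """Find the node for `path`, decoding only the nodes along the way."""
    start, count = ROOT.unpack_from(buf, 0)
    while True:
        for node_path, c_start, c_count in read_children(buf, start, count):
            if node_path == path:
                return node_path, c_start, c_count
            if path.startswith(node_path + b"/"):
                # Descend into this directory's children.
                start, count = c_start, c_count
                break
        else:
            return None   # no sibling matched at this level
```

Because every node has the same size, the i-th sibling lives at `start + i * NODE.size`, so nothing before it needs to be parsed.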

You say the Python implementation will offer no purposeful performance improvements, but how likely is it that it will be slower than the current format?

The current implementations (Python and C) of dirstate-v1 work by parsing the entire dirstate into large Python dicts. The Python implementation of dirstate-v2 would do the same, only parsing a different format.

What level of performance degradation would be considered acceptable?


Good question. We don’t have hard criteria.

However this fallback implementation of dirstate-v2 will only be used when accessing an existing local repository that uses that format. When creating a new clone, dirstate-v2 is only used if a fast implementation is available.

What happens if the docket and data file get out of sync somehow (maybe hg crashes in the middle of writing, or Google has a network write race)?

A docket that refers to a new data file is only swap-renamed into place after the data file has been fully written.

I don’t know what ordering guarantees between writes exist or not on Google’s network filesystem.
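The ordering can be sketched like this. The file names (`dirstate`, `dirstate.tmp`, `dirstate.<id>`) and the docket content (just the data file name) are placeholders, and the `fsync` calls are an extra assumption on my part, not something the proposal specifies.

```python
import os
import uuid


def write_dirstate(repo_dir, data_bytes):
    """Write a new data file, then atomically point the docket at it."""
    data_name = "dirstate." + uuid.uuid4().hex[:8]   # hypothetical naming
    with open(os.path.join(repo_dir, data_name), "wb") as f:
        f.write(data_bytes)
        f.flush()
        os.fsync(f.fileno())   # assumption: flush data before publishing it

    tmp_docket = os.path.join(repo_dir, "dirstate.tmp")
    with open(tmp_docket, "wb") as f:
        f.write(data_name.encode("ascii"))
        f.flush()
        os.fsync(f.fileno())
    # Readers see either the old docket or the new one, never a partial one.
    os.replace(tmp_docket, os.path.join(repo_dir, "dirstate"))
    return data_name


def read_dirstate(repo_dir):
    """Follow the docket to whichever data file it currently names."""
    with open(os.path.join(repo_dir, "dirstate"), "rb") as f:
        data_name = f.read().decode("ascii")
    with open(os.path.join(repo_dir, data_name), "rb") as f:
        return f.read()
```

If the process dies before the rename, the old docket still points at the old, complete data file.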

              - A count of dead (unreachable) bytes
              - A count of alive (reachable) bytes

What are these two?

Only one of them is needed, the other can be deduced by subtracting from the size of the file. Unreachable means obsolete parts of the file that have been replaced by other nodes, see "append-mostly" above. The heuristic for rewriting the whole file to get rid of unreachable data is based on this counter.
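For illustration only, the arithmetic and one possible policy look like this. The 50% threshold is made up for the example; the actual heuristic has not been decided here.

```python
def reachable_bytes(file_size, unreachable):
    """The other counter, deduced from the file size."""
    return file_size - unreachable


def should_rewrite(file_size, unreachable, max_dead_ratio=0.5):
    """Hypothetical policy: rewrite once too much of the file is dead."""
    return file_size > 0 and unreachable / file_size > max_dead_ratio
```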

Is there a good way of determining what the timestamp resolution of a filesystem is?


As far as I know there is not.

What we can do is create a temporary file and take its mtime as the current time with the same (unknown) truncation as other files’ mtimes. If we observe a "current mtime" strictly later than a given file’s mtime, we know that further changes to that file are extremely likely[1] to cause a different mtime since the clock has already ticked since the last change.

([1] The system clock is not monotonic, so it could jump back and still have the same clock-reported date happen again. If we get unlucky another change to the file could happen exactly then, modulo truncation.)
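A minimal sketch of that check, with hypothetical function names and assuming the temporary file is created in the same directory (so on the same filesystem) as the file being examined. This is not the code linked below, only the same idea in a few lines.

```python
import os
import tempfile


def filesystem_now_ns(directory):
    """mtime of a freshly created file: "now" as this filesystem records it."""
    with tempfile.NamedTemporaryFile(dir=directory) as tmp:
        return os.fstat(tmp.fileno()).st_mtime_ns


def mtime_is_unambiguous(path):
    """True if a later write to `path` should produce a different mtime."""
    file_mtime = os.stat(path).st_mtime_ns
    now = filesystem_now_ns(os.path.dirname(os.path.abspath(path)))
    # Strictly later: the (truncated) clock has ticked since the last change.
    return now > file_mtime
```

When this returns false, the file’s mtime cannot be trusted as a cache key yet and its content has to be compared instead.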


See comments starting at
https://www.mercurial-scm.org/repo/hg-committed/file/5fa083a5ff04/rust/hg-core/src/dirstate_tree/status.rs#l401

(I don't know how various OSes treat these timestamps when the underlying filesystem doesn't support higher precision; is it 100% guaranteed that they just extend it with zeroes?)

Regardless, there’s also the case where the filesystem can store enough bits but the kernel only updates an internal clock at some arbitrary ticks:


https://stackoverflow.com/a/14393315/1162888

              - All of the info needed to get the previous state of a Removed
    file in case we `hg add` it back
Can you explain the use case for this (and/or what would be in it)? I would think that `hg rm foo && echo hi > foo && hg add foo` should be equivalent to `echo hi > foo`, but I might be missing something?

I still don’t fully understand this, but it also exists in dirstate-v1. I think it’s relevant when in the middle of merging.


https://www.mercurial-scm.org/wiki/DirState#Summary

My biggest concern is extensibility. As an example, as you were writing this up, you thought of something else to add, so we probably don't want to restrict ourselves too much :) The file format is already going to not be anything resembling fixed record size, having a section for generic key/value data that extensions can use might be quite useful (and maybe future core code, though I'm assuming the format can be such that this would be able to be made to work without the size/parsing complexity of key/value).

The proposed dirstate-v2 is based on fixed-size records. This is what enables accessing parts of it without parsing.

A problem with file format extensibility is dealing with clients that don’t know about a given extension. Since we can never rely on "new" fields being present or kept up to date, they’re of rather limited use.

My opinion is that we can anticipate some things now-ish, and for further changes one day we can make a dirstate-v3 format.


--
Simon Sapin
_______________________________________________
Mercurial-devel mailing list
Mercurial-devel@mercurial-scm.org
https://www.mercurial-scm.org/mailman/listinfo/mercurial-devel
