Re: [darcs-users] patch metadata, annotations, Ignore-this, tagging, etc

Max Battcher Mon, 22 Mar 2010 11:05:58 -0700

Eric Kow wrote:

Long term [Darcs 3]
-------------------
A new patch format in general may be interesting for the long term.
http://bugs.darcs.net/patch1096 appears to be a step in that direction.

Long term I'd like a pony, but more importantly for darcs patches to bein some easy to parse markup format like JSON, perhaps.

Medium term [Polished Darcs 2]
------------------------------
I claim that this coming up with this new patch format is unrealistic
for the medium term (defined as post-performance-obsession and
pre-Darcs-3).  If we were to use anything better, it'd have to be
backwards-compatible (ie. using the patch long comment?)

Therefore, it would be interesting to determine if

1. If a new backwards compatible format will be useful in the medium
   term [which could last for many years mind you, if you also add in
   the short-term], or if we can get away with using Ignore-this for
   that time

2. If the new format could just start with "Ignore-this:"

3. What the new format would actually look like

We don't have to open this discussion now, but it's now being tracked as
a potential project in <http://bugs.darcs.net/issue1787>.  My request is
for whoever launches the third salvo in this discussion please research
the past threads (eg. when we introduced the Ignore-this salt for
issue27?) and link them here

There's also some very interesting future work on patch annotations
<http://bugs.darcs.net/issue1613> for optional metadata.  It may even
be medium-term if we're lucky.

When Ignore-this was first implemented the medium term solution of usinga full RFC822 email-like header was broached. Of course, RFC822 is fullof loopholes and surprisingly hard to parse in reality, but the obviouspoint that Ignore-this: xxx does indeed look like an email header stillstands. (I'd like to remain on the record that I'd still prefer a bettername like "Patch conflict avoidance hash" than Ignore-this, by the way.)

I've been thinking on this some, and I think I have a reasonablesuggestion that is easier to parse than RFC822, but carries a similareffect: YAML formatted darcs comments.

YAML (yaml.org) is a JSON superset that was designed to be morehuman-readable/human-editable than JSON. Since long comments are stillmeant to be examined (and perhaps amended) by us humans, I'm all forkeeping markup to a reasonable minimum. However, YAML is still easy toparse, with libraries in many languages.


Here's Ignore-this wrapped in an explicit YAML document:

  %YAML 1.2 # YAML version directive, can be used as indicator
  --- # document start
  Ignore-this: xxx # same as currently, but now in a YAML mapping
  ... # document end

We could argue the usefulness of the explicit YAML directive anddocument start (---), but explicit document end (...) makes a clearseparation between any darcs-interesting metadata and a user's actualcontent: both to simple regex searching, and to YAML parsers (which havethe concept of "parse the first document" and "parse past the firstdocument"). (Certainly an explicit marker is better than RFC822'ssometimes difficultly implicit marker.)

Of course, the above example doesn't seem too great with justIgnore-this, so here's a better example:


  %YAML 1.2
  ---
  Ignore-this: yyy
  Encoding: UTF-8
  Patch version: 2.0+YAML
  X-Musdex version: 10.03.22
  ...

So, backwards compatibility issues: much the same as with Ignore-this.Patches with long comments with YAML headers get the headers output inversion of darcs prior to the switchover point. This may not be a bigproblem, for instance, the above example in darcs 2.4 changes outputseems reasonable:


  %YAML 1.2
  ---
  Encoding: UTF-8
  Patch version: 2.0+YAML
  X-Musdex version: 10.03.22
  ...

The big gain is the forwards compatibility for arbitrary headers withoutspecial casing each and every one or prefixing them all with the silly"Ignore-this:" tag. It also would be presumably be forwards compatiblewith some nice long term future version of darcs where arbitrarymetadata headers can be moved out of the long comment to someone morepreferable.

Additional gain is that ignorable header lines now have two stronglyconsistent ways of being handled by scripts: 1) parse the first YAMLdocument in the long comment to get the headers, 2) ignore everything tothe first line that begins with an ellipsis (...) to get to the usercomment. In both cases a first line beginning with %YAML can be used todenote that there is any header at all.


So that's my current suggestion. Feel free to tear it apart.

--
--Max Battcher--
http://worldmaker.net
_______________________________________________
darcs-users mailing list
[email protected]
http://lists.osuosl.org/mailman/listinfo/darcs-users

Re: [darcs-users] patch metadata, annotations, Ignore-this, tagging, etc

Reply via email to