Re: [PATCH v8 27/44] walker.c: use ref transaction for ref updates

Jonathan Nieder Tue, 20 May 2014 17:47:28 -0700

Ronnie Sahlberg wrote:

> This changes the locking slightly for walker_fetch. Previously the code would
> lock all refs before writing them but now we do not lock the refs until the
> commit stage. There is thus a very short window where changes could be done
> locally during the fetch which would be overwritten when the fetch completes
> and commits its transaction. But this window should be reasonably short.
> Even if this race does trigger, since both the old code and the new code
> just overwrites the refs to the new values without checking or comparing
> them with the previous value, this is not too dissimilar to a similar scenario
> where you first do a ref change locally and then later do a fetch that
> overwrites the local change. With this in mind I do not see the change in
> locking semantics to be critical.


Sounds scary.  The usual approach is

        old_sha1 = ...
        ... various checks ...

        transaction = transaction_begin(&err)
        transaction_update(transaction, refname, new_sha1, old_sha1, ...);
        transaction_commit(transaction, &err);

which is not racy because _update checks against old_sha1.

If I understand correctly, you are saying 'have_old' is false here so
we don't have the usual protection.  If the "... various checks ..."
section shown above is empty, that should be fine and there is no
actual change in semantics.  If the "... various checks ..." section
shown above is nonempty then it could be a problem.

[...]
> --- a/walker.c
> +++ b/walker.c
> @@ -251,24 +251,18 @@ void walker_targets_free(int targets, char **target, 
> const char **write_ref)
>  int walker_fetch(struct walker *walker, int targets, char **target,
>                const char **write_ref, const char *write_ref_log_details)
>  {
> -     struct ref_lock **lock = xcalloc(targets, sizeof(struct ref_lock *));
> +     char ref_name[PATH_MAX];

We tend to prefer strbuf instead of fixed-size buffers in new code.

[...]
> -     char *msg;
> +     char *msg = NULL;

Needed?  The existing code seems to set msg = NULL in the
!write_ref_log_details case already.

[...]
> @@ -294,19 +288,26 @@ int walker_fetch(struct walker *walker, int targets, 
> char **target,
>       for (i = 0; i < targets; i++) {
>               if (!write_ref || !write_ref[i])
>                       continue;
> -             ret = write_ref_sha1(lock[i], &sha1[20 * i], msg ? msg : "fetch 
> (unknown)");

Okay, so before this patch we do:

        for each target in write_ref:
                lock it (with no particular expectation about where it
                points)

Then

        unless http-fetch was passed --recover:
                mark the objects pointed to by current refs as COMPLETE

Then we do HTTP GETs to grab the objects we need from a "dumb" HTTP
server.  The COMPLETE objects tell us about objects we don't have to
bother trying to get.

When we're done, we come up with a reflog entry and write out refs
pointing to the requested commits.

This code has two callers:

        git-remote-http (aka remote-curl.c::fetch_dumb)
        git-http-fetch (aka http-fetch.c)

The codepath in git-remote-http gets wide use, though it's diminishing
as more people switch to "smart" http.  It doesn't 't use the "write
out some refs" feature.  It just wants the objects and then takes care
of writing refs on its own.

Perhaps it's worth avoiding beginning a transaction in the first place
in the !write_ref case.

The git-http-fetch command is a piece of plumbing that used to be used
by 'git clone' and 'git fetch' in the olden days when they were shell
scripts.  I doubt anyone uses it.  As you noticed, it doesn't have any
way to specify anything about the expected old values of the refs it
writes to.  So this change doesn't introduce any race there.

> +             sprintf(ref_name, "refs/%s", write_ref[i]);
> +             if (ref_transaction_update(transaction, ref_name,
> +                                        &sha1[20 * i], NULL,
> +                                        0, 0))
> +                     goto rollback_and_fail;
> +     }

Looks good.

> +
> +     if (ref_transaction_commit(transaction, msg ? msg : "fetch (unknown)",
> +                                &err)) {
> +             error("%s", err.buf);
> +             goto rollback_and_fail;
>       }

Also looks good.

Thanks,
Jonathan
--
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH v8 27/44] walker.c: use ref transaction for ref updates

Reply via email to