Thanks - as stated in your commit message, this adds quite a useful
piece of functionality.

> -     case LOFS_BEGIN_TREE:
> -     case LOFS_BLOB:
> -             if (filter_data->omits) {
> -                     oidset_insert(filter_data->omits, &obj->oid);
> -                     /* _MARK_SEEN but not _DO_SHOW (hard omit) */
> -                     return LOFR_MARK_SEEN;
> -             } else {
> -                     /*
> -                      * Not collecting omits so no need to to traverse tree.
> -                      */
> -                     return LOFR_SKIP_TREE | LOFR_MARK_SEEN;
> -             }
> -
>       case LOFS_END_TREE:
>               assert(obj->type == OBJ_TREE);
> +             filter_data->current_depth--;
>               return LOFR_ZERO;
>  
> +     case LOFS_BLOB:
> +             filter_trees_update_omits(obj, filter_data, include_it);
> +             return include_it ? LOFR_MARK_SEEN | LOFR_DO_SHOW : LOFR_ZERO;

Any reason for moving "case LOFS_BLOB" (and "case LOFS_BEGIN_TREE"
below) after LOFS_END_TREE?

This is drastically different from the previous case, but this makes
sense - previously, all blobs accessed through traversal were not shown,
but now they are sometimes shown. Here, filter_trees_update_omits() is
only ever used to remove a blob from the omits set, since once this blob
is encountered with include_it == true, it is marked as LOFR_MARK_SEEN
and will not be traversed again.

> +     case LOFS_BEGIN_TREE:
> +             seen_info = oidmap_get(
> +                     &filter_data->seen_at_depth, &obj->oid);
> +             if (!seen_info) {
> +                     seen_info = xcalloc(1, sizeof(struct seen_map_entry));

Use sizeof(*seen_info).

> +                     seen_info->base.oid = obj->oid;

We have been using oidcpy, but come to think of it, I'm not sure why...

> +                     seen_info->depth = filter_data->current_depth;
> +                     oidmap_put(&filter_data->seen_at_depth, seen_info);
> +                     already_seen = 0;
> +             } else
> +                     already_seen =
> +                             filter_data->current_depth >= seen_info->depth;

There has been recently some clarification that if one branch of an
if/else construct requires braces, braces should be put on all of them:
1797dc5176 ("CodingGuidelines: clarify multi-line brace style",
2017-01-17). Likewise below.

> +             if (already_seen)
> +                     filter_res = LOFR_SKIP_TREE;
> +             else {
> +                     seen_info->depth = filter_data->current_depth;
> +                     filter_trees_update_omits(obj, filter_data, include_it);
> +
> +                     if (include_it)
> +                             filter_res = LOFR_DO_SHOW;
> +                     else if (filter_data->omits)
> +                             filter_res = LOFR_ZERO;
> +                     else
> +                             filter_res = LOFR_SKIP_TREE;

Looks straightforward. If we have already seen it at a shallower or
equal depth, we can skip it (since we have already done the appropriate
processing). Otherwise, we need to ensure that its "omit" is correctly
set, and:
 - show it if include_it
 - don't do anything special if not include_it and we need the omit set
 - skip the tree if not include_it and we don't need the omit set

> +static void filter_trees_free(void *filter_data) {
> +     struct filter_trees_depth_data* d = filter_data;
> +     oidmap_free(&d->seen_at_depth, 1);
> +     free(d);
> +}

Check for NULL-ness of filter_data too, to match the usual behavior of
free functions.

> diff --git a/t/t6112-rev-list-filters-objects.sh 
> b/t/t6112-rev-list-filters-objects.sh
> index eb32505a6e..54e7096d40 100755
> --- a/t/t6112-rev-list-filters-objects.sh
> +++ b/t/t6112-rev-list-filters-objects.sh

[snip]

Thanks for the tests that cover quite a wide range of cases. Can you
also demonstrate the case where a blob would normally be omitted
(because it is too deep) but it is directly specified, so it is
included.

> +expect_has_with_different_name () {
> +     repo=$1 &&
> +     name=$2 &&
> +
> +     hash=$(git -C $repo rev-parse HEAD:$name) &&
> +     ! grep "^$hash $name$" actual &&
> +     grep "^$hash " actual &&
> +     ! grep "~$hash" actual
> +}

Should we also check that a "~" entry appears with $name?

Reply via email to