[dev-platform] Searchfox Update: Changes to annotate / "blame" behavior around revision skipping

Andrew Sutherland Wed, 16 Mar 2022 12:58:45 -0700

tl;dr: The Searchfox blame UI will no longer try to skip over revisions in `.git-blame-ignore-revs`. The downside is that this may result in you needing to click through reformatting patches and do additional manual ctrl-f-ing to find where the line you were interested in went before repeating your manual blame algorithm steps. The upside is that there is no longer any risk of the heuristics getting tricked and referencing an inaccurate revision and thereby tricking you. This change is equivalent to always showing you what was in the "ignored changesets" collapsed details section of the blame popup, but faster! Want to see a cool thing to tempt you to read more? Look at https://clicky.visophyte.org/files/microannotate/nsWebBrowserPersist.cpp.html and then read more!


     Details

Revision history and the "annotate" / "blame" UIs for revision control are tricky because they're built on a sequential, line-centric data-model where moving a function above another function in a file results in a destructive representational decision to treat one function as continuing through history and the other function as removed and then re-added as new code. Reformatting that maintains the overall sequence of tokens but changes how they are distributed across multiple lines also looks like removal of all of the old code and the addition of new code. Tools frequently perform heuristic-based post-passes to help identify intra-line changes which are reflected in diff UIs, as well as (entire) lines of code that are copied/moved in a revision (ex: Phabricator does this).
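
(Illustrative sketch, not searchfox code: the effect described above can be reproduced with Python's standard `difflib`. The `changed_fraction` and `tokens` helpers and the sample C++ snippet are made up for this example, and the regex tokenizer is a crude stand-in for a real lexer. The same reformatting is measured once as a sequence of lines and once as a sequence of tokens.)

```python
import difflib
import re


def changed_fraction(old, new):
    """Fraction of the items in `old` that a sequence diff does not carry over to `new`."""
    matcher = difflib.SequenceMatcher(None, old, new, autojunk=False)
    kept = sum(block.size for block in matcher.get_matching_blocks())
    return 1 - kept / len(old)


def tokens(source):
    """Crude tokenizer (assumption): identifiers/numbers, or single punctuation characters."""
    return re.findall(r"\w+|[^\w\s]", source)


# The same function before and after a pure reformatting.
before = "int add(int a, int b) { return a + b; }\n"
after = "int add(int a,\n        int b) {\n  return a + b;\n}\n"

# Line-centric view: no line survives, so everything looks removed and re-added.
line_view = changed_fraction(before.splitlines(), after.splitlines())

# Token-centric view: the token sequence is identical, so nothing changed.
token_view = changed_fraction(tokens(before), tokens(after))

print(f"line-centric: {line_view:.0%} changed, token-centric: {token_view:.0%} changed")
```

A line-based blame would attribute every line of `after` to the reformatting revision, which is the situation the skipping heuristics were trying to work around.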

In Dec 2018, searchfox gained "hyperblame" functionality[1] that attempted to skip over the revisions listed in https://searchfox.org/mozilla-central/source/.git-blame-ignore-revs, which lists a number of code re-formattings / refactorings that don't intentionally have semantic content. As noted above, this is a tricky problem space and, especially when built on a line-centric model, additional heuristics and/or an alternate approach to the data-model are necessary. https://bugzilla.mozilla.org/show_bug.cgi?id=1517978 has tracked potential enhancements to this problem space, but the reality is that searchfox is not directly staffed[2] and although significant progress has been made on improving the blame infrastructure and UX[3], no one has had the time to attempt to further iterate on this significant undertaking.

Unfortunately, we have seen that the functionality as exposed by default has resulted in a number of bugs being filed that are evidence of people being (understandably!) misled by the current behavior; these are tracked as blocked by the aforementioned https://bugzilla.mozilla.org/show_bug.cgi?id=1517978 enhancement bug. It's always been possible to bypass the result of the heuristic by expanding the "N ignored changeset(s)" `<details>` in the blame strip popup, but this has been far from obvious or intuitive. Also, having an option that might be magically correct but also might be wrong, where figuring out which requires performing the extra steps you would have performed if the feature didn't exist, may be counter-productive. The heuristics were also applied at runtime and so had an ongoing dynamic cost that could impact the latency of the dynamic serving of files. (All HEAD revisions are statically pre-generated, but any other revision results in dynamic HTML generation for the heuristics, although searchfox otherwise precomputes blame[5].) There are no known active plans to further enhance blame-related functionality at this time, but there are possibilities[5] which are cool[5].

As such, blame-skipping functionality has been removed in https://github.com/mozsearch/mozsearch/pull/491 and this change has already taken effect for mozilla-central and will roll out to other trees over the next day. If you are interested in helping move searchfox's blame behavior forward or are an avid user of searchfox's blame behavior and would like to see enhancements, please talk to your manager[2] about allocating some time to contribute to searchfox or to help surface the demand.


Andrew (:asuth)

1: https://bugzilla.mozilla.org/show_bug.cgi?id=1507923 via https://github.com/mozsearch/mozsearch/pull/170

2: Quoting my other searchfox message from earlier this week: "Searchfox is a contributor driven project. If there are features you would like to see or rough edges that slow down your work, please talk to your manager and discuss the possibility of finding time to potentially contribute to the project yourself if you are interested, and/or help your manager surface the potential efficiency improvements for you/the organization so that these can be quantified and potentially folded into OKRs for teams with the relevant expertise and interest." We've had a short discussion on this in #searchfox today on chat.mozilla.org, but again, I'd emphasize that the most important discussion to have is making sure your manager knows what things could make your development experience more pleasant and more productive!

3: Noting that nearly all (if not all) blame improvements were made by :kats[4], we have had: https://bugzilla.mozilla.org/show_bug.cgi?id=1627532, https://bugzilla.mozilla.org/show_bug.cgi?id=1634770, https://bugzilla.mozilla.org/show_bug.cgi?id=1653245, https://bugzilla.mozilla.org/show_bug.cgi?id=1674601, https://bugzilla.mozilla.org/show_bug.cgi?id=1654946, https://bugzilla.mozilla.org/show_bug.cgi?id=1683563, https://bugzilla.mozilla.org/show_bug.cgi?id=1716914, and more!

4: Thank you, :kats! (And while :kats is no longer at MoCo, he has continued to make significant enhancements to searchfox and blame![4])

5: https://github.com/mozsearch/mozsearch/blob/master/docs/blame.md are the high-level searchfox docs on the blame pre-computation/caching mechanism. Note that this mechanism is extensible and could be extended to support applying additional levels of heuristic inference on line movement, as proposed at https://bugzilla.mozilla.org/show_bug.cgi?id=1517978#c1 (although the proposal does call for storing the more expensive heuristic computations in a more long-lived cache that would be reused even when otherwise regenerating the m-c blame cache, such as when we tuned copy/move detection heuristic values). Note, however, that the most interesting option at this time might be :marco's https://github.com/mozilla/microannotate/ approach which uses a transform to make the diff algorithm token-centric instead of line-centric. An example run of this (on old data from the transformed https://github.com/marco-c/gecko-dev-wordified/tree/master) is https://clicky.visophyte.org/files/microannotate/nsWebBrowserPersist.cpp.html where you can see it used on Gecko code. A slightly more fleshed out version of this mechanism (hover popups!) is https://github.com/cregit/cregit being applied to the linux kernel at the example of https://cregit.linuxsources.org/code/4.19/net/ethernet/eth.c.html.
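
(Illustrative sketch of the token-centric idea only. This is not how microannotate or cregit are implemented, and `wordify`, `token_blame`, and the revision names below are hypothetical. Each revision is split into tokens, and a token keeps its earlier attribution whenever a sequence diff matches it to a token in the previous revision.)

```python
import difflib
import re


def wordify(source):
    """Split source into tokens (assumption: a crude regex stands in for a real lexer)."""
    return re.findall(r"\w+|[^\w\s]", source)


def token_blame(history):
    """Blame each token of the newest revision.

    `history` is a list of (revision_id, source) pairs, oldest first.
    Returns a list of (token, revision_id) pairs for the newest source.
    """
    blamed = []
    for revision, source in history:
        new_tokens = wordify(source)
        old_tokens = [token for token, _ in blamed]
        matcher = difflib.SequenceMatcher(None, old_tokens, new_tokens, autojunk=False)
        # Start by blaming every token on this revision...
        result = [(token, revision) for token in new_tokens]
        # ...then restore the older attribution for tokens that were carried over.
        for old_start, new_start, size in matcher.get_matching_blocks():
            for offset in range(size):
                result[new_start + offset] = blamed[old_start + offset]
        blamed = result
    return blamed


history = [
    ("r1", "int add(int a, int b) { return a + b; }\n"),
    # Pure reformatting: same tokens, different line breaks.
    ("r2-reformat", "int add(int a,\n        int b) {\n  return a + b;\n}\n"),
    # A real change: '+' becomes '-'.
    ("r3", "int add(int a,\n        int b) {\n  return a - b;\n}\n"),
]

blame = token_blame(history)
for token, revision in blame:
    print(f"{revision:12} {token}")
```

In this toy history the reformatting revision never shows up in the result, with no ignore list needed, and only the `-` token is attributed to the last revision.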


