On 10/2/25 4:21 PM, Rich Bowen wrote:
Hi, folks,For the last couple of months I’ve been producing these - https://boxofclue.com/apache-highlights/
interesting project
The process is that I have a checkout of every repo under https://github.com/apache (just metadata, not actual files. 2.9Gb total) and I grind through them to generate some metrics like:
this doesn't seem to work for svn mirrored repositories (httpd and spamassassin at least), is this a known issue ? Thanks Giovanni
First time commit (ie, had a commit in a merged PR the first time) 10th/100th/1000th/etc commit There are some false positives, which I think come from, for example, X makes a first-time commit to iceberg-fortran but they’ve contributed to iceberg-rust before. But for the most part, it gives a really great weekly snapshot of who the new people in your project are. I’ve gotten a couple positive comments from a handful of projects that are using this data to welcome new contributors, which was the intent of the thing. (I post it to Mastodon every week.) I’d like to run this on our VM, rather than running it on my laptop every Monday morning. I’d also like to link to the reports from a couple of places on our website (and don’t really want to link to boxofclue.com <http://boxofclue.com/> from there!) But I wanted to run it by you folks first, before taking the liberty to do that. Does anybody have any objections to me doing this? — Rich Bowen [email protected]
OpenPGP_signature.asc
Description: OpenPGP digital signature
