potiuk opened a new pull request, #416:
URL: https://github.com/apache/airflow-steward/pull/416

   ## Summary
   
   A dry-run of #414's pre-flight against a real adopter tracker surfaced two 
issues that kept the skip rate at ~5% instead of the predicted 30–50%:
   
   1. **"Last comment author is a bot" was structurally unreachable** on 
single-operator private trackers. The sync skill itself writes rollup updates 
as the operator's personal GitHub user, not as a `*[bot]` account — so the gate 
never fired.
   2. **The 7-day `updatedAt` safety override caught most trackers** because 
the sync itself bumps `updatedAt` every time it writes a rollup append or flips 
a label. The override conflated skill activity with substantive activity.
   
   ## Changes
   
   - **Skill-or-bot detection.** Treat a comment as bot-equivalent when its 
body starts with the framework's marker prefix `<!-- apache-steward: ` (matches 
every status-rollup / RM hand-off / wrap-up comment). Falls back to the 
original `*[bot]` login check plus an override-file hook for adopters with 
personal-account bots.
   - **Relaxed lifecycle skip rules.** Dropped the *"idle > 14 days"* gates — 
they were the safety net for the broken bot-detection. Once skill-or-bot 
detection works, the lifecycle label set + skill-last-write is enough signal.
   - **New rule:** `cve allocated + fix released + skill-last` → `skip-noop` 
(fix released; awaiting advisory propagation). Largest contributor to the new 
skip count in dry-run.
   - **Query change:** the GraphQL query now fetches `body` on the last comment 
(was just `author { login }` + `createdAt`). Response size grows with rollup 
length but stays cheaper than one subagent transcript.
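
A minimal Python sketch of the skill-or-bot check and the new `skip-noop` rule described above, not the skill's actual implementation. The marker prefix, the `*[bot]` login check, the label names, and the `skip-noop` result come from this description. Everything else is an assumption made for illustration: the names `LAST_COMMENT_QUERY`, `LastComment`, `parse_last_comment`, `is_skill_or_bot`, `classify`, the `extra_bot_logins` parameter (standing in for the override-file hook), the `"process"` fallback value, and the exact query shape.

```python
from dataclasses import dataclass
from typing import Optional

# Marker prefix quoted in the PR description.
SKILL_MARKER_PREFIX = "<!-- apache-steward: "

# Assumed query shape (not the PR's actual query): the last comment now
# carries `body` in addition to `author { login }` and `createdAt`.
LAST_COMMENT_QUERY = """
query($owner: String!, $name: String!, $number: Int!) {
  repository(owner: $owner, name: $name) {
    issue(number: $number) {
      updatedAt
      labels(first: 50) { nodes { name } }
      comments(last: 1) { nodes { author { login } createdAt body } }
    }
  }
}
"""


@dataclass(frozen=True)
class LastComment:
    """Hypothetical container for the fields the check needs."""
    author_login: str
    body: str


def parse_last_comment(issue: dict) -> Optional[LastComment]:
    """Extract the last comment from an `issue` payload shaped like the query."""
    nodes = issue.get("comments", {}).get("nodes") or []
    if not nodes:
        return None
    node = nodes[-1]
    author = node.get("author") or {}  # author is null for deleted accounts
    return LastComment(author.get("login", ""), node.get("body") or "")


def is_skill_or_bot(comment: LastComment, extra_bot_logins=frozenset()) -> bool:
    """Return True when the comment counts as bot-equivalent."""
    # 1. Skill-written comments start with the framework marker prefix.
    if comment.body.startswith(SKILL_MARKER_PREFIX):
        return True
    # 2. Original check: the login ends with "[bot]".
    if comment.author_login.endswith("[bot]"):
        return True
    # 3. Stand-in for the override-file hook (personal-account bots).
    return comment.author_login in extra_bot_logins


def classify(labels, comment: Optional[LastComment],
             extra_bot_logins=frozenset()) -> str:
    """Apply only the new rule; any other case falls through to "process"."""
    skill_last = comment is not None and is_skill_or_bot(comment, extra_bot_logins)
    if skill_last and {"cve allocated", "fix released"} <= set(labels):
        return "skip-noop"  # fix released; awaiting advisory propagation
    return "process"
```

Checking the marker before the login is what makes the gate reachable on single-operator trackers: a rollup written under the operator's personal account still matches on its body, while a human comment from the same account does not.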
   
   ## Measurement
   
   Same tracker set, before and after the rule change: skip rate rose from ~5% to 
~30%, and every new skip was a correctly identified steady-state tracker. 
The savings recur on every `sync all` invocation, so the per-sweep win 
compounds.
   
   ## Test plan
   
   - [x] `lychee` on the edited file — clean
   - [x] `skill-and-tool-validate` — no new violations
   - [x] `prek` (markdownlint, typos, format) — green
   - [ ] CI lychee + tests-ok on this PR
   - [ ] Real-world bulk sync after merge to confirm classifier still tracks 
reality
   
   🤖 Generated with [Claude Code](https://claude.com/claude-code)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
