On Fri, Sep 05, 2025 at 03:33:14PM -0400, Konstantin Ryabitsev wrote:
On Fri, Sep 05, 2025 at 11:06:01AM -0700, Linus Torvalds wrote:
Anyway, the "discourage mindless use" might be as simple as a big
warning message that the link may be just adding annoying overhead.
In contrast, a "perfect" model might be to actually have some kind of
automation of "unless there was actual discussion about it".
But I feel such a model might be much too complicated, unless somebody
*wants* to explore using AI because their job description says "Look
for actual useful AI uses". In today's tech world, I assume such job
descriptions do exist. Sigh.
So, I did work on this for a while before running out of credits, and there
were the following stumbling blocks:
- consuming large threads is expensive; a thread of 20 patches and a bunch of
follow-up discussions costs $1 of API credits just to process. I realize
it's peanuts for a lot of full-time maintainers who have corporate API
contracts, but it's an important consideration
- the LLMs did get confused about who said what when consuming long threads,
at least with the models at the time. Maybe more modern models are better at
this than those I tried a year ago. Misattributing things can be *really*
bad in the context of decision making, so I found this the most troubling
aspect of "have AI analyze this series and tell me if everyone important is
okay with it."
Quick note on this: I observed the same thing, but found that using structured
format (i.e. lei q --format json) really helps with this issue.
- the models I used were proprietary (ChatGPT, Claude, Gemini), because I
didn't have access to a good enough system to run ollama with a large enough
context window to analyze long email threads. Even ollama is questionably
"open source" -- but don't need to get into that aspect of it in this
thread.
However, I feel that LLMs can be generally useful here, when handled with
care and with a good understanding that they do and will get things wrong.
I'm facing a similar challange both for the AUTOSEL and the CVE work: there is
very little historical context in most commits, and the Link: tag is almost
always useless and just points to the final submission of a patch rather than a
relevant discussion around that code.
I ended up creating an AI agent that knows how to dig through both a local git
repo as well as our mailing list using lei-q and knows to search related
dashboards like kernelci, lkft, the syzbot dashboard, etc.
I'm not sure if at it's current form it's useful to anyone else, but here's an
example of what it generates on the patch in question:
Mailing List History for commit 0f51a5c0a89921deca72e42583683e44ff742d06
========================================================================
Author: Caleb Sander Mateos <[email protected]>
Date: Thu Sep 4 19:25:34 2025 -0600
Subject: io_uring/rsrc: initialize io_rsrc_data nodes array
## Timeline of Events
1. **October 25, 2024**: Jens Axboe commits major refactoring
- Commit: 7029acd8a950 ("io_uring/rsrc: get rid of per-ring io_rsrc_node
list")
- Major rewrite eliminating per-ring serialization of resource nodes
- Addressed resource reclaim stalls in networked workloads
2. **April 4, 2025**: Pavel Begunkov fixes related issue
- Commit: ab6005f3912f ("io_uring: don't post tag CQEs on file/buffer
registration failure")
- Also fixes issues introduced by commit 7029acd8a950
- Reference:
https://lore.kernel.org/r/c514446a8dcb0197cddd5d4ba8f6511da081cf1f.1743777957.git.asml.sile...@gmail.com
3. **September 4, 2025, 19:25 MDT**: Caleb submits initialization fix
- Message-ID: [email protected]
- Sent to: [email protected], [email protected]
- Direct submission to Jens Axboe
4. **September 4, 2025, 19:50 MDT**: Jens applies the patch
- Applied within 25 minutes of submission
- No public review or discussion found
5. **September 5, 2025, 05:14 MDT**: Jens sends acknowledgment
- Message-ID: [email protected]
- Simple "Applied, thanks!" with b4 tool
- Assigned commit: 0f51a5c0a89921deca72e42583683e44ff742d06
## Key Findings from Mailing List Search
### Minimal Public Discussion
- **No pre-submission review**: No RFC or v1 versions found
- **No public bug reports**: No KASAN, syzkaller, or user bug reports found
that directly led to this fix
- **No post-submission discussion**: Only Jens' acknowledgment found
- **No testing tags**: No Tested-by or Reviewed-by tags
### Related Activity
- Pavel Begunkov's earlier fix (April 2025) shows the original refactoring had
multiple issues
- Both fixes target error paths in the resource registration code
- Pattern suggests issues were found through code review rather than runtime
failures
--
Thanks,
Sasha