Re: [PATCH] memory: Make FlatView root references weak

Akihiko Odaki Thu, 06 Nov 2025 18:18:11 -0800

On 2025/11/07 2:50, Peter Xu wrote:

On Thu, Nov 06, 2025 at 11:23:32AM +0900, Akihiko Odaki wrote:

Generally speaking, we will not necessarily "always" get an issue report
when things went wrong with memory management. A bug in memory management
may not cause an immediate crash but corrupt the memory state which you will
find only later. The end result of memory corruption may look random and
result in a hard-to-debug issue report. A user may not even bother writing
an issue report at all; this is especially true for this kind of corner
cases that rarely happen.


There should have been no such a hazard of memory corruption if the code did
exactly what the documentation said in the first place. The consistency of
the code and the documentation is essential, especially for this kind of
complex and fundamental code.


Do you have encountered such case before?

I wasn't expecting that, because what you were saying looks more like what
Linux kernel would have a bug in mm.  QEMU is still special as it has the
default unassigned MR trapping everything by default, meanwhile normally
what is moving is MMIO / alias regions rather than real ramblocks.  It
means when things go wrong we have much higher chance to trap them
properly.

When I said "memory management" I meant the methods we use to allocateand free memory (the Linux equivalents would be kmalloc()/free()/kref),not the MM tracking or unassigned MR trapping behavior you mentioned.The unassigned MR trap and MMIO/alias movement are a separate concernand don’t change the underlying risk.

Concrete example: imagine an alias is allocated with g_new() and freedimmediately after object_unparent(). If that alias accidentally becomesthe FlatView root, destroying the FlatView later will callmemory_region_unref() and produce a use-after-free. We cannot predictwhat memory_region_unref() will read or write in that scenario — theresult can be arbitrary memory corruption that surfaces much later as ahard-to-debug, intermittent problem. Users often won’t file an issue forthese rare corner cases.


I also confess though that I'm pretty conservative on fixing things with
hypothetical issues.  In general, I prefer fixing things with explicit
problems, so we know how to measure and justify a fix (depending on how
aggressive the fix is and how much maintanence burden it will bring to
QEMU).  Without a real problem, it's harder to quantify that even if such
evaluation will also normally be subjective too.

Regarding your preference to fix only explicit problems: I understandthe conservatism, but here are the facts we need to weigh:


- The documentation claims we may free aliases because
  memory_region_ref() is never called, yet there is code that does call
  memory_region_ref().
- The patch adds code to align behavior with the documentation.

The significance of both potential impacts (the behavioral divergencefor devices other than pci-bridge, and the added complexity needed forconsistency) may be subjective and hypothetical, but that appliesequally to both sides.

In this case, the long-term reliability and maintainability of QEMUdepend on having the code behave as documented. Correctness should takeprecedence over simplicity.


Regards,
Akihiko Odaki

Re: [PATCH] memory: Make FlatView root references weak

Reply via email to