Peter Xu <[email protected]> writes:
> On Thu, Nov 07, 2024 at 12:12:10PM +0100, Markus Armbruster wrote:
>> Peter Xu <[email protected]> writes:
>>
>> > On Fri, Oct 25, 2024 at 05:55:59PM -0400, Peter Xu wrote:
>> >> On Fri, Oct 25, 2024 at 11:25:23AM +0200, Markus Armbruster wrote:
>> >> > Peter Xu <[email protected]> writes:
>> >> >
>> >> > > X86 IOMMUs cannot be created more than one on a system yet. Make it a
>> >> > > singleton so it guards the system from accidentally create yet another
>> >> > > IOMMU object when one already presents.
>> >> > >
>> >> > > Now if someone tries to create more than one, e.g., via:
>> >> > >
>> >> > > ./qemu -M q35 -device intel-iommu -device intel-iommu
>> >> > >
>> >> > > The error will change from:
>> >> > >
>> >> > > qemu-system-x86_64: -device intel-iommu: QEMU does not support
>> >> > > multiple vIOMMUs for x86 yet.
>> >> > >
>> >> > > To:
>> >> > >
>> >> > > qemu-system-x86_64: -device intel-iommu: Class 'intel-iommu' only
>> >> > > supports one instance
>> >> > >
>> >> > > Unfortunately, yet we can't remove the singleton check in the machine
>> >> > > hook (pc_machine_device_pre_plug_cb), because there can also be
>> >> > > virtio-iommu involved, which doesn't share a common parent class yet.
>> >> > >
>> >> > > But with this, it should be closer to reach that goal to check
>> >> > > singleton by
>> >> > > QOM one day.
>> >> > >
>> >> > > Signed-off-by: Peter Xu <[email protected]>
>> >> >
>> >> > $ qemu-system-x86_64 -device amd-iommu,help
>> >> > /work/armbru/qemu/include/hw/boards.h:24:MACHINE: Object 0x56473906f960
>> >> > is not an instance of type machine
>> >> > Aborted (core dumped)
>>
>> [...]
>>
>> >> Thanks for the report!
>> >>
>> >> It turns out that qdev_get_machine() cannot be invoked too early, and the
>> >> singleton code can make it earlier..
>> >>
>> >> We may want a pre-requisite patch to allow qdev_get_machine() to be
>> >> invoked
>> >> anytime, like:
>> >>
>> >> ===8<===
>> >> diff --git a/hw/core/qdev.c b/hw/core/qdev.c
>> >> index db36f54d91..7ceae47139 100644
>> >> --- a/hw/core/qdev.c
>> >> +++ b/hw/core/qdev.c
>> >> @@ -831,6 +831,16 @@ Object *qdev_get_machine(void)
>> >> {
>> >> static Object *dev;
>> >>
>> >> + if (!phase_check(PHASE_MACHINE_CREATED)) {
>> >> + /*
>> >> + * When the machine is not created, below can wrongly create
>> >> + * /machine to be a container.. this enables qdev_get_machine()
>> >> to
>> >> + * be used at any time and return NULL properly when machine is
>> >> not
>> >> + * created.
>> >> + */
>> >> + return NULL;
>> >> + }
>> >> +
>> >> if (dev == NULL) {
>> >> dev = container_get(object_get_root(), "/machine");
>> >> }
>> >> ===8<===
>> >>
>> >> I hope it makes sense on its own.
>> >
>> > My apologies, spoke too soon here. This helper is used too after machine
>> > is created, but right before switching to PHASE_MACHINE_CREATE stage..
>>
>> container_get() is a trap.
>
> I had the same feeling.. Though I'd confess I'm not familiar enough with
> this part of code.
>
>>
>> When the object to be gotten is always "container", it merely
>> complicates container creation: it's implicitly created on first get.
>> Which of the calls creates may be less than obvious.
>>
>> When the object to be gotten is something else, such as a machine,
>> container_get() before creation is *wrong*, and will lead to trouble
>> later.
>>
>> In my opinion:
>>
>> * Hiding creation in getters is a bad idea unless creation has no
>> material side effects.
>>
>> * Getting anything but a container with container_get() is in bad taste.
>
> Agreed.
>
> IMHO container_get() interface might still be ok to implicitly create
> containers,
Creation on demand is fine when we want to create the thing only when
there is demand.
I guess it can also be okay when we want to create it always, but don't
want to decide when exactly (must be before first use), although I
suspect that's just lazy more often than not.
> but only if it will: (1) always make sure what it walks is a
> container along the way, and (2) never return any non-container.
Yes. Anything else invites abuse.
>> > So we need another way, like:
>> >
>> > ===8<===
>> >
>> > diff --git a/hw/core/qdev.c b/hw/core/qdev.c
>> > index db36f54d91..36a9fdb428 100644
>> > --- a/hw/core/qdev.c
>> > +++ b/hw/core/qdev.c
>> > @@ -832,7 +832,13 @@ Object *qdev_get_machine(void)
>> > static Object *dev;
>> >
>> > if (dev == NULL) {
>> > - dev = container_get(object_get_root(), "/machine");
>> > + /*
>> > + * NOTE: dev can keep being NULL if machine is not yet created!
>> > + * In which case the function will properly return NULL.
>> > + *
>> > + * Whenever machine object is created and found once, we cache it.
>> > + */
>> > + dev = object_resolve_path_component(object_get_root(), "machine");
>> > }
>> >
>> > return dev;
>>
>> Now returns null instead of a bogus container when called before machine
>> creation. Improvement of sorts. But none of the callers expect null...
>> shouldn't we assert(dev) here?
>>
>> Hmm, below you add a caller that checks for null.
>>
>> Another nice mess.
>
> I plan to put aside the application of singletons to x86-iommu as of now,
> due to the fact that qdev complexity may better be done separately.
>
> IOW, before that, I wonder whether we should clean up the container_get()
> as you discussed: it doesn't sound like a good interface to return
> non-container objects.
>
> I had a quick look, I only see two outliers of such, and besides the
> "abuse" in qdev_get_machine(), the only other one is
> e500_pcihost_bridge_realize():
>
> *** hw/core/qdev.c:
> qdev_get_machine[820] dev = container_get(object_get_root(),
> "/machine");
>
> *** hw/pci-host/ppce500.c:
> e500_pcihost_bridge_realize[422] PPCE500CCSRState *ccsr =
> CCSR(container_get(qdev_get_machine(),
"/e500-ccsr"));
Yes, this abuses container_get() to get an "e500-ccsr", which is a
device, not a container.
By the way, intentation is confusing here.
> If any of us thinks this is the right way to go, I can try to clean it up
> (for 10.0). qdev_get_machine() may still need to be able to return NULL
> when singleton applies to IOMMUs, but that can be for later. Before that,
> we can still assert(qdev), I think.
I think it's worthwhile.
> Just to mention I've posted rfcv2 for this series, again feel free to
> ignore patch 3-5 as of now:
>
> [PATCH RFC v2 0/7] QOM: Singleton interface
> https://lore.kernel.org/r/[email protected]
>
> I think the plan is Dan may keep collecting feedbacks on his other rfc:
>
> [RFC 0/5] RFC: require error handling for dynamically created objects
> https://lore.kernel.org/r/[email protected]
>
> Then after Dan's lands, I'll rebase my rfcv2 on top of his, dropping
> iommu/qdev changes.
>
> Thanks,
Makes sense. Thanks!