On 3/17/22 13:29, Dr. David Alan Gilbert wrote:
* Peter Maydell (peter.mayd...@linaro.org) wrote:
On Thu, 17 Mar 2022 at 14:03, Daniel Henrique Barboza
<danielhb...@gmail.com> wrote:
I've been looking into converting some vmstate_register() calls to use dc->vmsd,
using as a base the docs in docs/devel/migration.rst. This doc mentions that we
can either register the vmsd by using vmstate_register() or we can use dc->vmsd
for qdev-based devices.

When trying to convert this vmstate() call for the qdev alternative 
(hw/ppc/spapr_drc.c,
drc_realize()) I found this:

      vmstate_register(VMSTATE_IF(drc), spapr_drc_index(drc), 
&vmstate_spapr_drc,
                       drc);

spapr_drc_index() is an unique identifier for these DRC devices and it's being 
used
as instance_id. It is not clear to me how we can keep using this same 
instance_id when
using the dc->vmsd alternative. By looking a bit into migration files I 
understood
that if dc->vmsd is being used the instance_id is always autogenerated. Is that 
correct?

Not entirely. It is the intended common setup, but because changing
the ID value breaks migration compatibility there is a mechanism
for saying "my device is special and needs to set the instance ID
to something else" -- qdev_set_legacy_instance_id().

Yes, this is normally only an issue for 'system' or memory mapped
devices;  for things hung off a bus that has it's own device naming,
then each instance of a device has it's own device due to the bus name
so instance_id's aren't used.  Where you've got a few of the
same device with the same name, and no bus for them to be named by, then
the instance_id is used to uniquify them.



(long reply inc)

So, qdev_set_legacy_instance_id() doesn't set 'instance_id' as I've expected 
but rather
'alias_id'. The function will set dev->instance_id_alias, which is then used in 
device_set_realized()
as follows:


        if (qdev_get_vmsd(dev)) {
            if (vmstate_register_with_alias_id(VMSTATE_IF(dev),
                                               VMSTATE_INSTANCE_ID_ANY,
                                               qdev_get_vmsd(dev), dev,
                                               dev->instance_id_alias,
                                               dev->alias_required_for_version,
                                               &local_err) < 0) {
                goto post_realize_fail;
            }
        }

instance_id is set to VMSTATE_INSTANCE_ID_ANY, meaning that is  going to be 
autogenerated. The
SaveStateEntry (SE) will be generated with se->alias_id = (custom value we set) 
and
se->instance_id = autogenerated.

The migration stream transmits se->instance_id but not se->alias_id. When 
loading the migration
in the destination, find_se() will make a search using the received instance_id 
from the source
and compare it to both se->instance_id and se->alias_id from the destination.

If I try to convert an existing migratable device that is setting instance_id 
via vmstate_register()
to use qdev's dc->vmsd, if the existing code is already setting instance_id via 
vmstate_register(),
I end up breaking backward migration. This is what happened in patch
https://lists.gnu.org/archive/html/qemu-devel/2022-03/msg05617.html where I 
attempted this
conversion.

The code before the patch (B) has the following SEs for the device I changed:

===== spapr_iommu: se->instanceid = 0x80000000 se->alias_id = 0xffffffff ====
===== spapr_iommu: se->instanceid = 0x80000001 se->alias_id = 0xffffffff ====

And the code after the patch (A):

===== spapr_iommu: se->instanceid = 0x0 se->alias_id = 0x80000000 ====
===== spapr_iommu: se->instanceid = 0x1 se->alias_id = 0x80000001 ====


Migrating a pseries guest from B to A works because the new code, although 
using a different
instance_id, is matching with its alias_id. This is the output in A using the 
following trace:


    trace_qemu_loadvm_state_section_startfull(section_id, idstr, instance_id, 
version_id);

qemu_loadvm_state_section_startfull 15(vty@71000000/spapr_vty) 0 1
qemu_loadvm_state_section_startfull 16(nvram@71000001/spapr_nvram) 0 1
qemu_loadvm_state_section_startfull 560(spapr_iommu) 2147483648 2
qemu_loadvm_state_section_startfull 561(spapr_iommu) 2147483649 2
(...)

But the backward migration, A to B, doesn't work:

qemu_loadvm_state_section_startfull 560 (spapr_iommu) 0 2
qemu-system-ppc64: Unknown savevm section or instance 'spapr_iommu' 0. Make 
sure that your current
VM setup matches your saved VM setup, including any hotplugged devices
qemu-system-ppc64: load of migration failed: Invalid argument

The failure happens because the code without the patch is trying to match an 
instance_id = 0
(which A is now autogenerating) to its se->instance_id = 0x80000000 | se->alias 
= 0xffffffff.
The match fails and the error is thrown.


It seems that what I'm trying to do, convert vmstate_register() calls to qdev's 
dc->vmsd, when
the existing  code is setting custom instance_ids in vmstate_register(), is not 
feasible to be
done without breaking backward migration. At least with the current qdev APIs.
qdev_set_legacy_instance_id() helps to allow older guests to migrate to newer 
QEMUs, but not
the other way around.


Am I missing something here?


Thanks,


Daniel
















Dave

Given that this is a 13 year old comment from Anthony Liguori I wanted to 
confirm its
validity. Is there a long term goal of getting rid of instance_id? Can I ignore 
its
role when converting these calls to dc->vmsd?

Only if you're prepared to break migration compatibility, I think.

-- PMM


Reply via email to