Re: [RFC PATCH v1 00/13] named CPU models for ARM64 on KVM

Eric Auger Wed, 20 May 2026 08:28:44 -0700

Hi Khushit,

On 5/20/26 3:08 PM, Khushit Shah wrote:
>> On 19 May 2026, at 9:21 AM, Shaju Abraham <[email protected]> wrote:
>>
>>
>>
>> Get Outlook for Mac 
>> From: Eric Auger <[email protected]>
>> Date: Monday, 18 May 2026 at 3:57 PM
>> To: Shaju Abraham <[email protected]>, [email protected] 
>> <[email protected]>, [email protected] <[email protected]>, 
>> [email protected] <[email protected]>, [email protected] 
>> <[email protected]>, [email protected] 
>> <[email protected]>, [email protected] <[email protected]>, 
>> [email protected] <[email protected]>, [email protected] 
>> <[email protected]>, [email protected] <[email protected]>, Andrea 
>> Bolognani <[email protected]>
>> Subject: Re: [RFC PATCH v1 00/13] named CPU models for ARM64 on KVM
>>
>> !-------------------------------------------------------------------|
>>  CAUTION: External Email
>>
>> |-------------------------------------------------------------------!
>>
>> Hi Shaju,˝
>>
>> On 5/13/26 6:33 PM, Shaju Abraham wrote:
>>> Hi All,
>>>
>>> This RFC introduces "named" CPU models for ARM64 KVM guests. This
>>> is foundational for cross-host live migration and management-stack
>>> control over individual CPU features exposed to the guest.
>>>
>>> TL;DR Examples:
>>>  # Boot with Grace CPU model
>>>  qemu-system-aarch64 -cpu grace-v1 -machine virt,accel=kvm ...
>>>
>>>  # Grace with a feature disabled
>>>  qemu-system-aarch64 -cpu grace-v1,feat_SHA1=off ...
>>>
>>>  # Host passthrough with individual feature control
>>>  qemu-system-aarch64 -cpu host,feat_AES=aes ...
>>>
>>>  # Neoverse v2 on Grace.
>>>  qemu-system-aarch64 -cpu neoverse-v2-v1
>>>
>>>  # Migration from Grace to Graviton3 (TBD)
>>>  qemu-system-aarch64 -cpu neoverse-v1-v1 ...
>>>
>>> Relationship with Auger/Huck's customizable host model [1]:
>>> We have been working on this series in parallel with [1]. Eric Auger and
>>> Cornelia Huck's series [1] exposes raw SYSREG_<REG>_<FIELD> uint64
>>> properties on -cpu host, providing the essential low-level knobs for ID
>>> register customization. This RFC builds on the same KVM capability
>> Please find some comments/questions below.
>>> and can be layered on top of [1]:
>>>  - Human-readable property names: feat_AES=pmull instead of
>>>    SYSREG_ID_AA64ISAR0_EL1_AES=2, with arch-defined named values
>>>    validated at set time.
>> From what I understand what you call feature here refers to an
>> ARM64_FTR_BITS definition in kernel arch/arm64/kernel/cpufeatures.c.
>> Named string values, safe policy and safe value are all extracted from
>> the kernel implementation and do not stem from the ARM ARM itself.
> Correct, a  CPU feature (that we talk about) is not the same as ARM ARM
> defined FEAT_*. As you say below, they really are much more complex to
> model. But `sve`, `pauth` defined by QEMU, are also not an ARM ARM
> defined FEAT. Because of the complexities of ARM ARM FEAT_s, we decided
> to go with ID register field-based properties with abstractions like 
> (fractional
> and composites) on top.
>
> Yes, value names, safe policy, and safe value are taken from the kernel. But
> in most cases, value names already correlate with ARM ARM.
>
> For example,ID_AA64ISAR1_EL1.LS64:
>
> ARM ARM says:
> FEAT_LS64 implements the functionality identified by 0b0001.
> FEAT_LS64_V implements the functionality identified by 0b0010.
> FEAT_LS64_ACCDATA implements the functionality identified by 0b0011.
> FEAT_LS64WB implements the functionality identified by 0b0100.
>
> Named values in cpu-idregs.h.inc:
> IDREG_FIELD_START(ID_AA64ISAR1, LS64, 60, 4, EXACT, 0)
> IDREG_FIELD_ARCH_VAL(0b0000, "off")
> IDREG_FIELD_ARCH_VAL(0b0001, "ls64")
> IDREG_FIELD_ARCH_VAL(0b0010, "ls64_v")
> IDREG_FIELD_ARCH_VAL(0b0011, "ls64_accdata")
> IDREG_FIELD_ARCH_VAL(0b0100, "ls64wb")
> IDREG_FIELD_END(ID_AA64ISAR1, LS64)
>
> Values names are: lowered feat names with FEAT_ removed.
>
> For “on”/“off” toggle features: for example (XS, just below LS64):
> We have:
> IDREG_FIELD_START(ID_AA64ISAR1, XS, 56, 4, LOWER, 0)
> IDREG_FIELD_ARCH_VAL(0b0000, "off")
> IDREG_FIELD_ARCH_VAL(0b0001, "on")
> IDREG_FIELD_END(ID_AA64ISAR1, XS)
>
> ARM ARM says:
> FEAT_XS implements the functionality identified by 0b0001.


I agree with you on the fact that in general the association is fine. I
am confident that developpers who wrote 

arch/arm64/kernel/cpufeatures.c did it in a sensible way. Still the problem 
comes when the end user/layered products have to guess those string values 
which are documented/defined nowhere besides in cpufeatures.c. How will they 
come to know which string values are supported and how they map onto any valid 
ARM ARM ID reg value? You cannot point them to any reference doc. Refering to 
past comments, Marc said he would even prefer not using kernel sysreg as a root 
of trust (hence the choice of extracting info from AARCHMRS) so I anticipate 
using arch/arm64/kernel/cpufeatures.c as input may be frown upon. But I may be 
wrong.

I guess the usage of string values really depends on how many props layered 
products or user will need to set explicitly. Assuming we expose named models 
and this will be the main use case, my assumption is quite a few props will 
remain to be overriden on top of named model hardcorded values. And since the 
prop granularity is very low level anyway, if the end user has to identify and 
set an ID reg field, it can also set the associated uint64 value directly.

My point is that compared to the original series Connie and I have been pushing,
- you still use the same prop granularity (ID reg field), somehow abusively 
called feature
- you add significant complexity related to the string values (which are not 
referenced anywhere in the spec)
- you also add significant complexity related to different kinds and name 
patterns of props (FRAC, STRING, BOOLEAN, NUMERIC) while we only used 
SYSREG_REG_FIELD uint64_t props.

While I am really interested in layering up named models on top of ID reg field 
properties, or even a real feature upper level abstraction, I am not sure the 
above extra complexity is worth while keeping the original IDreg field 
granularity.

  

>
> There may be a few value names that do not make sense right now, but
> that does not de-merit the idea that perfectly works for most fields.
>
> Regarding the safe-value tag, I feel that it is the one thing that must come
> from the kernel; in the end, KVM is gonna validate based on that. If the
> static file feels problematic, another option is to introduce a new KVM 
> ioctl. 
> We are fine with both.
safe policy and value are kernel defined. To be honest I still fail to
understand why they are mandated for named vcpu models. Please could
elaborate again?
Why can't you start from the host values and override them with the
named vcpu model settings? If the new value cannot be applied by KVM, it
will reject it and fail the vcpu init.
Moreover if one ID reg field that needs to be overriden is not writable
with that host kernel, the prop won't exist and the named model won't
work on that host either.
>
> Regarding default values, I do not see any reset values populated in
> AARCHMRS_OPENSOURCE_A_profile_FAT-2026-03/Registers.json, I
> may be missing something here. Is there some other place where we can
> find ARM ARM defined reset values?
>
>> If the named vcpu models anyway use hardcoded values I wonder if it is
>> so important to have named string values whereas a comment would do the
>> job in the named vcpu model definition?
> While true, for a named model, numerical field values will suffice as well. 
> It’s
> just a way of representation. But for end users, I will argue, named values
> are more user-friendly.
to me, again, it would be if those values were properly specified which
is not the case.
>
> On x86, the user does not modify a specific cpuid leaf to turn a feature on 
> or off.
> They have assigned a name to each leaf in the spec.  Our value names in most
in the spec ;-) That's a big difference.
> cases already make perfect sense, as mentioned above.
>
>> From a spec pov I had in mind that a defintion of a FEAT could be much
>> more complex that just 1 field (for instance could be a combination of
>> several of them)
> Agreed. Hence, the ID register field-based properties.
>
>> It is not clear to be if you allow the end-user to overwrite a property
>> on top of a named model setting.
> Yes, the end user can change the property on top of the named model as
> illustrated in the examples. 
>
>>>  - Default values and forward compatibility: CPU models start from a
>>>    known-zero baseline rather than the host view, so new fields/registers
>> [1]
>>>    introduced in future kernels do not silently leak into existing models.
>>>  - Named CPU models with hierarchical inheritance: grace-v1,
>>>    neoverse-v2-v1, etc.
>>>
>>> The two series can coexist; this series can be rebased on top of [1].
>>>
>>> [1] 
>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lore.kernel.org_qemu-2Ddevel_20260503073541.790215-2D1-2Deric.auger-40redhat.com_&d=DwIDaQ&c=s883GpUCOChKOHiocYtGcg&r=sY-XeNqcuy_ruBQ9T7A2LmG6ktyYXXSxRB1ljkxMepI&m=fpNKdg99IJhj_lpxANCw1YCGZfWRAyz1YWNycq_dH0Rdg8U46NwjIJ3BvFyYb-e7&s=xzjnJMwOvSYbQjuaej-yNezaOn0DWWfAJho6IG_AEto&e=
>>>
>>> Problems with defining "named" CPU models for ARM64 KVM guests:
>>>  * Features are not single CPUID bits. They are mostly multi-bit fields
>>>    encoding version/level instead of just presence. A single field encodes
>>>       multiple ARM ARM defined features (FEAT_s) at different thresholds.
>> would be good to provide an example for each challenge. I remember
>> Connie provided some in the past though her KVM forum presentations.
> Acked. Will add in v2.
>
>>>  * KVM does not allow all registers and fields to be modified for a guest.
>>>    Some fields KVM does not virtualise at all (SME) or only support host
>>>       values (BRPs, CWG, etc.). This is evolving and differs between kernel
>>>       versions.
>> this seems to be contradictory to [1]. Do you have a mix of host
>> inherited values and hardcoded value or do you only have hardcoded values?
> A named model will not be realizable on a given host if any of the above 
> non-writable
> fields differ between the host and the named model. We realize the named 
> model fully
> from a  zeroed-out cpu->isar.idregs[] (no host features) exposed. A model is 
> also not
why is it mandated to start from zeroed cpu->isar.idregs[]? Why can't
you start with host initialized cpu->isar.idregs[] and overwrite
everything that characterizes the named vcpu model.
> realizable on a host if it over-promises features that are not actually 
> available on the
> host (the need for safe-value tags).
>
>>>  * ARM does not have a single natural granularity for CPU models unlike
>>>    x86. ARM has architecture, reference core and SoC levels each becoming
>>>       more granular.
>>>  * ARM has dozens of vendors and it will be tricky to maintain models for
>>>    all of them.
>>>  * Previous designs started from the host values and then subtracted
>>>    undesirable features. This is not forward-compatible; the design
>>>    should work when a new ID register or field is introduced.
>>>
>>> With the above problems in mind, the design has 3 layers:
>>>
>>> 1. ARM ID Register Field Table:
>>>   - This layer maintains all architecturally defined ID registers and
>>>     ID register fields. It includes:
>>>               * Field name
>>>               * Field shift
>>>               * Field length
>>>               * Safe-value tag: LOWER, HIGHER, HIGHER_OR_ZERO, SIGNED_LOWER,
>>>                                                 EXACT, ANY
>>>                       This will be used to validate user-provided values 
>>> during
>>>                       CPU realization time against the host's value. I.e., 
>>> if the
>>>                       host only supports "aes", a CPU model that sets 
>>> "pmull"
>>>                       should be rejected.
>> why isn't the kernel doing that job already. Setting a value not
>> compatible with the host shall be rejected by the kernel, no?
> KVM will reject. But how will management stack know which values are 
> supported?
> x86 is simpler; a feature is a single-bit toggle. All management stacks infer 
> that if the
> value of a property is true, false is supported. And, for -cpu host if the 
> property value
> is false, the feature is not supported on the host.
Well I would imagine that a cloud vendor wants to migrate between
different hosts which call for a specific named model (so there is
minimal alignment between the vcpu model and the host cpu). In the worst
case it is always possible to instantiate scratch named model vcpus and
see if the vcpu init succeeds. We could even provide a qmp utility or
any other scheme that returns all valid vcpu models for that host.
Anyway I think this will be needed. or -cpu help may be enhanced to do
that, not simply returning the theoretical named models that are
available but the ones which actually apply on that kernel. I can help
you investigating that direction if it can come as an alternative.
>
> On ARM, we don’t have that luxury; the management stack does not know which
> values are valid for a property. Ambiguity is:
> 1. How will management stack even know that a property is writable/modifiable?
this is already available in the original series through 

query-cpu-model-expansion

> 2. For a 4-bit nibble, are all 16 values valid if it is writable?
>  a) Lower values than the host?
>  b) Higher values than the host? etc.
try to apply it and the kernel will simply let you know. Why
reimplementing this in qemu as it is done on the kernel with the safe
policy/value infra. 
>
> Having a safe value tag in QEMU allows us to provide useful information like
> named model ‘blockers’/‘unsupported-features’ and supported property values.
This can be achieved in a different way, relying on the kernel.
Instantiate a scratch vcpu model and see if it inits. Some scratch vcpu
inits are already in place in qemu.
>
> I feel the above should also be present for the customizable host model 
> series [1],
> providing valid values along with properties is much more useful for 
> management
> stack.

you can already introspect which ID reg are writables. You cannot get
the values that are valid for that host but any attempt to write invalid
values already fail the vcpu init. I tested that with kernel ID_FILTERED
id regs.
>
>>>               * Default value: The value to which the field is reset. This 
>>> gives
>>>                       CPU models a clean cpu.isar.idregs[] baseline instead 
>>> of the
>>>                       host view provided by the kernel, as in previous 
>>> designs.
>>>                       This also complements the forward-compatibility 
>>> story. Given
>>>                       the "default" values, higher levels need not worry 
>>> about new
>>>                       fields/registers being introduced.
>>>               * Architecturally defined named values like "off", "aes", 
>>> "pmull",
>>>                       etc.
>>>               * These values are derived from the kernel's ftr_bits array 
>>> and
>>>                 tools/sysreg file.
>>>    E.g:
>>>
>>>     IDREG_START(ID_AA64ISAR0)
>>>     IDREG_FIELD_START(ID_AA64ISAR0, AES, 4, 4, LOWER, 0)
>>>     IDREG_FIELD_ARCH_VAL(0b0000, "off")
>>>     IDREG_FIELD_ARCH_VAL(0b0001, "aes")
>>>     IDREG_FIELD_ARCH_VAL(0b0010, "pmull")
>>>     IDREG_FIELD_END(ID_AA64ISAR0, AES)
>>>        ....
>>>        IDREG_END(ID_AA64ISAR0)
>>>
>>>   - This layer is the single source of truth for ARM64 ID registers.
>> single source of truth extracted from the kernel and not from the spec.
>> What if we discover a bug in ARM64_FTR_BITS chang default or safe value
>> for instance. How does the change propagate to qemu and existing models?
> Relaxation of safe value tag is safe, (EXACT -> LOWER -> ANY). Change in
> default value should not happen; instead, that should be handled with the
> “kvm-base” model, which is defined for KVM quirks. Same implications as
> ARM ARM changes the field’s semantics.
>
>>>     The default values and safe-value tags are manually derived from the
>> default value, if equal to reset value could be extracted from the spec
>> directly.
> Again, I checked AARCHMRS_OPENSOURCE_A_profile_FAT-2026-03/Registers.json,
> and the reset values were null. I may be missing something. Is there any
> other place  we can get ARM ARM defined reset values? We can modify
> the script in [1] to also have default values for named model infra.
I guess it is normal because those regs are ID regs. The minimum level
of support comes from the various revisions and reference core
implementations. To me this is the huge job that needs to be done while
defining the various named models you drafted.
>>>        kernel's ftr_bits array. Other boilerplate and arch-defined values 
>>> are
>>>        script-generated.
>>>
>>>   - AArch32 ID registers are added with a single field so they can be
>>>     zeroed out on hosts that support AArch32.
>>>
>>>   - This layer also defines helpers for higher layers to extract and
>>>     manipulate ID register fields.
>>>       * arm_idregs_reset_to_defaults(): Reset all ID registers to their
>>>            default values.
>>>          * arm_idreg_field_read/write(): Read the value of an ID register
>>>            field.
>>>          * arm_arch_val_name/from_name(): Look up the arch-defined name for
>>>            a numeric field value.
>>>          * ...
>> this is a pity we have not exchanged on this earlier because that code
>> could have been shared instead of rewriting things and resetting all
>> credits.
> Let’s work together from now on :) 

yeah the problem is you reimplemented a different infrastructure for the
same prop granularity as you did not rely nor ever commented on the
original series. So the first thing is to converge on the base infra
(props) definition and then build layered named models on top of it. As
we do not agree for now, let's wait for other's comments...
I a bit frustrated you did not communicate with us before so that we
could have initiated cooperation before and even make the original
series evolve in the direction you needed for named vcpu models.

But anyway, my end goal is to get those named models in a decent
timeframe so I am obviously willing to cooperate, hence the time I spend
reviewing your alternative implementation ;-)  
>
>>>       - This layer creates the following tables using X-macro expansion:
>>>          * arm_idregs[]: Array of ID register descriptors.
>>>          * arm_field_locs[]: Array of field location descriptors.
>>>               (fieldIDx -> registerIndex, fieldIndex)
>>>          * ...
>>>
>>>    - The ArmIdReg struct also includes a writable_mask to track which
>>>      bits are writable by KVM. This is populated at runtime during
>>>      scratch VM creation, and is further used to validate that only
>>>      the writable bits are modified by the CPU model.
>> this is an interesting idea that could have been also used in previous
>> contributions.
> Acked.
>
>>> 2. ARM Properties Layer:
>>>   A small property layer on top of the ID Register Field table is defined.
>>>   This series defines two types of properties with plans for one more
>>>   in the future:
>>>      - Single field properties: These represent ARM FEAT_X features
>>>           that correspond to a single ID register field. Example: feat_AES,
>>>           feat_SHA2, etc.
>>>
>>>               The property name is set as "feat_<FieldName>" and possible 
>>> values
>>>               are the arch-defined named values. This can be further 
>>> categorized
>>>               into:
>>>                       * STRING: multi-bit fields (>=2 bits) with 
>>> arch-defined named
>>>                                 values, example: feat_AES, feat_SHA2, etc.
>>>                       * BOOLEAN: 1-bit fields only (true/false)
>>>                                 example: hw_prop_IDC, hw_prop_DIC, etc.
>>>                       * NUMERIC: IDREG_ANY fields with no named values (raw 
>>> integer)
>>>                                 example: hw_prop_BS, hw_prop_DZP, etc.
>> I just wonder if we need all that complexity if eventually we hardcode
>> values in named vcpu models
> Then we are just delegating the complexity to the management stack. For
> example, without fractional prop, the management stack must know CSV2,
> CSV2_Frac relation, and not all value combination of <CSV2>.<CSV2_Frac>
> are valid. Rather than having different management software solving the
> same problem, we can solve it once in QEMU.
well, named vcpu models hardcode 90% of the values, aren't they. Scratch
vcpu init attempts can let management layers know what named vcpu models
would work on that host I guess.
>
> We also have very active plans to hook up cpu-definitions, 
> cpu-model-expansion,
> and more.
>>>               String property values are validated against the arch-defined 
>>> named
>>>               values.
>>>
>>>               ID register fields that are not covered by single field 
>>> properties
>>>               are also exposed as a property named hw_prop_FieldName. These 
>>> are
>>>               usually implementation-defined values like cache geometry, 
>>> debug
>>>               counter widths, etc. (CTR_EL0.*, DCZID_EL0.*, etc.)
>>>               Example: hw_prop_BS, hw_prop_DZP, etc.
>> how do you discriminate between those and field_ props? Again do we need
>> that complexity?
> Ummm, there is no special code for these props. They are just named 
> differently
> as no ARM ARM FEAT_ relate to them.
Yes but I mean for the end user, this ends up with different prop names
which does not make their life easier to me, instead of SYS_REG_FIELD
pattern. also keep in mind that FEAT_ props are not, strictly speaking
what the spec call FEAT.
>
>>>               Single field properties are defined as:
>>>
>>>               ARM_PROP("prop_name", type, reg, field)
>>>               Example:
>>>               ARM_PROP("feat_AES", STRING, ID_AA64ISAR0, AES)
>>>
>>>               * Validation based on safe-value tags is yet to be 
>>> implemented.
>>>
>>>     - Fractional properties: These represent ARM FEAT_X features that
>>>           use two fields (base + frac) across registers. Example: feat_CSV2,
>>>           feat_MPAM, etc.
>>>
>>>               The property name is set as "feat_<BaseFieldName>" and 
>>> possible
>>>               values are the arch-defined string values like "0.0", "1.0", 
>>> "1.1",
>>>               etc.
>>>
>>>               Fractional properties are defined as:
>>>               ARM_FRACTIONAL_PROP("prop_name", base_reg, base_field, 
>>> frac_reg, frac_field)
>>>               Example:
>>>               ARM_FRACTIONAL_PROP("feat_CSV2", ID_AA64PFR0, CSV2, 
>>> ID_AA64PFR1, CSV2_FRAC)
>>>
>>>
>>>               When a fractional property is set, both the base field and 
>>> frac
>>>               field values are set to the corresponding values.
>>>               E.g: feat_CSV2=1.1 will set ID_AA64PFR0.CSV2=1 and 
>>> ID_AA64PFR1.CSV2_FRAC=1.
>>>
>>>       - Composite properties (planned for v2):
>>>          These will act as master boolean switches that control a list of
>>>          fields. Example: pauth, sve, etc. Setting sve=on with a named model
>>>          will set all the SVE-related fields (ID_AA64ZFR0_EL1.*) along with
>>>          sveNNN vector-length. Similarly, setting pauth=on will set APA, 
>>> GPA,
>>>          API, GPA3, GPI, GPA3 fields based on the named model.
>>>
>>>       - cpu_revision, cpu_partnum, etc. properties are introduced to expose
>>>         MIDR, REVIDR, AIDR fields.
>>>
>>>       Exceptions to the property naming are made for ID_AA64PFR0_EL1.ELx
>>>       fields, which are named elx_mode.
>> yet another naming exception. 
> A better name is always welcome! We can iterate over both the prop name and
> the value name.
yeah but is it making end user life easier?
>
>>>       This series defines over 130 single field properties plus 4
>>>       fractional properties. All properties work with -cpu host also.
>> Similary as with the host "custom" series, one needs to address
>> collision between legacy cpu properties and low level ones too.
> Agreed, This needs more thoughts. It is not a part of this RFC.
I have further thought on this when respinning the original series into
v5. I believe the props can follow a hierarchy. ID reg field props are
the lowest level props and apply first. Then come the higher level
legacy options that are likely to override them. I am waiting for
comments on the series but this is proposal I would make.
>
>>>       All properties change the cpu.isar.idregs[] values which are later
>>>       written back to KVM at the end of kvm_arch_init_vcpu().
>>>
>>>       * The arch-defined named values and property names can be iterated
>>>         until they make sense.
>>>
>>> 3. ARM CPU Model Hierarchy:
>>>
>>> A small named model layer is defined on top of the properties. An ARM named
>>> CPU model defines a list of property values and a parent model. A child
>>> model naturally inherits all the properties from its parent and can
>>> override them when needed.
>>>
>>> The initial model hierarchy shipped here is:
>>>    kvm-base-v1                  KVM-imposed quirks
>>>      arm-v8_4-a-v1              ARMv8.4-A architectural mandate
>>>          neoverse-v1-v1         Neoverse V1
>>>                   graviton3-v1         AWS Graviton3
>>>               arm-v9_0-a-v1            ARMv9.0-A architectural deltas on 
>>> top of ARM-v8_4-a-v1
>>>          neoverse-v2-v1         Neoverse V2
>>>            grace-v1             NVIDIA Grace
>> Do you have a strategy for named model validation, besides code review.
>> At least each vcpu named model shall be introduced in separate patch,
>> overrides clearly explained and links to the reference spec shall be
>> precisely given for review.
> Agreed, each model should be in a separate patch, with overrides and new
> introductions clearly mentioned/documented with supporting ID register dumps.

Yes I believe it is a huge work to assess the correctness of the
definition.
>
>>> (kvm-base-v1 and arm-vX are not meant to be realizable unless the
>>> user provides values for implementation-defined fields)
>>>
>>> So for example, grace-v1 defines Crypto fields and CTR_EL0.IDC/DIC on top
>>> of neoverse-v2-v1, which leaves those fields vendor-configurable.
>>>
>>> The hierarchy reflects a deliberate trade-off:
>>>  - Architecture-level models (arm-v8_4-a-v1) maximize migration
>>>    compatibility but lack implementation-defined values.
>>>  - Reference-core models (neoverse-v2-v1) enable migration across
>>>    SoCs sharing the same core design.
>>>  - SoC models (grace-v1) expose the full hardware feature set but
>>>    limit migration to hosts with the same SoC.
>> What kind of migration did you exercise up to now?
> Successful same host migration with named models and migration
> from grace to graviton3 with neoverse-v1.
>
>>> At model realization time,
>>>    1. a clean slate of cpu.isar.idregs[] is created using
>>>          arm_idregs_reset_to_defaults().
>>>       2. Then, a model's full parent-chain is walked and all properties are
>>>          applied in order from parent to child.
>>>       3. Finally, kvm_arm_writeback_idregs() compares the model's desired
>>>          ID-register values against the host-provided cpreg snapshot and
>>>          writes back the writable bits, warning on any non-writable 
>>> difference.
>>>
>>> Models will follow a monotonic versioning convention (grace-v1, grace-v2,
>>> ...) mirroring x86's scheme.
>>>
>>> * Please take the CPU model property values with a grain of salt.
>>>  They are added based on what the guest-visible values are with "host"
>>>  model on available hardware.
>> I don't catch the above statement.
> They don’t match the exact hardware specs. For example feat_CSV2 is capped
> at “1.0” by KVM regardless of what the hardware says. On Neoverse-v2, TRM
> says FEAT_CSV2_2 is implemented (i.e “2.0”)
OK. This is effectively an important aspect that, despite the value we
attempt to set on an ID reg field, KVM eventually does what he wants
with it according to its policy. I think it can ignore, sanitize, ...
>
>>> Benefits of this design:
>>>       - General benefits that come with properties and named CPU models,
>>>         like cross-host live migration, management-stack control over
>>>         feature exposure, etc.
>>>       - Forward compatibility: when a new ID register or field is
>>>         introduced, CPU models need not change; during realization they
>>>         will be populated with the default values. Only ID register/field
>>>         information needs to be added to the field table.
>>>       - As CPU models are hierarchical, defining a new model is much easier.
>> I like the hierarchical approach but to be honest, at the moment, I miss
>> knowledge on whether it safely applies. I agree that named vcpu models
>> are the end target goal while [1] is rather an intermediate step that
>> was paving the way for it. I rather saw [1] as a tool box for enhaving
>> the host model and understanding issues when migrating. I wish we could
>> share the foundations instead of having totally separate contributions. 
> Definitely, let’s build a common foundation. We need to agree on safe-value 
> tags,
> default values, and value names. We can build our solution on top of [1], if 
> at least
> safe-value tags and default values are present.

To me there are several aspects
1) decide which ID reg properties are needed/better (unique uint64_t or
your various prop flavours)
2) usage of safe policy/value (is it really mandated). Why can't we use
scratch vcpus. Possibly if it is proven to be mandatory this is
independent on 1
3) definition of named models

Thanks

Eric
>
>>>       - The property names and values are self-documenting.
>> not really sorry (because it does not match the spec). I don't think we
>> can ask the end-user to read the kernel code.
> Acked, We will add appropriate documentation in QEMU.
>
>>> NOTE: ~2200 of the ~3300 added lines are declarative (field table,
>>> model definitions, properties, etc.)
>>>
>>> Tested with KVM on an NVIDIA Grace host.
>>>
>>> Relationship with existing code base:
>>> - It does not change any TCG-based code paths.
>>> - For KVM host passthrough it just adds property support.
>> with hardcoded reset values, correct? Instead of host retrived values?
> Reset values are not used with the -cpu host. Only for the named model
> realization, reset values are used.
>
>>> - Does not change any existing properties or other code paths.
>> yes but if it words along with legacy options, you share our concern to
>> coexist with them
> Yes, it requires some more thoughts.
>
>>> - Can layer on top of the SYSREG_ property series [1].
>> but in that case why don't you simply reuse it to build the named vcpu
>> model. I don't say the previous properties are the ideal solution but I
>> am not sure the mix or heterogenously named ones introduced here + value
>> strings retrieved from the kernel are better. At least the SYSREG ones
>> matched the spec with raw values, which bring simplicity in the code.
>> And since the end user shall be so much involved in provided extra
>> SYSREG values himself on top of named vcpu models, ...
> We can build on [1] as long as we have default values and safe-value tags.
> They are required for the proper model realization and QMP integration.
>
>>> Planned Follow-ups:
>>>    - Composite properties with handling of sve, pauth for named models.
>> yes this is needed and I am currently working on this on [1]
>>>    - CLIDR_EL1 and CCSIDR_EL1 handling.
>>>       - Safe-value based validation logic.
>>>       - QMP commands like query-cpu-model-expansion are not hooked yet.
>>>         Blockers and supported values (calculated using safe-value tags
>>>         and runtime KVM writable masks) will be reported through them.
>>>         E.g. libvirt could report:
>>>           <property name='feat_AES' type='string' value='pmull'
>>>                     supports='off,aes,pmull'/>
>>>         and:
>>>           <cpu type='kvm' name='nvidia-grace-v1'
>>>                       typename='arm-nvidia-grace-v1-arm-cpu' usable='no'>
>>>             <blocker name='feat_AES'/>
>>>           </cpu>
>> adding Andrea for libvirt inputs
>>
>> Thanks
>>
>> Eric
> Warm Regards,
> Khushit

Re: [RFC PATCH v1 00/13] named CPU models for ARM64 on KVM

Reply via email to