On Thu, 26 Nov 2020 17:23:20 +0100
Igor Mammedov <imamm...@redhat.com> wrote:

> On Thu, 26 Nov 2020 10:10:27 +0100
> Greg Kurz <gr...@kaod.org> wrote:
> 
> > On Thu, 26 Nov 2020 15:57:37 +1100
> > David Gibson <da...@gibson.dropbear.id.au> wrote:
> >   
> > > On Wed, Nov 25, 2020 at 10:51:05AM +0100, Greg Kurz wrote:    
> > > > On Wed, 25 Nov 2020 13:39:47 +1100
> > > > David Gibson <da...@gibson.dropbear.id.au> wrote:
> > > >     
> > > > > On Mon, Nov 23, 2020 at 12:51:08PM +0100, Greg Kurz wrote:    
> > > > > > On Mon, 23 Nov 2020 16:11:30 +1100
> > > > > > David Gibson <da...@gibson.dropbear.id.au> wrote:
> > > > > >     
> > > > > > > On Sat, Nov 21, 2020 at 12:42:03AM +0100, Greg Kurz wrote:    
> > > > > > > > When it comes to resetting the compat mode of the vCPUS, there 
> > > > > > > > are
> > > > > > > > two situations to consider:
> > > > > > > > (1) machine reset should set the compat mode back to the 
> > > > > > > > machine default,
> > > > > > > >     ie. spapr->max_compat_pvr
> > > > > > > > (2) hot plugged vCPUs should set their compat mode to mach the 
> > > > > > > > boot vCPU,
> > > > > > > >     ie. POWERPC_CPU(first_cpu)->compat_pvr
> > > > > > > > 
> > > > > > > > This is currently handled in two separate places: globally for 
> > > > > > > > all vCPUs
> > > > > > > > from the machine reset code for (1) and for each thread of a 
> > > > > > > > core from
> > > > > > > > the hotplug path for (2).
> > > > > > > > 
> > > > > > > > Since the machine reset code already resets all vCPUs, starting 
> > > > > > > > with boot
> > > > > > > > vCPU, consolidate the logic in spapr_reset_vcpu(). Special case 
> > > > > > > > the boot
> > > > > > > > vCPU so that it resets its compat mode back to the machine 
> > > > > > > > default. Any
> > > > > > > > other vCPU just need to match the compat mode of the boot vCPU.
> > > > > > > > 
> > > > > > > > Failing to set the compat mode during machine reset is a fatal 
> > > > > > > > error,
> > > > > > > > but not for hot plugged vCPUs. This is arguable because if 
> > > > > > > > we've been
> > > > > > > > able to set the boot vCPU compat mode at CAS or during machine 
> > > > > > > > reset,
> > > > > > > > it should definitely not fail for other vCPUs. Since 
> > > > > > > > spapr_reset_vcpu()
> > > > > > > > already has a fatal error path for kvm_check_mmu() failures, do 
> > > > > > > > the
> > > > > > > > same for ppc_set_compat().
> > > > > > > > 
> > > > > > > > This gets rid of an error path in spapr_core_plug(). It will 
> > > > > > > > allow
> > > > > > > > further simplifications.
> > > > > > > > 
> > > > > > > > Signed-off-by: Greg Kurz <gr...@kaod.org>
> > > > > > > > ---
> > > > > > > >  hw/ppc/spapr.c          | 16 ----------------
> > > > > > > >  hw/ppc/spapr_cpu_core.c | 13 +++++++++++++
> > > > > > > >  2 files changed, 13 insertions(+), 16 deletions(-)
> > > > > > > > 
> > > > > > > > diff --git a/hw/ppc/spapr.c b/hw/ppc/spapr.c
> > > > > > > > index f58f77389e8e..da7586f548df 100644
> > > > > > > > --- a/hw/ppc/spapr.c
> > > > > > > > +++ b/hw/ppc/spapr.c
> > > > > > > > @@ -1606,8 +1606,6 @@ static void 
> > > > > > > > spapr_machine_reset(MachineState *machine)
> > > > > > > >      spapr_ovec_cleanup(spapr->ov5_cas);
> > > > > > > >      spapr->ov5_cas = spapr_ovec_new();
> > > > > > > >  
> > > > > > > > -    ppc_set_compat_all(spapr->max_compat_pvr, &error_fatal);
> > > > > > > > -
> > > > > > > >      /*
> > > > > > > >       * This is fixing some of the default configuration of the 
> > > > > > > > XIVE
> > > > > > > >       * devices. To be called after the reset of the machine 
> > > > > > > > devices.
> > > > > > > > @@ -3785,20 +3783,6 @@ static void 
> > > > > > > > spapr_core_plug(HotplugHandler *hotplug_dev, DeviceState *dev,
> > > > > > > >  
> > > > > > > >      core_slot->cpu = OBJECT(dev);
> > > > > > > >  
> > > > > > > > -    /*
> > > > > > > > -     * Set compatibility mode to match the boot CPU, which was 
> > > > > > > > either set
> > > > > > > > -     * by the machine reset code or by CAS.
> > > > > > > > -     */
> > > > > > > > -    if (hotplugged) {
> > > > > > > > -        for (i = 0; i < cc->nr_threads; i++) {
> > > > > > > > -            if (ppc_set_compat(core->threads[i],
> > > > > > > > -                               
> > > > > > > > POWERPC_CPU(first_cpu)->compat_pvr,
> > > > > > > > -                               errp) < 0) {
> > > > > > > > -                return;
> > > > > > > > -            }
> > > > > > > > -        }
> > > > > > > > -    }
> > > > > > > > -
> > > > > > > >      if (smc->pre_2_10_has_unused_icps) {
> > > > > > > >          for (i = 0; i < cc->nr_threads; i++) {
> > > > > > > >              cs = CPU(core->threads[i]);
> > > > > > > > diff --git a/hw/ppc/spapr_cpu_core.c b/hw/ppc/spapr_cpu_core.c
> > > > > > > > index 2f7dc3c23ded..17741a3fb77f 100644
> > > > > > > > --- a/hw/ppc/spapr_cpu_core.c
> > > > > > > > +++ b/hw/ppc/spapr_cpu_core.c
> > > > > > > > @@ -27,6 +27,7 @@
> > > > > > > >  
> > > > > > > >  static void spapr_reset_vcpu(PowerPCCPU *cpu)
> > > > > > > >  {
> > > > > > > > +    PowerPCCPU *first_ppc_cpu = POWERPC_CPU(first_cpu);
> > > > > > > >      CPUState *cs = CPU(cpu);
> > > > > > > >      CPUPPCState *env = &cpu->env;
> > > > > > > >      PowerPCCPUClass *pcc = POWERPC_CPU_GET_CLASS(cpu);
> > > > > > > > @@ -69,6 +70,18 @@ static void spapr_reset_vcpu(PowerPCCPU *cpu)
> > > > > > > >      kvm_check_mmu(cpu, &error_fatal);
> > > > > > > >  
> > > > > > > >      spapr_irq_cpu_intc_reset(spapr, cpu);
> > > > > > > > +
> > > > > > > > +    /*
> > > > > > > > +     * The boot CPU is only reset during machine reset : reset 
> > > > > > > > its
> > > > > > > > +     * compatibility mode to the machine default. For other 
> > > > > > > > CPUs,
> > > > > > > > +     * either cold plugged or hot plugged, set the 
> > > > > > > > compatibility mode
> > > > > > > > +     * to match the boot CPU, which was either set by the 
> > > > > > > > machine reset
> > > > > > > > +     * code or by CAS.
> > > > > > > > +     */
> > > > > > > > +    ppc_set_compat(cpu,
> > > > > > > > +                   cpu == first_ppc_cpu ?
> > > > > > > > +                   spapr->max_compat_pvr : 
> > > > > > > > first_ppc_cpu->compat_pvr,
> > > > > > > > +                   &error_fatal);    
> > > > > > > 
> > > > > > > This assumes that when it is called for a non-boot CPU, it has 
> > > > > > > already
> > > > > > > been called for the boot CPU..  Are we certain that's guaranteed 
> > > > > > > by
> > > > > > > the sequence of reset calls during a full machine reset?
> > > > > > >     
> > > > > > 
> > > > > > This happens to be the case. Basically because the boot CPU core
> > > > > > is created (including registering its reset handler) first and
> > > > > > qemu_devices_reset() calls handlers in the same order they were
> > > > > > registered.    
> > > > > 
> > > > > Right, I assumed it works for now, but it seems rather fragile, since
> > > > > I'm not sure we're relying on guaranteed properties of the interface. 
> > > > >    
> > > > 
> > > > The reset handler interface is absolutely undocumented, so I guess we
> > > > have no formal guarantees at the present time. But since the current
> > > > implementation has the property, would it be acceptable to carve it
> > > > in stone with added documentation ? In the event of unlikely changes
> > > > to the reset handler logic, people would _just_ need to make sure
> > > > handlers are called in the same order they were registered.    
> > > 
> > > Yeah, maybe.
> > > 
> > > One other thing occurs to me: will we still do things in the right
> > > order if the (initial) boot cpu is hot unplugged, then replugged
> > > before a reset?
> > >     
> > 
> > This can't happen AFAICT.
> > 
> > (qemu) qom-get /machine/unattached/device[1] core-id
> > 0
> > (qemu) device_del /machine/unattached/device[1]
> > Error: Boot CPU core may not be unplugged
> > 
> > commit 62be8b044adf47327ebefdefb25f28a40316ebd0
> > Author: Bharata B Rao <bhar...@linux.vnet.ibm.com>
> > Date:   Wed Jul 27 10:44:42 2016 +0530
> > 
> >     spapr: Prevent boot CPU core removal
> > 
> > 
> > So yes, this adds yet another road block on the way to support hot
> > unplug of the boot CPU. Is this a concern ?
> > 
> > If we go forward with this patch, maybe I should mention in the
> > changelog/documentation the various assumptions which this patch
> > is made under:
> > - reset handlers are called in the same order they were registered
> > - boot CPU registers its reset handler before other CPUs
> > - boot CPU cannot be hot unplugged
> > 
> > These guarantee that the boot core is always reset before other
> > cores during reset.  
> it might work for now but it seems fragile to me.
> 
> What if we  make compat mode a property and move setting it to machine code,
> more precisely treat it like any other cpu feature property.
>   
>  if(need_compat_more)
>     register_global_property(compat_mode)
> 
> that way when any cpu is created it will have this property set
> and it won't depend on the order CPUs are created/reset

Ah it's more complicated, ignore this nonsense pls.

> 
> >   
> > > > > Is there any way we could at least assert() if things are called out
> > > > > of order?
> > > > >     
> > > > 
> > > > Maybe. I'll look into it.
> > > >     
> > > > > >     
> > > > > > > >  }
> > > > > > > >  
> > > > > > > >  void spapr_cpu_set_entry_state(PowerPCCPU *cpu, target_ulong 
> > > > > > > > nip,    
> > > > > > >     
> > > > > >     
> > > > > 
> > > > > 
> > > > >     
> > > >     
> > > 
> > > 
> > >     
> >   
> 
> 


Reply via email to