On Tue, Aug 1, 2023 at 11:28 AM Jason Wang <jasow...@redhat.com> wrote:
>
> On Mon, Jul 31, 2023 at 5:41 PM Eugenio Perez Martin
> <epere...@redhat.com> wrote:
> >
> > On Mon, Jul 31, 2023 at 10:42 AM Jason Wang <jasow...@redhat.com> wrote:
> > >
> > > On Mon, Jul 31, 2023 at 4:05 PM Eugenio Perez Martin
> > > <epere...@redhat.com> wrote:
> > > >
> > > > On Mon, Jul 31, 2023 at 8:36 AM Jason Wang <jasow...@redhat.com> wrote:
> > > > >
> > > > > On Wed, Jul 26, 2023 at 2:27 PM Eugenio Perez Martin
> > > > > <epere...@redhat.com> wrote:
> > > > > >
> > > > > > On Wed, Jul 26, 2023 at 4:07 AM Jason Wang <jasow...@redhat.com> 
> > > > > > wrote:
> > > > > > >
> > > > > > > On Wed, Jul 26, 2023 at 2:21 AM Eugenio Pérez 
> > > > > > > <epere...@redhat.com> wrote:
> > > > > > > >
> > > > > > > > The device already has a virtio status set by vhost_vdpa_init 
> > > > > > > > by the
> > > > > > > > time vhost_vdpa_probe_cvq_isolation is called. vhost_vdpa_init 
> > > > > > > > set
> > > > > > > > S_ACKNOWLEDGE and S_DRIVER, so it is invalid to just reset it.
> > > > > > > >
> > > > > > > > It is invalid to start the device after it, but all devices 
> > > > > > > > seems to be
> > > > > > > > fine with it.  Fixing qemu so it follows virtio start procedure.
> > > > > > > >
> > > > > > > > Fixes: 152128d64697 ("vdpa: move CVQ isolation check to 
> > > > > > > > net_init_vhost_vdpa")
> > > > > > > > Reported-by: Dragos Tatulea <dtatu...@nvidia.com>
> > > > > > > > Signed-off-by: Eugenio Pérez <epere...@redhat.com>
> > > > > > > > ---
> > > > > > > >  net/vhost-vdpa.c | 2 ++
> > > > > > > >  1 file changed, 2 insertions(+)
> > > > > > > >
> > > > > > > > diff --git a/net/vhost-vdpa.c b/net/vhost-vdpa.c
> > > > > > > > index 9795306742..d7e2b714b4 100644
> > > > > > > > --- a/net/vhost-vdpa.c
> > > > > > > > +++ b/net/vhost-vdpa.c
> > > > > > > > @@ -1333,6 +1333,8 @@ static int 
> > > > > > > > vhost_vdpa_probe_cvq_isolation(int device_fd, uint64_t features,
> > > > > > > >  out:
> > > > > > > >      status = 0;
> > > > > > > >      ioctl(device_fd, VHOST_VDPA_SET_STATUS, &status);
> > > > > > > > +    status = VIRTIO_CONFIG_S_ACKNOWLEDGE | 
> > > > > > > > VIRTIO_CONFIG_S_DRIVER;
> > > > > > > > +    ioctl(device_fd, VHOST_VDPA_SET_STATUS, &status);
> > > > > > >
> > > > > > > So if we fail after FEATURES_OK, this basically clears that bit. 
> > > > > > > Spec
> > > > > > > doesn't say it can or not, I wonder if a reset is better?
> > > > > > >
> > > > > >
> > > > > > I don't follow this, the reset is just above the added code, isn't 
> > > > > > it?
> > > > >
> > > > > I meant for error path:
> > > > >
> > > > > E.g:
> > > > >     uint8_t status = VIRTIO_CONFIG_S_ACKNOWLEDGE |
> > > > >                      VIRTIO_CONFIG_S_DRIVER |
> > > > >                      VIRTIO_CONFIG_S_FEATURES_OK;
> > > > > ...
> > > > >     r = ioctl(device_fd, VHOST_VDPA_SET_STATUS, &status);
> > > > > ....
> > > > >         if (cvq_group != -ENOTSUP) {
> > > > >             r = cvq_group;
> > > > >             goto out;
> > > > >         }
> > > > >
> > > > > out:
> > > > >     status = VIRTIO_CONFIG_S_ACKNOWLEDGE | VIRTIO_CONFIG_S_DRIVER;
> > > > >     ioctl(device_fd, VHOST_VDPA_SET_STATUS, &status);
> > > > >
> > > > > We're basically clearing FEATURES_OK?
> > > > >
> > > >
> > > > Yes, it is the state that previous functions (vhost_vdpa_init) set. We
> > > > need to leave it that way, either if the backend supports cvq
> > > > isolation or not, or in the case of an error. Not doing that way makes
> > > > vhost_dev_start (and vhost_vdpa_set_features) set the features before
> > > > setting VIRTIO_CONFIG_S_ACKNOWLEDGE | VIRTIO_CONFIG_S_DRIVER.
> > > > Otherwise, the guest can (and do) access to config space before
> > > > _S_ACKNOWLEDGE | _S_DRIVER.
> > >
> > > I'm not sure if it is supported by the spec or not (I meant clearing
> > > the FEATURES_OK). Or maybe we need a reset here?
> > >
> >
> > Sorry, I'm still missing it :). The reset just above in all fail
> > paths. They go to "out" label, and the first ioctl reset the device,
> > the second set the VIRTIO_CONFIG_S_ACKNOWLEDGE |
> > VIRTIO_CONFIG_S_DRIVER.
>
> Just to make sure we are at the same page:
>
> On error we basically do:
>
> set_status(VIRTIO_CONFIG_S_ACKNOWLEDGE | VIRTIO_CONFIG_S_DRIVER |
> VIRTIO_CONFIG_S_FEATURES_OK);
> ...
> set_status(VIRTIO_CONFIG_S_ACKNOWLEDGE | VIRTIO_CONFIG_S_DRIVER)
>
> So it means the device allows the driver to clear FEATURES_OK. But
> spec is unclear whether or not this is supported. So I'm not sure it
> is supported by all devices.

Ok, I think I miss the set_status(0), so this patch should be fine.

Acked-by: Jason Wang <jasow...@redhat.com>

Thanks

>
> Thanks
>
> >
> > > Thanks
> > >
> > > >
> > > >
> > > > > >
> > > > > > > Btw, spec requires a read of status after setting FEATURES_OK, 
> > > > > > > this
> > > > > > > seems to be missed in the current code.
> > > > > > >
> > > > > >
> > > > > > I'm ok with that, but this patch does not touch that part.
> > > > > >
> > > > > > To fix this properly we should:
> > > > > > - Expose vhost_vdpa_set_dev_features_fd as we did in previous 
> > > > > > versions
> > > > > > of the series that added vhost_vdpa_probe_cvq_isolation [1].
> > > > > > - Get status after vhost_vdpa_add_status, so both vhost start code 
> > > > > > and
> > > > > > this follows the standard properly.
> > > > > >
> > > > > > Is it ok to do these on top of this patch?
> > > > >
> > > > > Fine.
> > > > >
> > > > > Thanks
> > > > >
> > > > > >
> > > > > > Thanks!
> > > > > >
> > > > > > [1] 
> > > > > > https://lore.kernel.org/qemu-devel/20230509154435.1410162-4-epere...@redhat.com/
> > > > > >
> > > > > >
> > > > > > > Thanks
> > > > > > >
> > > > > > > >      return r;
> > > > > > > >  }
> > > > > > > >
> > > > > > > > --
> > > > > > > > 2.39.3
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >


Reply via email to