On Tue, Apr 16, 2024 at 9:54 AM Akihiko Odaki <akihiko.od...@daynix.com> wrote:
>
> On 2024/04/16 13:00, Jason Wang wrote:
> > On Mon, Apr 15, 2024 at 10:05 PM Yuri Benditovich
> > <yuri.benditov...@daynix.com> wrote:
> >>
> >> On Wed, Apr 3, 2024 at 2:11 PM Akihiko Odaki <akihiko.od...@daynix.com> wrote:
> >>>
> >>> vhost requires eBPF for RSS. When eBPF is not available, virtio-net
> >>> implicitly disables RSS even if the user explicitly requests it. Return
> >>> an error instead of implicitly disabling RSS if RSS is requested but not
> >>> available.
> >>>
> >>> Signed-off-by: Akihiko Odaki <akihiko.od...@daynix.com>
> >>> ---
> >>>   hw/net/virtio-net.c | 97 ++++++++++++++++++++++++++---------------------------
> >>>   1 file changed, 48 insertions(+), 49 deletions(-)
> >>>
> >>> diff --git a/hw/net/virtio-net.c b/hw/net/virtio-net.c
> >>> index 61b49e335dea..3d53eba88cfc 100644
> >>> --- a/hw/net/virtio-net.c
> >>> +++ b/hw/net/virtio-net.c
> >>> @@ -793,9 +793,6 @@ static uint64_t virtio_net_get_features(VirtIODevice *vdev, uint64_t features,
> >>>           return features;
> >>>       }
> >>>
> >>> -    if (!ebpf_rss_is_loaded(&n->ebpf_rss)) {
> >>> -        virtio_clear_feature(&features, VIRTIO_NET_F_RSS);
> >>> -    }
> >>>       features = vhost_net_get_features(get_vhost_net(nc->peer), features);
> >>>       vdev->backend_features = features;
> >>>
> >>> @@ -3591,6 +3588,50 @@ static bool failover_hide_primary_device(DeviceListener *listener,
> >>>       return qatomic_read(&n->failover_primary_hidden);
> >>>   }
> >>>
> >>> +static void virtio_net_device_unrealize(DeviceState *dev)
> >>> +{
> >>> +    VirtIODevice *vdev = VIRTIO_DEVICE(dev);
> >>> +    VirtIONet *n = VIRTIO_NET(dev);
> >>> +    int i, max_queue_pairs;
> >>> +
> >>> +    if (virtio_has_feature(n->host_features, VIRTIO_NET_F_RSS)) {
> >>> +        virtio_net_unload_ebpf(n);
> >>> +    }
> >>> +
> >>> +    /* This will stop vhost backend if appropriate. */
> >>> +    virtio_net_set_status(vdev, 0);
> >>> +
> >>> +    g_free(n->netclient_name);
> >>> +    n->netclient_name = NULL;
> >>> +    g_free(n->netclient_type);
> >>> +    n->netclient_type = NULL;
> >>> +
> >>> +    g_free(n->mac_table.macs);
> >>> +    g_free(n->vlans);
> >>> +
> >>> +    if (n->failover) {
> >>> +        qobject_unref(n->primary_opts);
> >>> +        device_listener_unregister(&n->primary_listener);
> >>> +        migration_remove_notifier(&n->migration_state);
> >>> +    } else {
> >>> +        assert(n->primary_opts == NULL);
> >>> +    }
> >>> +
> >>> +    max_queue_pairs = n->multiqueue ? n->max_queue_pairs : 1;
> >>> +    for (i = 0; i < max_queue_pairs; i++) {
> >>> +        virtio_net_del_queue(n, i);
> >>> +    }
> >>> +    /* delete also control vq */
> >>> +    virtio_del_queue(vdev, max_queue_pairs * 2);
> >>> +    qemu_announce_timer_del(&n->announce_timer, false);
> >>> +    g_free(n->vqs);
> >>> +    qemu_del_nic(n->nic);
> >>> +    virtio_net_rsc_cleanup(n);
> >>> +    g_free(n->rss_data.indirections_table);
> >>> +    net_rx_pkt_uninit(n->rx_pkt);
> >>> +    virtio_cleanup(vdev);
> >>> +}
> >>> +
> >>>   static void virtio_net_device_realize(DeviceState *dev, Error **errp)
> >>>   {
> >>>       VirtIODevice *vdev = VIRTIO_DEVICE(dev);
> >>> @@ -3760,53 +3801,11 @@ static void virtio_net_device_realize(DeviceState *dev, Error **errp)
> >>>
> >>>       net_rx_pkt_init(&n->rx_pkt);
> >>>
> >>> -    if (virtio_has_feature(n->host_features, VIRTIO_NET_F_RSS)) {
> >>> -        virtio_net_load_ebpf(n);
> >>> -    }
> >>> -}
> >>> -
> >>> -static void virtio_net_device_unrealize(DeviceState *dev)
> >>> -{
> >>> -    VirtIODevice *vdev = VIRTIO_DEVICE(dev);
> >>> -    VirtIONet *n = VIRTIO_NET(dev);
> >>> -    int i, max_queue_pairs;
> >>> -
> >>> -    if (virtio_has_feature(n->host_features, VIRTIO_NET_F_RSS)) {
> >>> -        virtio_net_unload_ebpf(n);
> >>> +    if (virtio_has_feature(n->host_features, VIRTIO_NET_F_RSS) &&
> >>> +        !virtio_net_load_ebpf(n) && get_vhost_net(nc->peer)) {
> >>> +        virtio_net_device_unrealize(dev);
> >>> +        error_setg(errp, "Can't load eBPF RSS for vhost");
> >>>       }
> >>
> >> As I already mentioned, I think it is an extremely bad idea to
> >> fail to run qemu for such a reason as the absence of one feature.
> >> What I suggest is:
> >> 1. Redefine rss as tri-state (off|auto|on)
> >> 2. Fail to run only if rss is on and not available via ebpf
> >> 3. On auto - silently drop it
> >
> > "Auto" might be problematic for migration compatibility, which makes it
> > hard for management layers like libvirt to use. The reason is that
> > there's no way for libvirt to know whether the device supports it or not.
>
> Certainly auto is not good for migration, but it is useful in other
> situations. You can still set "on" or "off" if you care about migration.
> I'll add "auto" support in the next version.
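
For reference, the off|auto|on behaviour discussed above could be sketched
roughly as below. This is a standalone illustration, not QEMU code: the
RssMode enum and rss_resolve() are hypothetical names, and "ebpf_available"
stands in for whatever check the device would really perform (for example,
the result of loading the eBPF program).

```c
#include <stdbool.h>

/* Hypothetical tri-state mirroring the proposed rss=off|auto|on property. */
typedef enum {
    RSS_OFF,
    RSS_AUTO,
    RSS_ON,
} RssMode;

/*
 * Decide whether RSS ends up enabled.
 *
 * Returns 0 on success and stores the decision in *enabled.
 * Returns -1 when the user explicitly asked for "on" but eBPF is not
 * available; *enabled is left untouched in that case so the caller can
 * report an error and refuse to realize the device.
 */
static int rss_resolve(RssMode mode, bool ebpf_available, bool *enabled)
{
    switch (mode) {
    case RSS_OFF:
        /* Never offer the feature, whatever the host can do. */
        *enabled = false;
        return 0;
    case RSS_AUTO:
        /* Silently drop the feature when it cannot be provided. */
        *enabled = ebpf_available;
        return 0;
    case RSS_ON:
        /* An explicit request must either be honoured or fail loudly. */
        if (!ebpf_available) {
            return -1;
        }
        *enabled = true;
        return 0;
    }
    return -1; /* unknown mode */
}
```

With this shape only rss=on can make device creation fail. The result of
rss=auto depends on the host, which is the migration concern raised above,
so a management layer that needs a stable feature set would pass "on" or
"off" explicitly.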

It would be very nice if you moved this patch to a separate series; all
the others will pass without questions, I think.

Thanks,
Yuri Benditovich

>
> >
> > Thanks
> >
> >> 4. The same with 'hash' option - it is not compatible with vhost (at
> >> least at the moment)
> >> 5. Reformat the patch, as it is hard to review when entire procedures
> >> are replaced, i.e. one patch that moves the code without changes and
> >> another one with the real changes. If this is hard to review only for
> >> me, please ignore that.
>
> I'll split this patch accordingly in the next version.
>
> Regards,
> Akihiko Odaki
