On Wed, Jun 4, 2014 at 10:30 PM, Michael S. Tsirkin <m...@redhat.com> wrote:
> On Tue, May 27, 2014 at 03:03:21PM +0300, Nikolay Nikolaev wrote:
> > In this patch series we would like to introduce our approach for putting a
> > virtio-net backend in an external userspace process. Our eventual target is to
> > run the network backend in the Snabbswitch ethernet switch, while receiving
> > traffic from a guest inside QEMU/KVM which runs an unmodified virtio-net
> > implementation.
> >
> > For this, we are working on extending vhost to allow equivalent functionality
> > for userspace. Vhost already passes control of the data plane of virtio-net to
> > the host kernel; we want to realize a similar model, but for userspace.
> >
> > In this patch series the concept of a vhost-backend is introduced.
> >
> > We define two vhost backend types - vhost-kernel and vhost-user. The former is
> > the interface to the current kernel module implementation. Its control plane is
> > ioctl based. The data plane is realized by the kernel directly accessing the
> > QEMU-allocated guest memory.
> >
> > In the new vhost-user backend, the control plane is based on communication
> > between QEMU and another userspace process using a Unix domain socket. This
> > makes it possible to implement a virtio backend for a guest running in QEMU,
> > inside the other userspace process. For this communication we use a chardev
> > with a Unix domain socket backend. Vhost-user is client/server agnostic
> > regarding the chardev; however, it does not support the 'nowait' and 'telnet'
> > options.
> >
> > We rely on the memdev with a memory-file backend. The backend's share=on option
> > should be used. HugeTLBFS is required for this option to work.
> >
> > The data path is realized by directly accessing the vrings and the buffer data
> > off the guest's memory.
> >
> > The current user of vhost-user is only vhost-net. We add a new netdev backend
> > that is intended to initialize vhost-net with the vhost-user backend.
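
For readers following the thread, here is a minimal sketch of the control-plane idea described above. It is not code from the series (which is C inside QEMU). It shows a message sent over a Unix domain socket, with a file descriptor passed only in the ancillary data and the receiver reading in two steps, header then payload, as the changelog further down mentions. The header layout (three packed 32-bit little-endian fields), the position of the version bits in `flags`, the request number and all helper names are assumptions made for illustration.

```python
import os
import socket
import struct

# ASSUMPTION: three packed 32-bit little-endian fields (request, flags, size).
# The thread only says `flags` and `size` exist and that the structure is packed.
HEADER = struct.Struct("<III")
VERSION = 0x1            # assumed version value
VERSION_MASK = 0x3       # assumed: version lives in the low bits of `flags`
DEMO_REQUEST = 1         # hypothetical request number, illustration only
INT_SIZE = struct.calcsize("i")


def send_msg(sock, request, payload=b"", fds=()):
    """Send header and payload; fds travel only as SCM_RIGHTS ancillary data."""
    header = HEADER.pack(request, VERSION, len(payload))
    ancillary = []
    if fds:
        packed = struct.pack(f"{len(fds)}i", *fds)
        ancillary.append((socket.SOL_SOCKET, socket.SCM_RIGHTS, packed))
    # Small messages go out in one call; a real sender would handle short writes.
    sock.sendmsg([header + payload], ancillary)


def recv_exact(sock, count):
    """Read exactly `count` bytes or raise if the peer disconnects."""
    buf = b""
    while len(buf) < count:
        chunk = sock.recv(count - len(buf))
        if not chunk:
            raise ConnectionError("peer closed the socket")
        buf += chunk
    return buf


def recv_msg(sock, max_fds=8):
    """Step 1: header plus any fds. Step 2: exactly `size` payload bytes."""
    data, ancdata, _flags, _addr = sock.recvmsg(
        HEADER.size, socket.CMSG_LEN(max_fds * INT_SIZE))
    if not data:
        raise ConnectionError("peer closed the socket")
    if len(data) < HEADER.size:
        data += recv_exact(sock, HEADER.size - len(data))
    fds = []
    for level, ctype, cdata in ancdata:
        if level == socket.SOL_SOCKET and ctype == socket.SCM_RIGHTS:
            count = len(cdata) // INT_SIZE
            fds.extend(struct.unpack(f"{count}i", cdata[:count * INT_SIZE]))
    request, flags, size = HEADER.unpack(data)
    payload = recv_exact(sock, size) if size else b""
    return request, flags, payload, fds


def demo():
    """Pass the write end of a pipe across a socketpair and use the copy."""
    left, right = socket.socketpair(socket.AF_UNIX, socket.SOCK_STREAM)
    read_end, write_end = os.pipe()
    received = []
    try:
        send_msg(left, DEMO_REQUEST, b"hello", fds=[write_end])
        request, flags, payload, received = recv_msg(right)
        os.write(received[0], b"kick")      # the received fd is a working duplicate
        woken = os.read(read_end, 4)
        return request, flags & VERSION_MASK, payload, woken
    finally:
        for fd in (read_end, write_end, *received):
            os.close(fd)
        left.close()
        right.close()


if __name__ == "__main__":
    print(demo())
```

The pipe in `demo()` only stands in for a notification descriptor. The point of the sketch is that the descriptor arrives alongside the header bytes, while the payload length is taken from the `size` field before the second read.
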
> >
> > Example usage:
> >
> > qemu -m 512 \
> >     -object memory-file,id=mem,size=512M,mem-path=/hugetlbfs,share=on \
> >     -numa node,memdev=mem \
> >     -chardev socket,id=chr0,path=/path/to/socket \
> >     -netdev type=vhost-user,id=net0,chardev=chr0 \
> >     -device virtio-net-pci,netdev=net0
> >
> > On non-MSIX guests the vhost feature can be forced using a special option:
> >
> > ...
> >     -netdev type=vhost-user,id=net0,chardev=chr0,vhostforce
> > ...
> >
> > In order to use ioeventfds, kvm should be enabled.
> >
> > The work is made on top of the NUMA patch series v3.2
> > http://lists.gnu.org/archive/html/qemu-devel/2014-05/msg02706.html
> >
> > This code can be pulled from g...@github.com:virtualopensystems/qemu.git vhost-user-v10
> > A simple functional test is available in tests/vhost-user-test.c
> >
> > A reference vhost-user slave for testing is also available from
> > g...@github.com:virtualopensystems/vapp.git
> >
> > Changes from v9:
> > - Rebased on the NUMA memdev patchseries and reworked to use memdev
>
> OK so I should wait until NUMA memdev is merged
> before merging this one?
>

I guess it is still planned for 2.1, as Paolo said:
https://lists.gnu.org/archive/html/qemu-devel/2014-03/msg00582.html

> > - Removed -mem-path refactoring
> > - Removed all reconnection code
> > - Fixed 100% CPU usage in the G_IO_HUP handler after disconnect
> > - Reworked vhost feature bits handling so vhost-user has better control in
> >   the negotiation
> >
> > Changes from v8:
> > - Removed prealloc property from the -mem-path refactoring
> > - Added and use new function - kvm_eventfds_enabled
> > - Add virtio_queue_get_avail_idx used in vhost_virtqueue_stop to
> >   get a sane value in case of VHOST_GET_VRING_BASE failure
> > - vhost user uses kvm_eventfds_enabled to check whether the ioeventfd
> >   capability of KVM is available
> > - Added flag VHOST_USER_VRING_NOFD_MASK to be set when KICK, CALL or ERR file
> >   descriptor is invalid or ioeventfd is not available
> >
> > Changes from v7:
> > - Slave reconnection when using chardev in server mode
> > - qtest vhost-user-test added
> > - New qemu_chr_fe_get_msgfds for reading multiple fds from the chardev
> > - Mandatory features in vhost_dev, used on reconnect to verify for conflicts
> > - Add vhostforce parameter to -netdev vhost-user (for non-MSIX guests)
> > - Extend libqemustub.a to support qemu-char.c
> >
> > Changes from v6:
> > - Remove the 'unlink' property of '-mem-path'
> > - Extend qemu-char: blocking read, send fds, monitor for connection close
> > - Vhost-user uses chardev as a backend
> > - Poll and reconnect removed (no VHOST_USER_ECHO).
> > - Disconnect is detected by the chardev (G_IO_HUP event)
> > - vhost-backend.c split to vhost-user.c
> >
> > Changes from v5:
> > - Split -mem-path unlink option to a separate patch
> > - Fds are passed only in the ancillary data
> > - Stricter message size checks on receive/send
> > - Netdev vhost-user now includes path and poll_time options
> > - The connection probing interval is configurable
> >
> > Changes from v4:
> > - Use error_report for errors
> > - VhostUserMsg has new field `size` indicating the following payload length.
> >   Field `flags` now has version and reply bits. The structure is packed.
> > - Send data is of variable length (`size` field in message)
> > - Receive in 2 steps, header and payload
> > - Add new message type VHOST_USER_ECHO, to check connection status
> >
> > Changes from v3:
> > - Convert -mem-path to QemuOpts with prealloc, share and unlink properties
> > - Set 1 sec timeout when read/write to the unix domain socket
> > - Fix file descriptor leak
> >
> > Changes from v2:
> > - Reconnect when the backend disappears
> >
> > Changes from v1:
> > - Implementation of vhost-user netdev backend
> > - Code improvements
> >
> >
> > ---
> >
> > Nikolay Nikolaev (18):
> >   Add kvm_eventfds_enabled function
> >   Add chardev API qemu_chr_fe_read_all
> >   Add chardev API qemu_chr_fe_set_msgfds
> >   Add chardev API qemu_chr_fe_get_msgfds
> >   Add G_IO_HUP handler for socket chardev
> >   vhost: add vhost_get_features and vhost_ack_features
> >   vhost_net should call the poll callback only when it is set
> >   Refactor virtio-net to use generic get_vhost_net
> >   vhost_net_init will use VhostNetOptions to get all its arguments
> >   Add vhost_ops to vhost_dev struct and replace all relevant ioctls
> >   Add vhost-backend and VhostBackendType
> >   Add vhost-user as a vhost backend.
> >   vhost-net: vhost-user feature bits support
> >   Add new vhost-user netdev backend
> >   Add the vhost-user netdev backend to the command line
> >   Add vhost-user protocol documentation
> >   libqemustub: add stubs to be able to use qemu-char.c
> >   Add qtest for vhost-user
> >
> >
> >  docs/specs/vhost-user.txt         | 261 ++++++++++++++++++++++++++++
> >  hmp-commands.hx                   |   4
> >  hw/net/vhost_net.c                | 228 +++++++++++++++++--------
> >  hw/net/virtio-net.c               |  29 +--
> >  hw/scsi/vhost-scsi.c              |  45 +++--
> >  hw/virtio/Makefile.objs           |   2
> >  hw/virtio/vhost-backend.c         |  71 ++++++++
> >  hw/virtio/vhost-user.c            | 342 +++++++++++++++++++++++++++++++++++++
> >  hw/virtio/vhost.c                 |  82 ++++++---
> >  include/hw/virtio/vhost-backend.h |  38 ++++
> >  include/hw/virtio/vhost.h         |  13 +
> >  include/net/vhost-user.h          |  17 ++
> >  include/net/vhost_net.h           |  11 +
> >  include/sysemu/char.h             |  44 +++++
> >  include/sysemu/kvm.h              |  11 +
> >  kvm-all.c                         |   4
> >  kvm-stub.c                        |   1
> >  net/Makefile.objs                 |   2
> >  net/clients.h                     |   3
> >  net/hub.c                         |   1
> >  net/net.c                         |  25 ++-
> >  net/tap.c                         |  18 ++
> >  net/vhost-user.c                  | 265 +++++++++++++++++++++++++++++
> >  qapi-schema.json                  |  19 ++
> >  qemu-char.c                       | 277 +++++++++++++++++++++++++++---
> >  qemu-options.hx                   |  18 ++
> >  stubs/Makefile.objs               |   8 +
> >  stubs/bdrv-commit-all.c           |   7 +
> >  stubs/chr-msmouse.c               |   7 +
> >  stubs/get-next-serial.c           |   3
> >  stubs/is-daemonized.c             |   7 +
> >  stubs/machine-init-done.c         |   6 +
> >  stubs/monitor-init.c              |   6 +
> >  stubs/notify-event.c              |   6 +
> >  stubs/vc-init.c                   |   7 +
> >  tests/Makefile                    |   4
> >  tests/vhost-user-test.c           | 312 ++++++++++++++++++++++++++++++++++
> >  37 files changed, 2011 insertions(+), 193 deletions(-)
> >  create mode 100644 docs/specs/vhost-user.txt
> >  create mode 100644 hw/virtio/vhost-backend.c
> >  create mode 100644 hw/virtio/vhost-user.c
> >  create mode 100644 include/hw/virtio/vhost-backend.h
> >  create mode 100644 include/net/vhost-user.h
> >  create mode 100644 net/vhost-user.c
> >  create mode 100644 stubs/bdrv-commit-all.c
> >  create mode 100644 stubs/chr-msmouse.c
> >  create mode 100644 stubs/get-next-serial.c
> >  create mode 100644 stubs/is-daemonized.c
> >  create mode 100644 stubs/machine-init-done.c
> >  create mode 100644 stubs/monitor-init.c
> >  create mode 100644 stubs/notify-event.c
> >  create mode 100644 stubs/vc-init.c
> >  create mode 100644 tests/vhost-user-test.c

regards,
Nikolay Nikolaev