On Thu, Oct 17, 2019 at 03:01:46PM +0200, Kevin Wolf wrote: > This series adds a new tool 'qemu-storage-daemon', which can be used to > export and perform operations on block devices. There is some overlap > between qemu-img/qemu-nbd and the new qemu-storage-daemon, but there are > a few important differences: > > * The qemu-storage-daemon has QMP support. The command set is obviously > restricted compared to the system emulator because there is no guest, > but all of the block operations are present. > > This means that it can access advanced options or operations that the > qemu-img command line doesn't expose. For example, blockdev-create is > a lot more powerful than 'qemu-img create', and qemu-storage-daemon > allows to execute it without starting a guest. > > Compared to qemu-nbd it means that, for example, block jobs can now be > executed on the server side, and backing chains shared by multiple VMs > can be modified this way. > > * The existing tools all have a separately invented one-off syntax for > the job at hand, which usually comes with restrictions compared to the > system emulator. qemu-storage-daemon shares the same syntax with the > system emulator for most options and prefers QAPI based interfaces > where possible (such as --blockdev), so it should be easy to make use > of in libvirt. > > * While this series implements only NBD exports, the storage daemon is > intended to serve multiple protocols and its syntax reflects this. In > the past, we had proposals to add new one-off tools for exporting over > new protocols like FUSE or TCMU. > > With a generic storage daemon, additional export methods have a home > without adding a new tool for each of them. > > I'm posting this as an RFC mainly for two reasons: > > 1. The monitor integration, which could be argued to be a little hackish > (some generated QAPI source files are built from a separate QAPI > schema, but the per-module ones are taken from the system emulator) > and Markus will want to have a closer look there. But from the IRC > discussions we had, we seem to agree on the general approach here. > > 2. I'm not completely sure if the command line syntax is the final > version that we want to support long-term. Many options directly use > QAPI visitors (--blockdev, --export, --nbd-server) and should be > fine. However, others use QemuOpts (--chardev, --monitor, --object). > > This is the same as in the system emulator, so we wouldn't be adding > a new problem, but as there was talk about QAPIfying the command > line, and I wouldn't want later syntax changes or adding lots of > compatibility code to a new tool, I thought we should probably > discuss whether QAPIfying from the start would be an option.
I think that following what the QEMU emulators currently do for CLI args should be an explicit anti-goal, because we know that it is a long standing source of pain. Fixing it in the emulator binaries is hard due to backward compatibility, but for this new binary we have a clean slate. This feels like a good opportunity to implement & demonstrate what we think QEMU configuration ought to look like. Work done for this in the qemu-storage-daemon may well help us understand how we'll be able to move the QEMU emulators into a new scheme later. My personal wish would be to have no use of QemuOpts at all. Use GOptionContext *only* for parsing command line arguments related to execution of the daemon - ie things like --help, --version, --daemon, --pid-file. The use a "--config /path/to/json/file" arg to point to the config file for everything else using QAPI schema to describe it fully. When loading the config file, things should be created in order in which they are specified. ie don't try and group things, otherwise we end up back with the horrific hacks for objects where some are created early & some late. For an ambitious stretch goal, I think we should seriously consider whether our use of chardevs is appropriate in all cases that exist, and whether we can avoid the trap of over-using chardev in the new storage daemon since it is a clean slate in terms of user facing CLI config. chardevs are designed for & reasonably well suited to attaching to devices like serial ports, parallel ports, etc. You have a 1:1 remote:local peer relationship. The transport is a dumb byte stream, nothing special needed on top & the user can cope with any type of chardev. Many cases where we've used chardevs as a backend in QEMU are a poor fit. We just used chardevs as an "easy" way to configure a UNIX or TCP socket from the CLI, and don't care about, nor work with, any othuer chardev backends. As a result of this misuse we've had to put in an increasing number of hacks in the chardev code to deal with fact that callers want to know about & use socket semantics. eg FD passing, the chardev reconnection polling code. The monitor is a prime example of a bad fit - it would be better suited by simply referencing a SocketAddress QAPI type, instead of having the chardev indirection. It would then directly use the QIOChannelSocket APIs and avoid the inconvenient chardev abstractions which are a source of complexity & instability for no net gain. vhostuser is another prime example, responsible for much of the complexity & bugs recently added to chardevs to expose socket semantics This is a long winded way of saying that we should consider what syntax we expose for the monitor socket configuration with the new daemon. Even if the internal code still uses a chardev for the forseeable future, we have the option to hide this from the user facing configuration. Let the user specify a SocketAddress, which we use to secretly instantiate a chardev. Eventually we can convert the monitor code to stop using a chardev internally too, with a suitable deprecation period for main QEMU binarijes. Regards, Daniel -- |: https://berrange.com -o- https://www.flickr.com/photos/dberrange :| |: https://libvirt.org -o- https://fstop138.berrange.com :| |: https://entangle-photo.org -o- https://www.instagram.com/dberrange :|