On Fri, Sep 20, 2013 at 10:15:30AM -0400, Laine Stump wrote:
> On 09/20/2013 09:33 AM, Daniel P. Berrange wrote:
> > On Fri, Sep 20, 2013 at 06:25:10AM -0400, Laine Stump wrote:
> >> This resolves https://bugzilla.redhat.com/show_bug.cgi?id=1008903
> >>
> >> The Q35 machinetype has an implicit SATA controller at 00:1F.2 which
> >> isn't given the "expected" id of ahci0 by qemu when it's created. The
> >> original suggested solution to this problem was to not specify any
> >> controller for the disks that use the default controller and just
> >> specify "unit=n" instead; qemu should then use the first ide or sata
> >> controller for the disk.
> >>
> >> Unfortunately, this "solution" is ignorant of the fact that in the
> >> case of SATA disks, the "unit" attribute in the disk XML is actually
> >> *not* being used for the unit, but is instead used to specify the
> >> "bus" number; each SATA controller has 6 buses, and each bus only
> >> allows a single unit. This makes it nonsensical to specify unit='n'
> >> where n is anything other than 0. It also means that the only way to
> >> connect more than a single device to the implicit SATA controller is
> >> to explicitly give the bus names, which happen to be "ide.$n", where
> >> $n can be replaced by the disk's "unit" number.
> >> ---
> >>  src/qemu/qemu_command.c                                | 15 
> >> ++++++---------
> >>  tests/qemuxml2argvdata/qemuxml2argv-pcihole64-q35.args |  2 +-
> >>  tests/qemuxml2argvdata/qemuxml2argv-q35.args           |  2 +-
> >>  3 files changed, 8 insertions(+), 11 deletions(-)
> >>
> >> diff --git a/src/qemu/qemu_command.c b/src/qemu/qemu_command.c
> >> index 4628dac..e6239c9 100644
> >> --- a/src/qemu/qemu_command.c
> >> +++ b/src/qemu/qemu_command.c
> >> @@ -4383,18 +4383,15 @@ qemuBuildDriveDevStr(virDomainDefPtr def,
> >>          if (qemuDomainMachineIsQ35(def) &&
> >>              disk->info.addr.drive.controller == 0) {
> >>              /* Q35 machines have an implicit ahci (sata) controller at
> >> -             * 00:1F.2 which has no "id" associated with it. For this
> >> -             * reason, we can't refer to it as "ahci0". Instead, we
> >> -             * don't give an id, which qemu interprets as "use the
> >> -             * first ahci controller". We then need to specify the
> >> -             * unit with "unit=%d" rather than adding it onto the bus
> >> -             * arg.
> >> +             * 00:1F.2 which for some reason is hardcoded with the id
> >> +             * "ide" instead of the seemingly more reasonable "ahci0"
> >> +             * or "sata0".
> >>               */
> >> -            virBufferAsprintf(&opt, ",unit=%d", 
> >> disk->info.addr.drive.unit);
> >> +            virBufferAsprintf(&opt, ",bus=ide.%d", 
> >> disk->info.addr.drive.unit);
> > This isn't right - you're now using the 'unit' number to refer to a bus.
> >
> > If the built-in controller only supports 1 unit and 6 buses, then the
> > XML should be using one of
> >
> >    <address type=disk controller=0 bus=0 unit=0/>
> >    <address type=disk controller=0 bus=1 unit=0/>
> >    <address type=disk controller=0 bus=2 unit=0/>
> >    <address type=disk controller=0 bus=3 unit=0/>
> >    <address type=disk controller=0 bus=4 unit=0/>
> >    <address type=disk controller=0 bus=5 unit=0/>
> 
> Right. We were just discussing this on IRC.
> 
> >From what I understand right now (thanks to what kraxel pointed out in
> the comments of Bug 1008903, and confirmed with some simple
> experiments), it's not just the builtin controller that only supports 1
> unit and 6 buses - the way we use bus and unit for SATA controllers has
> been messed up since the SATA controller support was first added to
> libvirt in 2011. So this patch is perpetuating an existing mistake.
> 
> Here is what I understand as of right now; someone with better knowledge
> please correct me if I'm wrong (in particular, I want to make sure that
> 1 and 2 are true for *every* SATA controller, not just the one
> integrated in the Q35 chipset):
> 
> 1) Every SATA controller has 6 buses, and each bus allows a single
> target (or "unit").
> 
> 2) The way to specify a particular bus for any qemu SATA controller is
> "$idname.$busnumber". So for example, if we've given a controller the
> name "ahci3" when we created it (that's what we would name a controller
> with index='3'), then the bus names recognized by qemu would be:
> 
>     ahci3.0
>     ahci3.1
>     ...
>     ahci3.5
> 
> 3) When libvirt assigns a SATA address to a disk it always assigns
> bus='0', and unit='(0..5)'
> 
> 4) When libvirt generates the commandline, it always gives qemu the
> "bus" arg as "ahci$controller.$unit", ignoring the bus attribute in the
> config, and not specifying a "unit=n" in the qemu commandline (which is
> good, because if unit is non-0, you get an error).
> 
> So every address that libvirt has assigned to a SATA drive between 2011
> and now has been wrong (except for the ones with both bus=0 and unit=0).
> 
> Fortunately, we can fix this in new configs and still retain forward
> compatibility of pre-existing configs. What we need to do is:
> 
> 1) change the address auto-assign code to now assign bus = 0.5 and
> always force unit to 0
> 
> 2) modify the commandline generator code to use
> "ahci$controller.$bus-or-unit-whichever-is-non-0"
> 
> 3) if both are non-zero, when defining a domain or generating the
> commandline, that is an error.
> 
> This will leave one problem  - what to do if someone migrates a guest
> with the new non-0 unit# to a host that's still running old libvirt that
> ignores unit#. I don't see how to get around this problem without having
> knowledge of the source and destination libvirt versions when formatting
> the XML to send across, but that is something we've avoided like the
> plague. So what do we do for new->old migration?
> 
> (or are we just stuck with the current state of unit being interpreted
> as bus for sata addresses? :-/)

Actually based on Gerd's followup, I don't think we need todo anything
now. Your first patch in this thread fixed the only problem we had.
Since SATA has no concept of a bus, we only need use the controller and
unit attributes of the address, and reject any bus != 0.

What was causing me confusion is that QEMU's terminology for bus isn't
quite the same as ours, nor the same as real hardwares'.

Daniel
-- 
|: http://berrange.com      -o-    http://www.flickr.com/photos/dberrange/ :|
|: http://libvirt.org              -o-             http://virt-manager.org :|
|: http://autobuild.org       -o-         http://search.cpan.org/~danberr/ :|
|: http://entangle-photo.org       -o-       http://live.gnome.org/gtk-vnc :|

--
libvir-list mailing list
libvir-list@redhat.com
https://www.redhat.com/mailman/listinfo/libvir-list

Reply via email to