Re: Problems with irq mapping in qemu v5.2

BALATON Zoltan via Mon, 28 Dec 2020 13:18:46 -0800

On Mon, 28 Dec 2020, Mark Cave-Ayland wrote:

On 24/12/2020 08:11, BALATON Zoltan via wrote:
On Wed, 23 Dec 2020, Guenter Roeck wrote:
On Thu, Dec 24, 2020 at 02:34:07AM +0100, BALATON Zoltan wrote:
[ ... ]
If we need legacy mode then we may be able to emulate that by setting BARs
to legacy ports ignoring what values are written to them if legacy mode
config is set (which may be what the real chip does) and we already have
IRQs hard wired to legacy values so that would give us legacy and
half-native mode which is enough for both fuloong2e and pegasos2 but I'm
not sure how we can fix BARs in QEMU because that's also handled by generic
PCI code which I also don't want to break.
The code below works for booting Linux while at the same time not affecting
any other emulation. I don't claim it to be a perfect fix, and overloading
the existing property is a bit hackish, but it does work.
Yes, maybe combining it with my original patch 1 to change secondary to flags to make it a bit cleaner would work for me. Then we would either only emulate legacy or half-native mode which is sufficient for these two machines we have. If Mark or others do not object to it this time, I can update my patch and resubmit with this one to fix this issue, otherwise let's wait to see what ideas they have because I hate to spend time on something only for it to be discarded again. I think we don't need more complete emulation of this chip than this for now but if somebody wants to attempt that I don't mind as long as it does not break pegasos2.
I had a play with your patches this afternoon, and spent some time performing some experiments and also reading various PCI bus master specifications and datasheets: this helped me understand a lot more about the theory of IRQ routing and compatible vs. legacy mode.
From reading all the documentation (including the VIA and other datasheets) I cannot find any reference to a half-native mode which makes me think

The half-native mode is my simpler term for Linux's "non 100% native mode". This may not exist in hardware but exists as a concept in some Linux (and maybe other) drivers so emulating it just means we do what these drivers expect to work correctly.

How this maps to hardware and what interactions are there with firmware may be interesting but I'm not interested to find out as long as all guests we care about work because adding more complexity just for the sake of correctly modeling hardware seems like a waste of time in this case. Thanks for taking the time to find and document these though, it may be useful if someone wants to clean this up further. I'm satisfied with getting it in good enough shape for fuloong2e and pegasos2 to boot the guests we want, because I'd rather spend time on other, more interesting stuff such as writing replacement firmware for pegasos2 to avoid needing an undistributable ROM, implementing missing sound support, improving ati-vga or getting the Mac ROM work with g3beige, and also FPU emulation on PPC (and these are just the QEMU related stuff, I can think of others too). All of those seem time better spent than beating this via-ide model further now just for the sake of perfection without any gain, because guests will not work better even after spending more time with this. That's why I call it a waste of time. I know you prefer perfect patches but as they say "Perfect is the enemy of good." (I could think of better use of your time too such as finishing your screamer patches or improving OpenBIOS or your original sparc interest but that's for you to decide what you do.)

I also try to improve these models and add missing stuff as needed but my goal is not perfection because I don't have that much time, just reaching good enough. It can always be improved later (or corrected if it turns out to be needed as in this case) but if we always hold back until getting it perfect we won't get anywhere. If your level of perfection was a requirement in QEMU a lot of devices would not be there as they could not get in in the first place which means other people cannot improve it as there's nothing there to start with. So I think something that is good enough is at least a good start towards perfection.

We can argue what level is good enough. I think it is good enough if it makes guests work, which seems to be the general approach in QEMU as a lot of devices don't actually model real hardware correctly but just well enough that guests run with them. Of course we should make it clean and follow hardware where possible but a lot of models don't do that (maybe actually very few are anywhere near perfect).

something else is wrong here. At the simplest level it could simply be that the VIA doesn't tri-state its legacy IRQ lines whilst the device is in native mode (the SI controller has an option for this), or it could indicate there is a PCI IRQ routing problem somewhere else that hasn't been picked up yet.
All of the datasheets suggest that legacy vs. native mode is selected by setting the correct bits in PCI_CLASS_PROG, and Linux reads this byte and configures itself to use legacy or native mode accordingly. Since the current default for the VIA is 0x8a then it should default to legacy mode, but we're immediately hitting some issues here: I've summarised my notes below for those interested.
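To make the 0x8a value concrete, here is a minimal sketch that decodes the programming-interface byte using the usual PCI IDE bit layout (bit 0/2: channel currently in native mode, bit 1/3: channel mode is switchable, bit 7: bus master capable). The macro and function names are made up for illustration and are not taken from QEMU or Linux.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Illustrative names only; bit positions follow the PCI IDE prog-if layout. */
#define TOY_PROG_IF_PRI_NATIVE  0x01  /* primary channel is in native mode   */
#define TOY_PROG_IF_PRI_SWITCH  0x02  /* primary channel mode can be changed */
#define TOY_PROG_IF_SEC_NATIVE  0x04  /* secondary channel is in native mode */
#define TOY_PROG_IF_SEC_SWITCH  0x08  /* secondary channel mode can be changed */
#define TOY_PROG_IF_BUS_MASTER  0x80  /* controller supports bus master DMA  */

/* channel: 0 = primary, 1 = secondary */
static inline bool toy_ide_channel_is_native(uint8_t prog_if, int channel)
{
    assert(channel == 0 || channel == 1);
    return (prog_if & (channel == 0 ? TOY_PROG_IF_PRI_NATIVE
                                    : TOY_PROG_IF_SEC_NATIVE)) != 0;
}

static inline bool toy_ide_channel_is_switchable(uint8_t prog_if, int channel)
{
    assert(channel == 0 || channel == 1);
    return (prog_if & (channel == 0 ? TOY_PROG_IF_PRI_SWITCH
                                    : TOY_PROG_IF_SEC_SWITCH)) != 0;
}

static inline bool toy_ide_is_bus_master(uint8_t prog_if)
{
    return (prog_if & TOY_PROG_IF_BUS_MASTER) != 0;
}
```

With prog_if = 0x8a both native-mode bits are clear, so both channels decode as legacy (compatibility) mode, while both are marked switchable and the bus master bit is set.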
1) PCI bus reset loses the default BAR addresses
The first problem we find is that the initialisation of the PCI bus erases the default BAR addresses: that's to say lines 133-137 in hw/ide/via.c will in effect do nothing:
133     pci_set_long(pci_conf + PCI_BASE_ADDRESS_0, 0x000001f0);
134     pci_set_long(pci_conf + PCI_BASE_ADDRESS_1, 0x000003f4);
135     pci_set_long(pci_conf + PCI_BASE_ADDRESS_2, 0x00000170);
136     pci_set_long(pci_conf + PCI_BASE_ADDRESS_3, 0x00000374);
137     pci_set_long(pci_conf + PCI_BASE_ADDRESS_4, 0x0000cc01); /* BMIBA: 20-23h */
The lifecycle of the VIA IDE device goes like this: init() -> realize() -> reset() but then the PCI bus reset in pci_do_device_reset() immediately wipes the BAR addresses. This is why the legacy IDE ports currently don't appear at startup. Note I do see that other devices do try this e.g. gt64120_pci_realize() so it's an easy mistake to make.
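The ordering problem described above can be shown with a toy model. This is not QEMU code: ToyPciDev and the toy_* functions are hypothetical, and the toy bus reset simply zeroes the BAR registers, which is a simplification of whatever pci_do_device_reset() really leaves there. The point is only that defaults written by the device's own reset do not survive a bus-level reset that runs afterwards.

```c
#include <stdint.h>
#include <string.h>

#define TOY_CFG_SIZE  256
#define TOY_BAR0_OFF  0x10  /* offset of PCI_BASE_ADDRESS_0 */
#define TOY_NUM_BARS  6

typedef struct {
    uint8_t config[TOY_CFG_SIZE];
} ToyPciDev;

/* Store/load a 32-bit little-endian value in config space. */
static inline void toy_set_long(uint8_t *p, uint32_t v)
{
    for (int i = 0; i < 4; i++) {
        p[i] = (uint8_t)(v >> (8 * i));
    }
}

static inline uint32_t toy_get_long(const uint8_t *p)
{
    uint32_t v = 0;
    for (int i = 0; i < 4; i++) {
        v |= (uint32_t)p[i] << (8 * i);
    }
    return v;
}

/* Device-level reset: writes the same defaults as the quoted lines 133-137. */
static inline void toy_via_ide_reset(ToyPciDev *d)
{
    toy_set_long(d->config + TOY_BAR0_OFF + 0x00, 0x000001f0);
    toy_set_long(d->config + TOY_BAR0_OFF + 0x04, 0x000003f4);
    toy_set_long(d->config + TOY_BAR0_OFF + 0x08, 0x00000170);
    toy_set_long(d->config + TOY_BAR0_OFF + 0x0c, 0x00000374);
    toy_set_long(d->config + TOY_BAR0_OFF + 0x10, 0x0000cc01);
}

/* Bus-level reset (simplified): runs the device reset, then clears all BARs. */
static inline void toy_pci_bus_reset(ToyPciDev *d)
{
    toy_via_ide_reset(d);
    memset(d->config + TOY_BAR0_OFF, 0, TOY_NUM_BARS * 4);
}
```

After toy_pci_bus_reset() every BAR reads back as 0 even though toy_via_ide_reset() set the legacy port values a moment earlier.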

This is from the original commit 10 years ago so I think QEMU may have worked differently back then and possibly this worked and was just left there because nobody noticed until now. I did notice PCI config values are reset when starting to work on this and on your suggestion fixed the problem for that one register in PCI reset code that I had worked around first in this model.

2) -kernel doesn't initialise the VIA device
If you take a look at the PMON source it is possible to see that the firmware explicitly sets the PCI_CLASS_PROG to compatibility mode and disables the native PCI interrupt (https://github.com/loongson-community/pmon-2ef/blob/master/sys/dev/pci/vt82c686.c#L82).
Since Linux reads this byte on startup then this is why the kernel switches to compatibility mode by default. However the point here is that booting a kernel directly without firmware means the VIA IDE device isn't initialised as it would be in real life, and that's why there are attempts to pre-configure the device accordingly in via_ide_realize()/via_ide_reset().

Isn't this worked around by setting the mode to legacy at start up? Maybe you could emulate firmware in load_kernel() but I leave that exercise to somebody who is interested in running Linux on fuloong2e.

3) QEMU doesn't (easily) enable a BAR to be disabled
The ideal situation would be for QEMU's VIA IDE device to check PCI_CLASS_PROG and configure itself dynamically: with PCI_CLASS_PROG set for legacy mode by default, the device can disable its BARs until they are explicitly enabled.
According to the PCI bus master specification the recommended behaviour for a device in compatible mode is to ignore all writes to the BARs, and for all BAR reads to return 0. This fits nicely with Guenter's finding that the BMDMA BAR should not return a value in order for Linux to boot correctly in legacy mode.
Unfortunately there is no existing functionality for this in QEMU which means you would have to do this manually by overriding the PCI config read/write functions. This is trickier than it sounds because the reads/writes don't necessarily have to be aligned to the BAR addresses in configuration space.
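The alignment problem mentioned above is easier to see in a sketch. The following toy accessors (hypothetical names, operating on a plain byte array rather than on QEMU's PCIDevice) mask BAR bytes one at a time, so an access that straddles the edge of the BAR area is handled correctly. It assumes that BAR0 through BAR4 (offsets 0x10 to 0x23) are the registers to hide in compatible mode; which BARs a real chip hides is not established in this thread.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define TOY_CFG_SIZE    256
#define TOY_BAR_START   0x10  /* first byte of PCI_BASE_ADDRESS_0          */
#define TOY_BAR_END     0x24  /* one past last byte of PCI_BASE_ADDRESS_4  */

static inline bool toy_is_hidden_bar_byte(uint32_t addr, bool compat)
{
    return compat && addr >= TOY_BAR_START && addr < TOY_BAR_END;
}

/* Little-endian read of len bytes; hidden BAR bytes read as 0. */
static inline uint32_t toy_config_read(const uint8_t *config, uint32_t addr,
                                       int len, bool compat)
{
    uint32_t val = 0;

    assert(len == 1 || len == 2 || len == 4);
    assert(addr + (uint32_t)len <= TOY_CFG_SIZE);
    for (int i = 0; i < len; i++) {
        uint8_t b = config[addr + i];
        if (toy_is_hidden_bar_byte(addr + i, compat)) {
            b = 0;
        }
        val |= (uint32_t)b << (8 * i);
    }
    return val;
}

/* Little-endian write of len bytes; writes to hidden BAR bytes are dropped. */
static inline void toy_config_write(uint8_t *config, uint32_t addr,
                                    uint32_t val, int len, bool compat)
{
    assert(len == 1 || len == 2 || len == 4);
    assert(addr + (uint32_t)len <= TOY_CFG_SIZE);
    for (int i = 0; i < len; i++) {
        if (!toy_is_hidden_bar_byte(addr + i, compat)) {
            config[addr + i] = (uint8_t)(val >> (8 * i));
        }
    }
}
```

For example, a 4-byte access at offset 0x22 covers the last two bytes of BAR4 and the first two bytes of the next register: in compatible mode only the latter two are read or written. A real device model would also have to honour read-only bits and remap regions, which this sketch ignores.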

I did go through this too when I prepared my original patches and got to the same conclusion.

In summary whilst I'm not keen on the series in its current form, it seems the best solution for now. I've got a few comments on the latest version of the series which I will send along shortly.

Glad you've got to this at last. It would probably have saved some time if you had accepted it back in March but that's gone now.


Regards,
BALATON Zoltan
