On 2012-09-17 10:07, Gilles Chanteperdrix wrote: > On 09/17/2012 09:43 AM, Jan Kiszka wrote: >> On 2012-09-17 08:30, Gilles Chanteperdrix wrote: >>> >>> Hi, >>> >>> looking at x86 latencies, I found that what was taking long on my atom >>> was masking the fasteoi interrupts at IO-APIC level. So, I experimented >>> an idea: masking at LAPIC level instead of IO-APIC, by using the "task >>> priority" register. This seems to improve latencies on my atom: >>> >>> http://sisyphus.hd.free.fr/~gilles/core-3.4-latencies/atom.png >>> >>> This implies splitting the LAPIC vectors in a high priority and low >>> priority sets, the final implementation would use ipipe_enable_irqdesc >>> to detect a high priority domain, and change the vector at that time. >>> >>> This also improves the latencies on my old PIII with a VIA chipset, but >>> it generates spurious interrupts (I do not know if it really is a >>> matter, as handling a spurious interrupt is still faster than masking an >>> IO-APIC interrupt), the spurious interrupts in that case are a >>> documented behaviour of the LAPIC. >>> >>> Is there any interest in pursuing this idea, or are x86 with slow >>> IO-APIC the exception more than the rule, or having to split the vector >>> space appears too great a restriction? >> >> Line-based interrupts are legacy, of decreasing relevance for PCI >> devices - likely what we are primarily interesting in here - due to MSI. > > Even if I enable MSI, the kernel still uses these irqs for the > peripherals integrated to the chipset, such as the USB HCI, or ATA > driver (IOW, non PCI devices).
Those are all PCI as well. And modern chipsets include variants of them with MSI(-X) support. > > atom login: root > > # cat /proc/interrupts > > CPU0 CPU1 > > 0: 41 0 IO-APIC-edge timer > > 4: 39 0 IO-APIC-edge serial > > 9: 0 0 IO-APIC-fasteoi acpi > > 14: 0 0 IO-APIC-edge ata_piix > > 15: 0 0 IO-APIC-edge ata_piix > > 16: 0 0 IO-APIC-fasteoi uhci_hcd:usb5 > > 18: 0 0 IO-APIC-fasteoi uhci_hcd:usb4 > > 19: 0 0 IO-APIC-fasteoi ata_piix, uhci_hcd:usb3 > > 23: 6598 0 IO-APIC-fasteoi ehci_hcd:usb1, uhci_hcd:usb2 > > 43: 2704 0 PCI-MSI-edge eth0 > > 44: 249 0 PCI-MSI-edge snd_hda_intel > > NMI: 0 0 Non-maskable interrupts > > LOC: 661 644 Local timer interrupts > > SPU: 0 0 Spurious interrupts > > PMI: 0 0 Performance monitoring interrupts > > IWI: 0 0 IRQ work interrupts > > RTR: 0 0 APIC ICR read retries > > RES: 1582 2225 Rescheduling interrupts > > CAL: 26 48 Function call interrupts > > TLB: 10 19 TLB shootdowns > > ERR: 0 > > MIS: 0 > > > I do not think peripherals integrated to chipsets can really be > considered "legacy". And they tend to be used in the field... The good news is that, even on your low-end atom, you can avoid those latencies by CPU assignment, i.e. isolating the Linux IRQ load on one core and the RT on the other. That's getting easier and easier due to the inflation of cores. > >> So I tend to say "don't worry", specifically as fiddling with vector >> allocations will require yet another round of invasive changes to the >> IRQ subsystem of Linux. > > The changes would be minimally invasive, we would reuse the functions > already existing (clear_irq_vector and assign_irq_vector). > You will have to rearrange vector assignment and mask those vectors on all CPUs, possibly complicated my affinity changes. That's worrying me as well. But I'm also open for discussing a prototype. Jan -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 259 bytes Desc: OpenPGP digital signature URL: <http://www.xenomai.org/pipermail/xenomai/attachments/20120917/8d32f70e/attachment.pgp> _______________________________________________ Xenomai mailing list Xenomai@xenomai.org http://www.xenomai.org/mailman/listinfo/xenomai