On Mon, 2014-04-28 at 20:19 +0100, Will Deacon wrote: > Hi Alex, > > On Mon, Apr 28, 2014 at 05:43:41PM +0100, Alex Williamson wrote: > > On Mon, 2014-04-28 at 17:52 +0200, Antonios Motakis wrote: > > > This allows to make use of the VFIO_IOMMU_TYPE1 driver with platform > > > devices on ARM in addition to PCI. This is required in order to use the > > > Exynos SMMU, or ARM SMMU driver with VFIO_IOMMU_TYPE1. > > [...] > > > > @@ -721,13 +722,15 @@ static int vfio_iommu_type1_attach_group(void > > > *iommu_data, > > > INIT_LIST_HEAD(&domain->group_list); > > > list_add(&group->next, &domain->group_list); > > > > > > - if (!allow_unsafe_interrupts && > > > +#ifdef CONFIG_PCI > > > + if (bus == &pci_bus_type && !allow_unsafe_interrupts && > > > !iommu_domain_has_cap(domain->domain, IOMMU_CAP_INTR_REMAP)) { > > > pr_warn("%s: No interrupt remapping support. Use the module > > > param \"allow_unsafe_interrupts\" to enable VFIO IOMMU support on this > > > platform\n", > > > __func__); > > > ret = -EPERM; > > > goto out_detach; > > > } > > > +#endif > > > > > > if (iommu_domain_has_cap(domain->domain, IOMMU_CAP_CACHE_COHERENCY)) > > > domain->prot |= IOMMU_CACHE; > > > > This is not a PCI specific requirement. Anything that can support MSI > > needs an IOMMU that can provide isolation for both DMA and interrupts. > > I think the IOMMU should still be telling us that it has this feature. > > Please excuse any ignorance on part here (I'm not at all familiar with the > Intel IOMMU), but shouldn't this really be a property of the interrupt > controller itself? On ARM with GICv3, there is a separate block called the > ITS (interrupt translation service) which is part of the interrupt > controller. The ITS provides a doorbell page which the SMMU can map into a > guest operating system to provide MSI for passthrough devices, but this > isn't something the SMMU is aware of -- it will just see the iommu_map > request for a non-cacheable mapping.
Hi Will, I don't know the history of why this is an IOMMU domain capability on x86, it's sort of a paradox. An MSI from a device is conceptually just a DMA write and is therefore logically co-located in the IOMMU hardware, but x86 doesn't allow it to be mapped via the IOMMU API interfaces. For compatibility, interrupt remapping support is buried deep in the request_irq interface and effectively invisible other than having this path to query it. Therefore this flag is effectively just saying "MSI isolation support is present and enabled". IOW, the host is protected from interrupt injection attacks from malicious devices. If there is some property of your platform that makes this always the case, then the IOMMU driver can always export this capability as true. With PCI, MSI is configured via spec defined configuration space registers, so we emulate these registers and prevent user access to them so that we don't need to allow the user a way to setup an interrupt remapping entry. It's done for them via request_irq. IIRC, the Freescale devices have a limited number of MSI pages and can therefore create some instances with isolation while others may require sharing. In that case I would expect this flag to indicate whether the domain has an exclusive or shared page. In any case, I suspect keying on the bus_type here is not the correct way to go. Thanks, Alex -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/