Presently, the default domain of a group is allocated during boot time
(i.e. when a device is being added to a group) and it cannot be changed
later. So, the device would typically be either in identity/pass_through
mode (controlled by "iommu=pt" kernel command line argument) or the device
would be in DMA mode (as long as the machine is up and running). There is
no way to change the default domain type dynamically i.e. after booting, a
device cannot switch between identity mode and DMA mode.

But, assume a use case where in there is an SR-IOV device and if the
privileged user decides to use some VF's natively (i.e. they are available
only to host) and he wants them to be high performing and also believes
that his OS is secure enough. In this scenario, some VF's should bypass
IOMMU. Presently, this is not supported and hence it will be helpful if
there is some way to change the default domain of a B:D.F dynamically.
Since, linux iommu subsystem prefers to deal at group level instead of
B:D.F level, it might be helpful if there is some way to change the default
domain of a *group* dynamically. Hence, add such support.

A privileged user could request the kernel to change the default domain
type of a group by writing to "/sys/kernel/iommu_groups/<grp_id>/type"
file. Presently, only two values are supported "identity" and "dma".
"identity" means that all the DMA transactions from the devices in this
group are *not* translated by the iommu and where as "dma" means that all
the DMA transactions from the devices in this group are translated by the
iommu. Also please note that a group type could be modified only when *all*
the devices in the group are not binded to any device driver.

Please see "Documentation/ABI/testing/sysfs-kernel-iommu_groups" for more
information.

Cc: Christoph Hellwig <h...@lst.de>
Cc: Joerg Roedel <j...@8bytes.org>
Cc: Ashok Raj <ashok....@intel.com>
Cc: Will Deacon <will.dea...@arm.com>
Cc: Lu Baolu <baolu...@linux.intel.com>
Cc: Sohil Mehta <sohil.me...@intel.com>
Cc: Robin Murphy <robin.mur...@arm.com>
Cc: Jacob Pan <jacob.jun....@linux.intel.com>
Signed-off-by: Sai Praneeth Prakhya <sai.praneeth.prak...@intel.com>
---
 drivers/iommu/iommu.c | 178 +++++++++++++++++++++++++++++++++++++++++-
 1 file changed, 177 insertions(+), 1 deletion(-)

diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
index 0c674d80c37f..92fadbea36b1 100644
--- a/drivers/iommu/iommu.c
+++ b/drivers/iommu/iommu.c
@@ -155,6 +155,8 @@ static int __iommu_attach_group(struct iommu_domain *domain,
                                struct iommu_group *group);
 static void __iommu_detach_group(struct iommu_domain *domain,
                                 struct iommu_group *group);
+static ssize_t iommu_group_store_type(struct iommu_group *group,
+                                     const char *buf, size_t count);
 
 static int __init iommu_set_def_domain_type(char *str)
 {
@@ -376,7 +378,8 @@ static IOMMU_GROUP_ATTR(name, S_IRUGO, 
iommu_group_show_name, NULL);
 static IOMMU_GROUP_ATTR(reserved_regions, 0444,
                        iommu_group_show_resv_regions, NULL);
 
-static IOMMU_GROUP_ATTR(type, 0444, iommu_group_show_type, NULL);
+static IOMMU_GROUP_ATTR(type, 0644, iommu_group_show_type,
+                       iommu_group_store_type);
 
 static void iommu_group_release(struct kobject *kobj)
 {
@@ -2468,3 +2471,176 @@ int iommu_sva_get_pasid(struct iommu_sva *handle)
        return ops->sva_get_pasid(handle);
 }
 EXPORT_SYMBOL_GPL(iommu_sva_get_pasid);
+
+/*
+ * Changes the default domain of a group. This function is heavily inspired 
from
+ * request_default_domain_for_dev() and couldn't re-use the same because:
+ * 1. The domain should be changed even if there are devices under this group
+ *    because the driver is already unbinded and it's safe to do so. Also, note
+ *    that *only* default_domain is being changed and hence the devices list in
+ *    the group need not be changed.
+ * 2. Unlike request_default_domain_for_dev(), a domain is allocated only once
+ *    for the whole group, where as the former allocates a domain per device.
+ *
+ * @group: The group for which the default domain should be changed
+ * @prev_domain: The previous domain that is being switched from
+ * @type: The type of the new default domain that gets associated with the 
group
+ *
+ * Returns 0 on success and error code on failure
+ *
+ * Note:
+ * 1. Presently, this function is called only when user requests to change the
+ *    group's default domain type through 
/sys/kernel/iommu_groups/<grp_id>/type
+ *    Be aware to take a closer look if intended to use for other cases.
+ * 2. Assumes group->mutex is already taken
+ */
+static int iommu_group_change_def_domain(struct iommu_group *group,
+                                        struct iommu_domain *prev_domain,
+                                        int type)
+{
+       struct group_device *grp_dev;
+       struct iommu_domain *new_domain;
+       int ret = 0;
+
+       /*
+        * iommu_domain_alloc() takes "struct bus_type" as an argument which is
+        * a member in "struct device". Changing a group's default domain type
+        * deals at iommu_group level rather than device level and hence there
+        * is no straight forward way to get "bus_type" of an iommu_group that
+        * could be passed to iommu_domain_alloc(). So, instead of directly
+        * calling iommu_domain_alloc(), use iommu_ops from previous default
+        * domain.
+        */
+       if (!prev_domain || !prev_domain->ops ||
+           !prev_domain->ops->domain_alloc)
+               return -EINVAL;
+
+       /* Allocate a new domain of requested type */
+       new_domain = prev_domain->ops->domain_alloc(type);
+       if (!new_domain) {
+               pr_err("Unable to allocate memory for the new domain\n");
+               return -ENOMEM;
+       }
+
+       new_domain->type = type;
+       new_domain->ops = prev_domain->ops;
+       new_domain->pgsize_bitmap = prev_domain->pgsize_bitmap;
+
+       /* Attach all the devices in the group to the newly created domain */
+       ret = __iommu_attach_group(new_domain, group);
+       if (ret) {
+               pr_err("Unable to attach all the devices in the group to the 
new domain\n");
+               goto free_new_domain;
+       }
+
+       /*
+        * Map reserved regions if the group's default domain is being changed
+        * from identity domain to dma domain
+        */
+       if (type == IOMMU_DOMAIN_IDENTITY)
+               goto free_prev_domain;
+
+       list_for_each_entry(grp_dev, &group->devices, list) {
+               ret = iommu_group_create_direct_mappings(group, grp_dev->dev);
+               if (ret) {
+                       dev_err(grp_dev->dev, "Failed to create direct mappings 
for reserved regions\n");
+                       goto free_new_domain;
+               }
+       }
+
+free_prev_domain:
+       /*
+        * Free the existing default domain and replace it with the newly
+        * created default domain. No need to set group->domain because
+        * __iommu_attach_group() already does it on success.
+        */
+       iommu_domain_free(prev_domain);
+       group->default_domain = new_domain;
+       return 0;
+
+free_new_domain:
+       iommu_domain_free(new_domain);
+       return ret;
+}
+
+static int is_driver_binded(struct device *dev, void *not_used)
+{
+       int ret = 0;
+
+       device_lock(dev);
+       if (device_is_bound(dev))
+               ret = 1;
+       device_unlock(dev);
+       return ret;
+}
+
+static ssize_t iommu_group_store_type(struct iommu_group *group,
+                                     const char *buf, size_t count)
+{
+       int ret, req_type, allowed_types;
+       struct iommu_domain *prev_domain;
+       struct group_device *grp_dev;
+
+       if (!capable(CAP_SYS_ADMIN) || !capable(CAP_SYS_RAWIO))
+               return -EACCES;
+
+       if (WARN_ON(!group))
+               return -EINVAL;
+
+       if (sysfs_streq(buf, "identity"))
+               req_type = IOMMU_DOMAIN_IDENTITY;
+       else if (sysfs_streq(buf, "dma"))
+               req_type = IOMMU_DOMAIN_DMA;
+       else
+               return -EINVAL;
+
+       /* Check if any device in the group still has a driver binded to it */
+       if (iommu_group_for_each_dev(group, NULL, is_driver_binded)) {
+               pr_err("Active drivers exist for devices in the group\n");
+               return -EBUSY;
+       }
+
+       mutex_lock(&group->mutex);
+       prev_domain = group->default_domain;
+       if (!prev_domain) {
+               ret = -EINVAL;
+               goto out;
+       }
+
+       /*
+        * Switch to a new domain only if the requested domain type is different
+        * from the existing default domain type
+        */
+       if (prev_domain->type == req_type) {
+               ret = count;
+               goto out;
+       }
+
+       /*
+        * Every device may not support both the domain types (namely DMA and
+        * identity), so check if it's ok to change domain type of every device
+        * in the group to the requested domain
+        */
+       if (prev_domain->ops && prev_domain->ops->device_def_domain_type) {
+               list_for_each_entry(grp_dev, &group->devices, list) {
+                       const struct iommu_ops *ops;
+                       struct device *dev;
+
+                       dev = grp_dev->dev;
+                       ops = prev_domain->ops;
+                       allowed_types = ops->device_def_domain_type(dev, false);
+                       if (allowed_types && allowed_types != req_type) {
+                               dev_err(dev, "Cannot be in %s domain\n", buf);
+                               ret = -EINVAL;
+                               goto out;
+                       }
+               }
+       }
+
+       ret = iommu_group_change_def_domain(group, prev_domain, req_type);
+       if (!ret)
+               ret = count;
+out:
+       mutex_unlock(&group->mutex);
+       return ret;
+}
-- 
2.19.1

_______________________________________________
iommu mailing list
iommu@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/iommu

Reply via email to