Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-24 Thread Joerg Roedel
On Wed, Jul 23, 2014 at 10:49:55AM -0700, Olav Haugan wrote:
 Joerg, can you comment on what you envisioned when you suggested that we
 add the fallback?
 

The problem is that we already have tons of IOMMU drivers in the tree
which don't provide these call-backs. So adding this API extension
without a fall-back that works for these drivers too would fragment the
functionality between different IOMMU drivers in an inacceptable way and
undermine the purpose of a generic API.


Joerg

___
iommu mailing list
iommu@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/iommu


Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-24 Thread Thierry Reding
On Thu, Jul 24, 2014 at 11:34:27AM +0200, Joerg Roedel wrote:
 On Wed, Jul 23, 2014 at 10:49:55AM -0700, Olav Haugan wrote:
  Joerg, can you comment on what you envisioned when you suggested that we
  add the fallback?
  
 
 The problem is that we already have tons of IOMMU drivers in the tree
 which don't provide these call-backs. So adding this API extension
 without a fall-back that works for these drivers too would fragment the
 functionality between different IOMMU drivers in an inacceptable way and
 undermine the purpose of a generic API.

But we only care about in-tree drivers anyway, so we can equally well
just point all drivers to the generic implementation in the same patch
that adds this new function. The end result will be the same, but it
will keep the core function simpler (and more consistent with the other
core functions).

Thierry


pgpGmikQGfy4A.pgp
Description: PGP signature
___
iommu mailing list
iommu@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/iommu

Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-23 Thread Olav Haugan
On 7/22/2014 12:45 AM, Thierry Reding wrote:
 On Mon, Jul 21, 2014 at 05:59:22PM -0700, Olav Haugan wrote:
 On 7/17/2014 1:21 AM, Thierry Reding wrote:
 On Wed, Jul 16, 2014 at 06:01:57PM -0700, Olav Haugan wrote:
 [...]
 Additionally, the mapping operation would be faster in general since
 clients does not have to keep calling map API over and over again for
 each physically contiguous chunk of memory that needs to be mapped to a
 virtually contiguous region.

 Signed-off-by: Olav Haugan ohau...@codeaurora.org
 ---
  drivers/iommu/iommu.c | 48 
 
  include/linux/iommu.h | 25 +
  2 files changed, 73 insertions(+)

 diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
 index 1698360..a0eebb7 100644
 --- a/drivers/iommu/iommu.c
 +++ b/drivers/iommu/iommu.c
 @@ -1089,6 +1089,54 @@ size_t iommu_unmap(struct iommu_domain *domain, 
 unsigned long iova, size_t size)
  EXPORT_SYMBOL_GPL(iommu_unmap);
  
  
 +int iommu_map_range(struct iommu_domain *domain, unsigned int iova,

 Maybe iova should be dma_addr_t? Or at least unsigned long? And perhaps
 iommu_map_sg() would be more consistent with the equivalent function in
 struct dma_ops?

 +  struct scatterlist *sg, unsigned int len, int opt)

 The length argument seems to be the size of the mapping. Again, the
 struct dma_ops function uses this argument to denote the number of
 entries in the scatterlist.

 opt is somewhat opaque. Perhaps this should be turned into unsigned long
 flags? Although given that there aren't any users yet it's difficult to
 say what's best here. Perhaps the addition of this argument should be
 postponed until there are actual users?

 I am thinking something like this:

 int iommu_map_sg(struct iommu_domain *domain, struct scatterlist *sg,
 unsigned int nents, int prot, unsigned long flags);
 int iommu_unmap_sg(struct iommu_domain *domain, struct scatterlist *sg,
 unsigned int nents, unsigned long flags);
 
 Looks good.
 
 The iova is contained within sg so we don't need that argument really
 
 I'm not sure. I think a common use-case for this function is for some
 driver to map an imported DMA-BUF. In that case you'll get a struct
 sg_table, but I think it won't have sg.dma_address set to anything
 useful. So if we don't have iova as a parameter to this function, the
 driver would have to make it a two-step process, like this:
 
   sg_dma_address(sg) = iova;
 
   err = iommu_map_sg(...);
 
 And drivers that use the IOMMU API directly need to manage IOVA space
 themselves anyway, so I think passing around the IOVA within the SG
 won't be a very common case. It will almost always be the driver that
 calls this function which allocates the IOVA range.

Yes, I see your point. Rob is not a fan either...
So what about this:

int iommu_map_sg(struct iommu_domain *domain, unsigned long iova, struct
scatterlist *sg, unsigned int nents, int prot, unsigned long flags);
int iommu_unmap_sg(struct iommu_domain *domain, unsigned long iova,
size_t size, unsigned long flags);

No need for sg in the unmap call then. Keeping iova an unsigned long to
match the existing iommu_map/iommu_unmap calls.


 and I would like to keep the flags argument. I would prefer not to
 change the API after it has been published which could potentially
 affect a lot of call sites.
 
 We have pretty good tools to help with this kind of mechanical
 conversion, so I don't think changing the API will be much of a problem.
 However it seems likely that we'll want to specify flags eventually, so
 I don't have any strong objections to keeping that parameter.
 
 +  phys_addr_t phys = page_to_phys(sg_page(sg));
 +  u32 page_len = PAGE_ALIGN(sg-offset + sg-length);

 Shouldn't this alignment be left to iommu_map() to handle? It has code
 to deal with that already.

 I don't see page alignment in the iommu_map function. I only see a check
 whether the (iova | paddr | size) is aligned to the minimum page size
 and then it errors out if it isn't
 
 Indeed, the above doesn't do what I thought it did.
 
 Perhaps rather than check for a -map_range implementation everytime a
 better option may be to export this generic implementation so that
 drivers can set it in their iommu_ops if they don't implement it? So the
 contents of the if () block could become a new function:

 int iommu_map_range_generic(...)
 {
 ...
 }
 EXPORT_SYMBOL(iommu_map_range_generic);

 And drivers would do this:

 static const struct iommu_ops driver_iommu_ops = {
 ...
 .map_range = iommu_map_range_generic,
 ...
 };

 I'd like to keep the new API consistent with the rest of the API. Most
 if not all of the other APIs always check if the operation is non-NULL.
 
 But that's because the other operations are either optional or they
 don't provide a fallback implementation. For .map_sg() the issue is
 different because 

Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-22 Thread Thierry Reding
On Mon, Jul 21, 2014 at 05:59:22PM -0700, Olav Haugan wrote:
 On 7/17/2014 1:21 AM, Thierry Reding wrote:
  On Wed, Jul 16, 2014 at 06:01:57PM -0700, Olav Haugan wrote:
[...]
   Additionally, the mapping operation would be faster in general since
   clients does not have to keep calling map API over and over again for
   each physically contiguous chunk of memory that needs to be mapped to a
   virtually contiguous region.
  
   Signed-off-by: Olav Haugan ohau...@codeaurora.org
   ---
drivers/iommu/iommu.c | 48 
   
include/linux/iommu.h | 25 +
2 files changed, 73 insertions(+)
  
   diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
   index 1698360..a0eebb7 100644
   --- a/drivers/iommu/iommu.c
   +++ b/drivers/iommu/iommu.c
   @@ -1089,6 +1089,54 @@ size_t iommu_unmap(struct iommu_domain *domain, 
   unsigned long iova, size_t size)
EXPORT_SYMBOL_GPL(iommu_unmap);


   +int iommu_map_range(struct iommu_domain *domain, unsigned int iova,
  
  Maybe iova should be dma_addr_t? Or at least unsigned long? And perhaps
  iommu_map_sg() would be more consistent with the equivalent function in
  struct dma_ops?
  
  +  struct scatterlist *sg, unsigned int len, int opt)
  
  The length argument seems to be the size of the mapping. Again, the
  struct dma_ops function uses this argument to denote the number of
  entries in the scatterlist.
  
  opt is somewhat opaque. Perhaps this should be turned into unsigned long
  flags? Although given that there aren't any users yet it's difficult to
  say what's best here. Perhaps the addition of this argument should be
  postponed until there are actual users?
 
 I am thinking something like this:
 
 int iommu_map_sg(struct iommu_domain *domain, struct scatterlist *sg,
 unsigned int nents, int prot, unsigned long flags);
 int iommu_unmap_sg(struct iommu_domain *domain, struct scatterlist *sg,
 unsigned int nents, unsigned long flags);

Looks good.

 The iova is contained within sg so we don't need that argument really

I'm not sure. I think a common use-case for this function is for some
driver to map an imported DMA-BUF. In that case you'll get a struct
sg_table, but I think it won't have sg.dma_address set to anything
useful. So if we don't have iova as a parameter to this function, the
driver would have to make it a two-step process, like this:

sg_dma_address(sg) = iova;

err = iommu_map_sg(...);

And drivers that use the IOMMU API directly need to manage IOVA space
themselves anyway, so I think passing around the IOVA within the SG
won't be a very common case. It will almost always be the driver that
calls this function which allocates the IOVA range.

 and I would like to keep the flags argument. I would prefer not to
 change the API after it has been published which could potentially
 affect a lot of call sites.

We have pretty good tools to help with this kind of mechanical
conversion, so I don't think changing the API will be much of a problem.
However it seems likely that we'll want to specify flags eventually, so
I don't have any strong objections to keeping that parameter.

  +  phys_addr_t phys = page_to_phys(sg_page(sg));
  +  u32 page_len = PAGE_ALIGN(sg-offset + sg-length);
  
  Shouldn't this alignment be left to iommu_map() to handle? It has code
  to deal with that already.
 
 I don't see page alignment in the iommu_map function. I only see a check
 whether the (iova | paddr | size) is aligned to the minimum page size
 and then it errors out if it isn't

Indeed, the above doesn't do what I thought it did.

  Perhaps rather than check for a -map_range implementation everytime a
  better option may be to export this generic implementation so that
  drivers can set it in their iommu_ops if they don't implement it? So the
  contents of the if () block could become a new function:
  
  int iommu_map_range_generic(...)
  {
  ...
  }
  EXPORT_SYMBOL(iommu_map_range_generic);
  
  And drivers would do this:
  
  static const struct iommu_ops driver_iommu_ops = {
  ...
  .map_range = iommu_map_range_generic,
  ...
  };
 
 I'd like to keep the new API consistent with the rest of the API. Most
 if not all of the other APIs always check if the operation is non-NULL.

But that's because the other operations are either optional or they
don't provide a fallback implementation. For .map_sg() the issue is
different because the core provides a slow fallback. The disadvantage
of keeping the check within iommu_map_sg() is that you have this check
every time the function is called. If on the other hand drivers
specifically set a pointer (either a custom implementation or the
wrapper around iommu_map()) you'll simply call the driver function no
matter what and don't have to special case.

It also makes it possible for drivers to opt-in to 

Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-22 Thread Rob Clark
On Mon, Jul 21, 2014 at 8:59 PM, Olav Haugan ohau...@codeaurora.org wrote:
 On 7/17/2014 1:21 AM, Thierry Reding wrote:
 On Wed, Jul 16, 2014 at 06:01:57PM -0700, Olav Haugan wrote:
 Mapping and unmapping are more often than not in the critical path.
 map_range and unmap_range allows SMMU driver implementations to optimize

 s/SMMU/IOMMU/

 the process of mapping and unmapping buffers into the SMMU page tables.

 s/SMMU/IOMMU/

 Instead of mapping one physical address, do TLB operation (expensive),
 mapping, do TLB operation, mapping, do TLB operation the driver can map
 a scatter-gatherlist of physically contiguous pages into one virtual
 address space and then at the end do one TLB operation.

 I find the above hard to read. Maybe:

 Instead of mapping a buffer one page at a time and requiring potentially
 expensive TLB operations for each page, this function allows the driver
 to map all pages in one go and defer TLB maintenance until after all
 pages have been mapped.

 Yeah, all above is OK with me.


 Additionally, the mapping operation would be faster in general since
 clients does not have to keep calling map API over and over again for
 each physically contiguous chunk of memory that needs to be mapped to a
 virtually contiguous region.

 Signed-off-by: Olav Haugan ohau...@codeaurora.org
 ---
  drivers/iommu/iommu.c | 48 
  include/linux/iommu.h | 25 +
  2 files changed, 73 insertions(+)

 diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
 index 1698360..a0eebb7 100644
 --- a/drivers/iommu/iommu.c
 +++ b/drivers/iommu/iommu.c
 @@ -1089,6 +1089,54 @@ size_t iommu_unmap(struct iommu_domain *domain, 
 unsigned long iova, size_t size)
  EXPORT_SYMBOL_GPL(iommu_unmap);


 +int iommu_map_range(struct iommu_domain *domain, unsigned int iova,

 Maybe iova should be dma_addr_t? Or at least unsigned long? And perhaps
 iommu_map_sg() would be more consistent with the equivalent function in
 struct dma_ops?

 +struct scatterlist *sg, unsigned int len, int opt)

 The length argument seems to be the size of the mapping. Again, the
 struct dma_ops function uses this argument to denote the number of
 entries in the scatterlist.

 opt is somewhat opaque. Perhaps this should be turned into unsigned long
 flags? Although given that there aren't any users yet it's difficult to
 say what's best here. Perhaps the addition of this argument should be
 postponed until there are actual users?

 I am thinking something like this:

 int iommu_map_sg(struct iommu_domain *domain, struct scatterlist *sg,
 unsigned int nents, int prot, unsigned long flags);
 int iommu_unmap_sg(struct iommu_domain *domain, struct scatterlist *sg,
 unsigned int nents, unsigned long flags);

 The iova is contained within sg so we don't need that argument really
 and I would like to keep the flags argument. I would prefer not to
 change the API after it has been published which could potentially
 affect a lot of call sites.

ugg.. that at least forces me to construct a separate sg for mapping
the same buffer in multiple process's gpu addr space.  Not really a
fan of that.

BR,
-R

 +{
 +s32 ret = 0;

 Should be int to match the function's return type.

 +u32 offset = 0;
 +u32 start_iova = iova;

 These should match the type of iova. Also, what's the point of
 start_iova if we can simply keep iova constant and use offset where
 necessary?

 +BUG_ON(iova  (~PAGE_MASK));
 +
 +if (unlikely(domain-ops-map_range == NULL)) {
 +while (offset  len) {

 Maybe this should use for_each_sg()?

 +phys_addr_t phys = page_to_phys(sg_page(sg));
 +u32 page_len = PAGE_ALIGN(sg-offset + sg-length);

 Shouldn't this alignment be left to iommu_map() to handle? It has code
 to deal with that already.

 I don't see page alignment in the iommu_map function. I only see a check
 whether the (iova | paddr | size) is aligned to the minimum page size
 and then it errors out if it isn't


 +ret = iommu_map(domain, iova, phys, page_len, opt);

 This conflates the new opt argument with iommu_map()'s prot argument.
 Maybe those two should rather be split?

 +if (ret)
 +goto fail;
 +
 +iova += page_len;
 +offset += page_len;
 +if (offset  len)
 +sg = sg_next(sg);
 +}
 +} else {
 +ret = domain-ops-map_range(domain, iova, sg, len, opt);
 +}

 Perhaps rather than check for a -map_range implementation everytime a
 better option may be to export this generic implementation so that
 drivers can set it in their iommu_ops if they don't implement it? So the
 contents of the if () block could become a new function:

   int iommu_map_range_generic(...)
   {
   ...
   }
   EXPORT_SYMBOL(iommu_map_range_generic);


Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-21 Thread Olav Haugan
On 7/17/2014 1:21 AM, Thierry Reding wrote:
 On Wed, Jul 16, 2014 at 06:01:57PM -0700, Olav Haugan wrote:
 Mapping and unmapping are more often than not in the critical path.
 map_range and unmap_range allows SMMU driver implementations to optimize
 
 s/SMMU/IOMMU/

 the process of mapping and unmapping buffers into the SMMU page tables.
 
 s/SMMU/IOMMU/
 
 Instead of mapping one physical address, do TLB operation (expensive),
 mapping, do TLB operation, mapping, do TLB operation the driver can map
 a scatter-gatherlist of physically contiguous pages into one virtual
 address space and then at the end do one TLB operation.
 
 I find the above hard to read. Maybe:
 
 Instead of mapping a buffer one page at a time and requiring potentially
 expensive TLB operations for each page, this function allows the driver
 to map all pages in one go and defer TLB maintenance until after all
 pages have been mapped.

Yeah, all above is OK with me.

 
 Additionally, the mapping operation would be faster in general since
 clients does not have to keep calling map API over and over again for
 each physically contiguous chunk of memory that needs to be mapped to a
 virtually contiguous region.

 Signed-off-by: Olav Haugan ohau...@codeaurora.org
 ---
  drivers/iommu/iommu.c | 48 
  include/linux/iommu.h | 25 +
  2 files changed, 73 insertions(+)

 diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
 index 1698360..a0eebb7 100644
 --- a/drivers/iommu/iommu.c
 +++ b/drivers/iommu/iommu.c
 @@ -1089,6 +1089,54 @@ size_t iommu_unmap(struct iommu_domain *domain, 
 unsigned long iova, size_t size)
  EXPORT_SYMBOL_GPL(iommu_unmap);
  
  
 +int iommu_map_range(struct iommu_domain *domain, unsigned int iova,
 
 Maybe iova should be dma_addr_t? Or at least unsigned long? And perhaps
 iommu_map_sg() would be more consistent with the equivalent function in
 struct dma_ops?
 
 +struct scatterlist *sg, unsigned int len, int opt)
 
 The length argument seems to be the size of the mapping. Again, the
 struct dma_ops function uses this argument to denote the number of
 entries in the scatterlist.
 
 opt is somewhat opaque. Perhaps this should be turned into unsigned long
 flags? Although given that there aren't any users yet it's difficult to
 say what's best here. Perhaps the addition of this argument should be
 postponed until there are actual users?

I am thinking something like this:

int iommu_map_sg(struct iommu_domain *domain, struct scatterlist *sg,
unsigned int nents, int prot, unsigned long flags);
int iommu_unmap_sg(struct iommu_domain *domain, struct scatterlist *sg,
unsigned int nents, unsigned long flags);

The iova is contained within sg so we don't need that argument really
and I would like to keep the flags argument. I would prefer not to
change the API after it has been published which could potentially
affect a lot of call sites.

 +{
 +s32 ret = 0;
 
 Should be int to match the function's return type.
 
 +u32 offset = 0;
 +u32 start_iova = iova;
 
 These should match the type of iova. Also, what's the point of
 start_iova if we can simply keep iova constant and use offset where
 necessary?
 
 +BUG_ON(iova  (~PAGE_MASK));
 +
 +if (unlikely(domain-ops-map_range == NULL)) {
 +while (offset  len) {
 
 Maybe this should use for_each_sg()?
 
 +phys_addr_t phys = page_to_phys(sg_page(sg));
 +u32 page_len = PAGE_ALIGN(sg-offset + sg-length);
 
 Shouldn't this alignment be left to iommu_map() to handle? It has code
 to deal with that already.

I don't see page alignment in the iommu_map function. I only see a check
whether the (iova | paddr | size) is aligned to the minimum page size
and then it errors out if it isn't


 +ret = iommu_map(domain, iova, phys, page_len, opt);
 
 This conflates the new opt argument with iommu_map()'s prot argument.
 Maybe those two should rather be split?
 
 +if (ret)
 +goto fail;
 +
 +iova += page_len;
 +offset += page_len;
 +if (offset  len)
 +sg = sg_next(sg);
 +}
 +} else {
 +ret = domain-ops-map_range(domain, iova, sg, len, opt);
 +}
 
 Perhaps rather than check for a -map_range implementation everytime a
 better option may be to export this generic implementation so that
 drivers can set it in their iommu_ops if they don't implement it? So the
 contents of the if () block could become a new function:
 
   int iommu_map_range_generic(...)
   {
   ...
   }
   EXPORT_SYMBOL(iommu_map_range_generic);
 
 And drivers would do this:
 
   static const struct iommu_ops driver_iommu_ops = {
   ...
   .map_range = iommu_map_range_generic,
   ...
   };

I'd like to keep the new API 

Re: [PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-17 Thread Thierry Reding
On Wed, Jul 16, 2014 at 06:01:57PM -0700, Olav Haugan wrote:
 Mapping and unmapping are more often than not in the critical path.
 map_range and unmap_range allows SMMU driver implementations to optimize

s/SMMU/IOMMU/

 the process of mapping and unmapping buffers into the SMMU page tables.

s/SMMU/IOMMU/

 Instead of mapping one physical address, do TLB operation (expensive),
 mapping, do TLB operation, mapping, do TLB operation the driver can map
 a scatter-gatherlist of physically contiguous pages into one virtual
 address space and then at the end do one TLB operation.

I find the above hard to read. Maybe:

Instead of mapping a buffer one page at a time and requiring potentially
expensive TLB operations for each page, this function allows the driver
to map all pages in one go and defer TLB maintenance until after all
pages have been mapped.

?

 Additionally, the mapping operation would be faster in general since
 clients does not have to keep calling map API over and over again for
 each physically contiguous chunk of memory that needs to be mapped to a
 virtually contiguous region.
 
 Signed-off-by: Olav Haugan ohau...@codeaurora.org
 ---
  drivers/iommu/iommu.c | 48 
  include/linux/iommu.h | 25 +
  2 files changed, 73 insertions(+)
 
 diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
 index 1698360..a0eebb7 100644
 --- a/drivers/iommu/iommu.c
 +++ b/drivers/iommu/iommu.c
 @@ -1089,6 +1089,54 @@ size_t iommu_unmap(struct iommu_domain *domain, 
 unsigned long iova, size_t size)
  EXPORT_SYMBOL_GPL(iommu_unmap);
  
  
 +int iommu_map_range(struct iommu_domain *domain, unsigned int iova,

Maybe iova should be dma_addr_t? Or at least unsigned long? And perhaps
iommu_map_sg() would be more consistent with the equivalent function in
struct dma_ops?

 + struct scatterlist *sg, unsigned int len, int opt)

The length argument seems to be the size of the mapping. Again, the
struct dma_ops function uses this argument to denote the number of
entries in the scatterlist.

opt is somewhat opaque. Perhaps this should be turned into unsigned long
flags? Although given that there aren't any users yet it's difficult to
say what's best here. Perhaps the addition of this argument should be
postponed until there are actual users?

 +{
 + s32 ret = 0;

Should be int to match the function's return type.

 + u32 offset = 0;
 + u32 start_iova = iova;

These should match the type of iova. Also, what's the point of
start_iova if we can simply keep iova constant and use offset where
necessary?

 + BUG_ON(iova  (~PAGE_MASK));
 +
 + if (unlikely(domain-ops-map_range == NULL)) {
 + while (offset  len) {

Maybe this should use for_each_sg()?

 + phys_addr_t phys = page_to_phys(sg_page(sg));
 + u32 page_len = PAGE_ALIGN(sg-offset + sg-length);

Shouldn't this alignment be left to iommu_map() to handle? It has code
to deal with that already.

 + ret = iommu_map(domain, iova, phys, page_len, opt);

This conflates the new opt argument with iommu_map()'s prot argument.
Maybe those two should rather be split?

 + if (ret)
 + goto fail;
 +
 + iova += page_len;
 + offset += page_len;
 + if (offset  len)
 + sg = sg_next(sg);
 + }
 + } else {
 + ret = domain-ops-map_range(domain, iova, sg, len, opt);
 + }

Perhaps rather than check for a -map_range implementation everytime a
better option may be to export this generic implementation so that
drivers can set it in their iommu_ops if they don't implement it? So the
contents of the if () block could become a new function:

int iommu_map_range_generic(...)
{
...
}
EXPORT_SYMBOL(iommu_map_range_generic);

And drivers would do this:

static const struct iommu_ops driver_iommu_ops = {
...
.map_range = iommu_map_range_generic,
...
};

 +int iommu_unmap_range(struct iommu_domain *domain, unsigned int iova,
 +   unsigned int len, int opt)

Some comments regarding function name and argument types as for
iommu_map_range().

 +static inline int iommu_map_range(struct iommu_domain *domain,
 +   unsigned int iova, struct scatterlist *sg,
 +   unsigned int len, int opt)
 +{
 + return -ENODEV;

I know other IOMMU API dummies already use this error code, but I find
it to be a little confusing. The dummies are used when the IOMMU API is
disabled via Kconfig, so -ENOSYS (Function not implemented) seems like a
more useful error.

Thierry


pgp6JYq4GfhPI.pgp
Description: PGP signature
___
iommu mailing list

[PATCH v2 1/1] iommu-api: Add map_range/unmap_range functions

2014-07-16 Thread Olav Haugan
Mapping and unmapping are more often than not in the critical path.
map_range and unmap_range allows SMMU driver implementations to optimize
the process of mapping and unmapping buffers into the SMMU page tables.
Instead of mapping one physical address, do TLB operation (expensive),
mapping, do TLB operation, mapping, do TLB operation the driver can map
a scatter-gatherlist of physically contiguous pages into one virtual
address space and then at the end do one TLB operation.

Additionally, the mapping operation would be faster in general since
clients does not have to keep calling map API over and over again for
each physically contiguous chunk of memory that needs to be mapped to a
virtually contiguous region.

Signed-off-by: Olav Haugan ohau...@codeaurora.org
---
 drivers/iommu/iommu.c | 48 
 include/linux/iommu.h | 25 +
 2 files changed, 73 insertions(+)

diff --git a/drivers/iommu/iommu.c b/drivers/iommu/iommu.c
index 1698360..a0eebb7 100644
--- a/drivers/iommu/iommu.c
+++ b/drivers/iommu/iommu.c
@@ -1089,6 +1089,54 @@ size_t iommu_unmap(struct iommu_domain *domain, unsigned 
long iova, size_t size)
 EXPORT_SYMBOL_GPL(iommu_unmap);
 
 
+int iommu_map_range(struct iommu_domain *domain, unsigned int iova,
+   struct scatterlist *sg, unsigned int len, int opt)
+{
+   s32 ret = 0;
+   u32 offset = 0;
+   u32 start_iova = iova;
+
+   BUG_ON(iova  (~PAGE_MASK));
+
+   if (unlikely(domain-ops-map_range == NULL)) {
+   while (offset  len) {
+   phys_addr_t phys = page_to_phys(sg_page(sg));
+   u32 page_len = PAGE_ALIGN(sg-offset + sg-length);
+
+   ret = iommu_map(domain, iova, phys, page_len, opt);
+   if (ret)
+   goto fail;
+
+   iova += page_len;
+   offset += page_len;
+   if (offset  len)
+   sg = sg_next(sg);
+   }
+   } else {
+   ret = domain-ops-map_range(domain, iova, sg, len, opt);
+   }
+   goto out;
+
+fail:
+   /* undo mappings already done in case of error */
+   iommu_unmap(domain, start_iova, offset);
+out:
+   return ret;
+}
+EXPORT_SYMBOL_GPL(iommu_map_range);
+
+int iommu_unmap_range(struct iommu_domain *domain, unsigned int iova,
+ unsigned int len, int opt)
+{
+   BUG_ON(iova  (~PAGE_MASK));
+
+   if (unlikely(domain-ops-unmap_range == NULL))
+   return iommu_unmap(domain, iova, len);
+   else
+   return domain-ops-unmap_range(domain, iova, len, opt);
+}
+EXPORT_SYMBOL_GPL(iommu_unmap_range);
+
 int iommu_domain_window_enable(struct iommu_domain *domain, u32 wnd_nr,
   phys_addr_t paddr, u64 size, int prot)
 {
diff --git a/include/linux/iommu.h b/include/linux/iommu.h
index c7097d7..54c836e 100644
--- a/include/linux/iommu.h
+++ b/include/linux/iommu.h
@@ -22,6 +22,7 @@
 #include linux/errno.h
 #include linux/err.h
 #include linux/types.h
+#include linux/scatterlist.h
 #include trace/events/iommu.h
 
 #define IOMMU_READ (1  0)
@@ -93,6 +94,8 @@ enum iommu_attr {
  * @detach_dev: detach device from an iommu domain
  * @map: map a physically contiguous memory region to an iommu domain
  * @unmap: unmap a physically contiguous memory region from an iommu domain
+ * @map_range: map a scatter-gather list of physically contiguous memory 
chunks to an iommu domain
+ * @unmap_range: unmap a scatter-gather list of physically contiguous memory 
chunks from an iommu domain
  * @iova_to_phys: translate iova to physical address
  * @domain_has_cap: domain capabilities query
  * @add_device: add device to iommu grouping
@@ -110,6 +113,10 @@ struct iommu_ops {
   phys_addr_t paddr, size_t size, int prot);
size_t (*unmap)(struct iommu_domain *domain, unsigned long iova,
 size_t size);
+   int (*map_range)(struct iommu_domain *domain, unsigned int iova,
+   struct scatterlist *sg, unsigned int len, int opt);
+   int (*unmap_range)(struct iommu_domain *domain, unsigned int iova,
+ unsigned int len, int opt);
phys_addr_t (*iova_to_phys)(struct iommu_domain *domain, dma_addr_t 
iova);
int (*domain_has_cap)(struct iommu_domain *domain,
  unsigned long cap);
@@ -153,6 +160,10 @@ extern int iommu_map(struct iommu_domain *domain, unsigned 
long iova,
 phys_addr_t paddr, size_t size, int prot);
 extern size_t iommu_unmap(struct iommu_domain *domain, unsigned long iova,
   size_t size);
+extern int iommu_map_range(struct iommu_domain *domain, unsigned int iova,
+   struct scatterlist *sg, unsigned int len, int opt);
+extern int iommu_unmap_range(struct iommu_domain *domain, unsigned