On 26/11/15 15:37, Joerg Roedel wrote:
On Fri, Nov 20, 2015 at 10:57:40AM +0000, Robin Murphy wrote:
diff --git a/drivers/iommu/dma-iommu.c b/drivers/iommu/dma-iommu.c
index 3a20db4..427fdc1 100644
--- a/drivers/iommu/dma-iommu.c
+++ b/drivers/iommu/dma-iommu.c
@@ -441,6 +441,7 @@ int iommu_dma_map_sg(struct device *dev, struct scatterlist *sg,
        struct scatterlist *s, *prev = NULL;
        dma_addr_t dma_addr;
        size_t iova_len = 0;
+       unsigned long mask = dma_get_seg_boundary(dev);
        int i;

        /*
@@ -452,6 +453,7 @@ int iommu_dma_map_sg(struct device *dev, struct scatterlist *sg,
        for_each_sg(sg, s, nents, i) {
                size_t s_offset = iova_offset(iovad, s->offset);
                size_t s_length = s->length;
+               size_t pad_len = (mask - iova_len + 1) & mask;

                sg_dma_address(s) = s->offset;
                sg_dma_len(s) = s_length;
@@ -460,15 +462,13 @@ int iommu_dma_map_sg(struct device *dev, struct scatterlist *sg,
                s->length = s_length;

                /*
-                * The simple way to avoid the rare case of a segment
-                * crossing the boundary mask is to pad the previous one
-                * to end at a naturally-aligned IOVA for this one's size,
-                * at the cost of potentially over-allocating a little.
+                * With a single size-aligned IOVA allocation, no segment risks
+                * crossing the boundary mask unless the total size exceeds
+                * the mask itself. The simple way to maintain alignment when
+                * that does happen is to pad the previous segment to end at the
+                * next boundary, at the cost of over-allocating a little.
                 */
-               if (prev) {
-                       size_t pad_len = roundup_pow_of_two(s_length);
-
-                       pad_len = (pad_len - iova_len) & (pad_len - 1);
+               if (pad_len && pad_len < s_length - 1) {
                        prev->length += pad_len;
                        iova_len += pad_len;
                }

Hmm, this whole code looks overly complicated. Why don't you just add
the sizes of the individual sg-elements together and then do an
allocation aligned on the next power-of-two? This will not cross the
boundary mask unless the size is bigger than the mask.

Other than all of 4 lines referencing pad_len, that's exactly what it does! In the very common case when the boundary mask is larger than the total length such that (pad_len < s_length - 1) is always false, that's still exactly what it does!

When the size is bigger than the mask you can either put a WARN in and
return an error (to see if that really happens), or just do multiple
smaller allocations that fit into the boundary mask.

That case is actually surprisingly common in at least one situation - a typical scatterlist coming in from the SCSI layer seems to have a max segment length of 65536, a boundary mask of 0xffff, and a short first segment followed by subsequent full segments (I guess they are the command buffer and data buffers respectively), meaning an alignment bump is needed fairly frequently there.

I wanted to avoid multiple allocations for various reasons:
- iommu_map_sg() only takes a single base IOVA, but it's nice if we can make use of it.
- Some of the "complication" would just be shifted into the unmap path, and having to make multiple unmap requests to the IOMMU driver increases the number of potentially costly TLB syncs.
- It minimises contention in the IOVA allocator, which is apparently a real-world problem ;)
- 'Proper' segment concatenation is only a small step on top of the current code - probably only a couple of extra lines in finalise_sg() (I just haven't yet looked into resurrecting it from iommu-dma v5 as I'm trying to get other things done).
- Unless we bring back the really dumb "blindly iommu_map() each segment individually" function from iommu-dma v1, we have to scan the list doing all this segment boundary calculation regardless. At that point it seems far cheaper and simpler to add one number to another number along the way than to break off into a whole allocate/map/see-if-it-worked cycle every time.
- I like to minimise the number of people complaining at me that whole scatterlists don't get mapped into contiguous IOVA ranges.

Robin.

_______________________________________________
iommu mailing list
iommu@lists.linux-foundation.org
https://lists.linuxfoundation.org/mailman/listinfo/iommu
