On Fri, Oct 30, 2015 at 11:48:21AM +0800, Wei Yang wrote:
>On Thu, Oct 29, 2015 at 05:22:54PM -0500, Bjorn Helgaas wrote:
>>ines: 115
>>
>>From: Alexander Duyck <adu...@mirantis.com>
>>
>>The enumeration path should leave NumVFs set to zero.  But after
>>4449f079722c ("PCI: Calculate maximum number of buses required for VFs"),
>>we call virtfn_max_buses() in the enumeration path, which changes NumVFs.
>>This NumVFs change is visible via lspci and sysfs until a driver enables
>>SR-IOV.
>>
>>Iterate from TotalVFs down to zero so NumVFs is zero when we're finished
>>computing the maximum number of buses.  Validate offset and stride in
>>the loop, so we can test it at every possible NumVFs setting.  Rename
>>virtfn_max_buses() to compute_max_vf_buses() to hint that it does have a
>>side effect of updating iov->max_VF_buses.
>>
>>[bhelgaas: changelog, rename, allow numVF==1 && stride==0, rework loop,
>>reverse sense of error path]
>>Fixes: 4449f079722c ("PCI: Calculate maximum number of buses required for 
>>VFs")
>>Based-on-patch-by: Ethan Zhao <ethan.z...@oracle.com>
>>Signed-off-by: Alexander Duyck <adu...@mirantis.com>
>>Signed-off-by: Bjorn Helgaas <bhelg...@google.com>
>>---
>> drivers/pci/iov.c |   41 ++++++++++++++++++++++-------------------
>> 1 file changed, 22 insertions(+), 19 deletions(-)
>>
>>diff --git a/drivers/pci/iov.c b/drivers/pci/iov.c
>>index 0cdb2d1..1b1acc2 100644
>>--- a/drivers/pci/iov.c
>>+++ b/drivers/pci/iov.c
>>@@ -54,24 +54,29 @@ static inline void pci_iov_set_numvfs(struct pci_dev 
>>*dev, int nr_virtfn)
>>  * The PF consumes one bus number.  NumVFs, First VF Offset, and VF Stride
>>  * determine how many additional bus numbers will be consumed by VFs.
>>  *
>>- * Iterate over all valid NumVFs and calculate the maximum number of bus
>>- * numbers that could ever be required.
>>+ * Iterate over all valid NumVFs, validate offset and stride, and calculate
>>+ * the maximum number of bus numbers that could ever be required.
>>  */
>>-static inline u8 virtfn_max_buses(struct pci_dev *dev)
>>+static int compute_max_vf_buses(struct pci_dev *dev)
>> {
>>      struct pci_sriov *iov = dev->sriov;
>>-     int nr_virtfn;
>>-     u8 max = 0;
>>-     int busnr;
>>+     int nr_virtfn, busnr, rc = 0;
>>
>>-     for (nr_virtfn = 1; nr_virtfn <= iov->total_VFs; nr_virtfn++) {
>>+     for (nr_virtfn = iov->total_VFs; nr_virtfn; nr_virtfn--) {
>
>Hmm... I agree to iterate the nr_virtfn from total_VFs to 0, which keep the
>original logic, before my patch.
>
>
>>              pci_iov_set_numvfs(dev, nr_virtfn);
>>+             if (!iov->offset || (nr_virtfn > 1 && !iov->stride)) {
>                                     ^^^

After reading the SPEC, I think this code is correct. The original one removed
below may not comply with the SPEC.

>
>Should be this?
>                if (!iov->offset || (iov->total_VFs > 1 && !iov->stride))
>
>
>>+                     rc = -EIO;
>>+                     goto out;
>>+             }
>>+
>>              busnr = pci_iov_virtfn_bus(dev, nr_virtfn - 1);
>>-             if (busnr > max)
>>-                     max = busnr;
>>+             if (busnr > iov->max_VF_buses)
>>+                     iov->max_VF_buses = busnr;
>>      }
>>
>>-     return max;
>>+out:
>>+     pci_iov_set_numvfs(dev, 0);
>>+     return rc;
>> }
>>
>> static struct pci_bus *virtfn_add_bus(struct pci_bus *bus, int busnr)
>>@@ -384,7 +389,7 @@ static int sriov_init(struct pci_dev *dev, int pos)
>>      int rc;
>>      int nres;
>>      u32 pgsz;
>>-     u16 ctrl, total, offset, stride;
>>+     u16 ctrl, total;
>>      struct pci_sriov *iov;
>>      struct resource *res;
>>      struct pci_dev *pdev;
>>@@ -410,11 +415,6 @@ static int sriov_init(struct pci_dev *dev, int pos)
>>
>> found:
>>      pci_write_config_word(dev, pos + PCI_SRIOV_CTRL, ctrl);
>>-     pci_write_config_word(dev, pos + PCI_SRIOV_NUM_VF, 0);
>>-     pci_read_config_word(dev, pos + PCI_SRIOV_VF_OFFSET, &offset);
>>-     pci_read_config_word(dev, pos + PCI_SRIOV_VF_STRIDE, &stride);
>>-     if (!offset || (total > 1 && !stride))
>>-             return -EIO;
>>
>
>Original code will return when it found this condition, so that we don't need
>to bother to do those initialization and then fall back.
>
>So I suggest to keep it here.
>
>>      pci_read_config_word(dev, pos + PCI_SRIOV_TOTAL_VF, &total);
>>      if (!total)
>>@@ -456,8 +456,6 @@ found:
>>      iov->nres = nres;
>>      iov->ctrl = ctrl;
>>      iov->total_VFs = total;
>>-     iov->offset = offset;
>>-     iov->stride = stride;
>>      iov->pgsz = pgsz;
>>      iov->self = dev;
>>      pci_read_config_dword(dev, pos + PCI_SRIOV_CAP, &iov->cap);
>>@@ -474,10 +472,15 @@ found:
>>
>>      dev->sriov = iov;
>>      dev->is_physfn = 1;
>>-     iov->max_VF_buses = virtfn_max_buses(dev);
>>+     rc = compute_max_vf_buses(dev);
>>+     if (rc)
>>+             goto fail_max_buses;
>>
>>      return 0;
>>
>>+fail_max_buses:
>>+     dev->sriov = NULL;
>
>The dev->sriov is allocated with kzalloc(), seems we forget to release it?
>
>>+     dev->is_physfn = 0;
>> failed:
>>      for (i = 0; i < PCI_SRIOV_NUM_BARS; i++) {
>>              res = &dev->resource[i + PCI_IOV_R
>
>-- 
>Richard Yang
>Help you, Help me

-- 
Richard Yang
Help you, Help me

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to