[Bug 1894869]
https://bugs.launchpad.net/qemu/+bug/1894869 Here's the discussion with the upstream devs. The problem ended up being on Chelsio's part as either the .7 funciton fo these cards should not have even been exposed to the OS in the first place, or SR-IOV is necessary to actually correct the parameters of this function. Unfortunately, it looks like SR-IOV is no longer possible to enable on these cards. Thank you for your help. -- You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1894869 Title: Chelsio T4 has old MSIX PBA offset bug Status in QEMU: Invalid Status in Debian: In Progress Bug description: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I discovered this bug on a Proxmox system, and I was working with a downstream Proxmox developer to try to fix this issue. They provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1894869/+subscriptions
[Bug 1894869] Re: Chelsio T4 has old MSIX PBA offset bug
I was able to boot a VM with just the functions of the device with the ethernet controller function ID added as PCI devices. Something I noticed while adding in those devices though is that all of the others have a description associated with them in Proxmox, but the one that's causing the boot to fail doesn't. I attached a picture of the menu, 81:00.7 has no functions associated with it. So it seems like it just doesn't have any function at all? Unless it benefits QEMU to know whether turning SR-IOV on for these cards fixes the problem, I don't think I'm going to go through the process of turning it on, since the process looks terrible. Thank you for your help. ** Attachment added: "no 7th.JPG" https://bugs.launchpad.net/qemu/+bug/1894869/+attachment/5411172/+files/no%207th.JPG -- You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1894869 Title: Chelsio T4 has old MSIX PBA offset bug Status in QEMU: New Status in Debian: In Progress Bug description: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I discovered this bug on a Proxmox system, and I was working with a downstream Proxmox developer to try to fix this issue. They provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1894869/+subscriptions
[Bug 1894869] Re: Chelsio T4 has old MSIX PBA offset bug
Yeah, I figured out that the logic behind that patch was failed and corrected it to get the same error again already. Just to clarify, it is two of the same card giving the same error. I ran dmesg --level=err, but got no output. In the full output of dmesg, though, I noticed that there are some problems with the nics, but I don't know enough about this to know if there's anything I can do about it. I included dmesg output here. I don't believe the nics are giving the host any functionality since I added the driver for them to the blacklist, so it shouldn't even be getting loaded by it. In case it's useful, I'm not sure if SR-IOV is enabled on these cards or not, though I'm trying to use PCI passthrough for my VMs. ** Attachment added: "Output of dmesg" https://bugs.launchpad.net/qemu/+bug/1894869/+attachment/5410926/+files/dmesg.txt -- You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1894869 Title: Chelsio T4 has old MSIX PBA offset bug Status in QEMU: New Status in Debian: In Progress Bug description: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I discovered this bug on a Proxmox system, and I was working with a downstream Proxmox developer to try to fix this issue. They provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1894869/+subscriptions
[Bug 1894869] Re: Chelsio T4 has old MSIX PBA offset bug
** Bug watch added: bugzilla.proxmox.com/ #2969 https://bugzilla.proxmox.com/show_bug.cgi?id=2969 ** Also affects: debian via https://bugzilla.proxmox.com/show_bug.cgi?id=2969 Importance: Unknown Status: Unknown -- You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1894869 Title: Chelsio T4 has old MSIX PBA offset bug Status in QEMU: New Status in Debian: Unknown Bug description: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I discovered this bug on a Proxmox system, and I was working with a downstream Proxmox developer to try to fix this issue. They provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1894869/+subscriptions
[Bug 1894869] Re: Chelsio T4 has old MSIX PBA offset bug
** Description changed: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR - I was working with a downstream Proxmox developer to try to fix this - issue, and they provided me with the following change to make from line - 1484 of hw/vfio/pci.c: + I discovered this bug on a Proxmox system, and I was working with a + downstream Proxmox developer to try to fix this issue. They provided me + with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) - * is 0x1000, so we hard code that here. - */ - if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && + * is 0x1000, so we hard code that here. + */ + if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { - msix->pba_offset = 0x1000; - } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { - error_setg(errp, "hardware reports invalid configuration, " + msix->pba_offset = 0x1000; + } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { + error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv -- You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1894869 Title: Chelsio T4 has old MSIX PBA offset bug Status in QEMU: New Bug description: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I discovered this bug on a Proxmox system, and I was working with a downstream Proxmox developer to try to fix this issue. They provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1894869/+subscriptions
[Bug 1894869] [NEW] Chelsio T4 has old MSIX PBA offset bug
Public bug reported: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I was working with a downstream Proxmox developer to try to fix this issue, and they provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv ** Affects: qemu Importance: Undecided Status: New ** Tags: chelsio t4 ** Attachment added: "Full lspci -nnkvv output" https://bugs.launchpad.net/bugs/1894869/+attachment/5408718/+files/lspci.txt -- You received this bug notification because you are a member of qemu- devel-ml, which is subscribed to QEMU. https://bugs.launchpad.net/bugs/1894869 Title: Chelsio T4 has old MSIX PBA offset bug Status in QEMU: New Bug description: There exists a bug with Chelsio NICs T4 that causes the following error: kvm: -device vfio- pci,host=:83:00.7,id=hostpci1.7,bus=pci.0,addr=0x11.7: vfio :83:00.7: hardware reports invalid configuration, MSIX PBA outside of specified BAR I was working with a downstream Proxmox developer to try to fix this issue, and they provided me with the following change to make from line 1484 of hw/vfio/pci.c: static void vfio_msix_early_setup(VFIOPCIDevice *vdev, Error **errp) * is 0x1000, so we hard code that here. */ if (vdev->vendor_id == PCI_VENDOR_ID_CHELSIO && -(vdev->device_id & 0xff00) == 0x5800) { +((vdev->device_id & 0xff00) == 0x5800 || + (vdev->device_id & 0xff00) == 0x1425)) { msix->pba_offset = 0x1000; } else if (vdev->msix_relo == OFF_AUTOPCIBAR_OFF) { error_setg(errp, "hardware reports invalid configuration, " However, I found that this did not fix the issue, so the bug appears to work differently than the one that was present on the T5 NICs which has already been patched. I have attached the output of my lspci -nnkvv To manage notifications about this bug go to: https://bugs.launchpad.net/qemu/+bug/1894869/+subscriptions