[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-08-23 Thread Launchpad Bug Tracker
[Expired for linux-signed-hwe-5.8 (Ubuntu) because there has been no activity for 60 days.] ** Changed in: linux-signed-hwe-5.8 (Ubuntu) Status: Incomplete => Expired -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-24 Thread Milos Vyletel
As expected 5.11 did crash. Unfortunately the vmcore is not useful as it filled up disk space so it is incomplete and I had to remove it to free up some space. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-23 Thread Po-Hsu Lin
BTW For 5.11 (or oem-5.10 on Focal) it has only 3914d88f7608e6c ("xsk: Respect device's headroom and tailroom on generic xmit path") applied, so I expect you will see this issue with it. For anything older than 5.10, none of these four patches were applied. -- You received this bug notificatio

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-22 Thread Milos Vyletel
Thanks for looking into this. I will try edge HWE. I did try it week or two ago but it was still at 5.8. I will need to let the 5.11 kernel run for couple of days to be sure that it is properly fixed. Btw the same problem also exists on 5.4 kernel line. I do not have vmcore from that kernel versi

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-22 Thread Po-Hsu Lin
Considering the complexity of these and the fact that 5.8 is missing other vital parts to get these applied, e.g.: * f09ced4053 xsk: Fix race in SKB mode transmit with shared cq * 1c1efc2af1 xsk: Create and free buffer pool independently from umem I think it's quite unlikely to get this fix

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-22 Thread Po-Hsu Lin
They're in the upstream tree: https://github.com/torvalds/linux/commit/ab5bd583b9289666e918f9e5f672d33ccdfd49b2 https://github.com/torvalds/linux/commit/c2ff53d8049f30098153cd2d1299a44d7b124c57 https://github.com/torvalds/linux/commit/3914d88f7608e6c2e80e344474fa289370c32451 https://github.com/tor

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-22 Thread Po-Hsu Lin
I think this patchset is the fix: https://lore.kernel.org/bpf/20210218204908.5455-1-aloba...@pm.me/ -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1933245 Title: mlx5_core: Error cqe on cqn leads to

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-22 Thread Po-Hsu Lin
** Changed in: linux-signed-hwe-5.8 (Ubuntu) Assignee: (unassigned) => Po-Hsu Lin (cypressyew) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1933245 Title: mlx5_core: Error cqe on cqn leads to

[Bug 1933245] Re: mlx5_core: Error cqe on cqn leads to hang

2021-06-22 Thread Milos Vyletel
adding dmesg from vmcore ** Attachment added: "dmesg.202106161938" https://bugs.launchpad.net/ubuntu/+source/linux-signed-hwe-5.8/+bug/1933245/+attachment/5506276/+files/dmesg.202106161938 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to U