[Bug 1990272] Re: PCIe Bus Error: Uncorrected, Transaction Layer, device [8086:51b0], AER UnsupReq

2024-06-24 Thread mark mccarthy
I'm having similar issues - the PCIe device in question seems to be the
wireless card in my case. Every now and then my system (Dell Optiplex
3050) will lock up entirely; no app hosting, no SSH, no anything, and
only a forced reboot will fix it - for a while, before it locks up
again. Syslog has a _slew_ of these errors present before the new-
session/reboot takes effect. Coming from Focal, and that machine hardly
every had any problems.

#[about 40 pages of the same error above]
#--- 

Jun 23 07:04:13 optiplex2 kernel: [403063.425670] pcieport :00:1c.7: AER: 
Corrected error message received from :00:1c.7
Jun 23 07:04:13 optiplex2 kernel: [403063.425680] pcieport :00:1c.7: PCIe 
Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
Jun 23 07:04:13 optiplex2 kernel: [403063.425682] pcieport :00:1c.7:   
device [8086:a297] error status/mask=0001/2000
Jun 23 07:04:13 optiplex2 kernel: [403063.425685] pcieport :00:1c.7:[ 
0] RxErr

#---then force rebooted later in the morning after server had crashed 
overnight---  
   
Jun 23 19:09:06 optiplex2 systemd-modules-load[438]: Inserted module 'msr'
Jun 23 19:09:06 optiplex2 kernel: [0.00] microcode: microcode updated 
early to revision 0xf8, date = 2023-09-28
Jun 23 19:09:06 optiplex2 kernel: [0.00] Linux version 
5.15.0-112-generic (buildd@lcy02-amd64-051) (gcc (Ubuntu 11.4.0-1ubuntu1~22.04) 
11.4.0, GNU ld (GNU Binutils for Ubuntu) 2.38) #122-Ubuntu SMP Thu May 23 
07:48:21 UTC 2024 (Ubuntu 5.15.0-112.122-generic 5.15.152)
Jun 23 19:09:06 optiplex2 kernel: [0.00] Command line: 
BOOT_IMAGE=/boot/vmlinuz-5.15.0-112-generic 
root=UUID=694b220d-e9d0-47d3-9b8b-3e069ee1983c ro
Jun 23 19:09:06 optiplex2 kernel: [0.00] KERNEL supported cpus:


Linux optiplex2 5.15.0-112-generic #122-Ubuntu SMP Thu May 23 07:48:21
UTC 2024 x86_64 x86_64 x86_64 GNU/Linux

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1990272

Title:
  PCIe Bus Error: Uncorrected, Transaction Layer, device [8086:51b0],AER
  UnsupReq

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1990272/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

[Bug 1990272] Re: PCIe Bus Error: Uncorrected, Transaction Layer, device [8086:51b0], AER UnsupReq

2024-05-23 Thread Bernie Lademann
I have also experienced a similar issue with my new Thinkpad P1 Gen6.
Since I am dependent on the new Thunderbolt ports for my videography
work, the laptop is essentially useless now.  Lenovo replaced the
motherboard yesterday and there was no change in the behaviour.  On
doing some research, I see that similar issues have occurred for a long
time.  In any case, the key lines in my dmesg, leading to failure are:

[  125.804774] pcieport :00:1d.0: AER: Uncorrectable (Fatal) error message 
received from :20:00.0
[  125.804793] pcieport :20:00.0: AER: PCIe Bus Error: 
severity=Uncorrectable (Fatal), type=Inaccessible, (Unregistered Agent ID)
[  125.804803] thunderbolt :22:00.0: AER: can't recover (no error_detected 
callback)
[  125.804807] xhci_hcd :48:00.0: AER: can't recover (no error_detected 
callback)
[  125.937470] pcieport :00:1d.0: AER: Root Port link has been reset (0)
[  125.937563] pcieport :00:1d.0: AER: device recovery failed



Then the system simply gives up as follows:

[  149.485418] xhci_hcd :48:00.0: xHCI host controller not responding, 
assume dead
[  149.485440] xhci_hcd :48:00.0: HC died; cleaning up

Incidently, I created a bootable Windows disk today and confirmed that
this issue is only related to Linux.

I have also tried updated kernels, 6.8.8 and now 6.8.10 and the issue is
still present.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1990272

Title:
  PCIe Bus Error: Uncorrected, Transaction Layer, device [8086:51b0],AER
  UnsupReq

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1990272/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs