** Description changed:

+ [SRU Justification]
+ 
+ [ Impact ]
+ 
+ AMD Strix Halo devices experience unexpected reboots while starting GUI
+ session every boot.
+ 
+ This is caused by:
+ * 689275140cb8e drm/amdgpu/hdp7.0: do a posting read when flushing HDP
+ * abe1cbaec6cfe drm/amdgpu/hdp6.0: do a posting read when flushing HDP
+ * f756dbac1ce1d drm/amdgpu/hdp5.2: do a posting read when flushing HDP
+ * cf424020e040b drm/amdgpu/hdp5.0: do a posting read when flushing HDP
+ * c9b8dcabb52af drm/amdgpu/hdp4.0: do a posting read when flushing HDP
+ 
+ And fixed by:
+ * 5a11a27677311 drm/amdgpu/hdp7: use memcfg register to post the write for 
HDP flush
+ * ca28e80abe421 drm/amdgpu/hdp6: use memcfg register to post the write for 
HDP flush
+ * dbc988c689333 drm/amdgpu/hdp5.2: use memcfg register to post the write for 
HDP flush
+ * 0e33e0f339b91 drm/amdgpu/hdp5: use memcfg register to post the write for 
HDP flush
+ * f690e3974755a drm/amdgpu/hdp4: use memcfg register to post the write for 
HDP flush
+ 
+ [ Test Plan ]
+ 
+ 1. Install -proposed kernel
+ 2. Reboot to the installed kernel and check if the device may finish boot 
process to GUI session.
+ 
+ [ Where problems could occur ]
+ 
+ No. This fix a problem when reading back the whole mapped
+ KFD_MMIO_REMAP_HDP_MEM_FLUSH_CNTL register by reading another register
+ that also triggers the flush.
+ 
+ [ Other Info ]
+ 
+ The causing commits were first committed to v6.13-rc1 , and have also been 
backported to v6.12.5, v6.6.69, v6.1.120. The fixes are available in v5.15-rc6, 
also backported to v6.14.7, v6.12.29, v6.6.91, v6.1.139, therefore this affects 
following Ubuntu kernels:
+ * linux-unstable/questing: latest (currently 6.15.0-2.2)
+ * linux/plucky: all (currently 6.14.0-22.22)
+ * linux-oem-6.14/noble: all (currently 6.14.0-1005.5)
+ * linux/oracular: 6.11.0-20.20 to latest (currently 6.11.0-28.28)
+ * linux-oem-6.11/noble: 6.11.0-1017.17 to latest (currently 6.11.0-1024.24)
+ * linux/noble: 6.8.0-58.60 to latest (currently 6.8.0-62.65)
+ * linux-oem-6.8/noble: 6.8.0-1026.26 to latest (currently 6.8.0-1029.29)
+ 
+ ========== original bug report ==========
+ 
  With 5 v6.13-rc1 commits , also backported to v6.12.5, v6.6.69, v6.1.120, 
some (at least AMD Strix Halo) devices may experience unexpected reboots while 
starting GUI session every boot:
  * 689275140cb8e drm/amdgpu/hdp7.0: do a posting read when flushing HDP
  * abe1cbaec6cfe drm/amdgpu/hdp6.0: do a posting read when flushing HDP
  * f756dbac1ce1d drm/amdgpu/hdp5.2: do a posting read when flushing HDP
  * cf424020e040b drm/amdgpu/hdp5.0: do a posting read when flushing HDP
  * c9b8dcabb52af drm/amdgpu/hdp4.0: do a posting read when flushing HDP
  
  The fixes are available in v5.15-rc6, also backported to v6.14.7, v6.12.29, 
v6.6.91, v6.1.139:
  * 5a11a27677311 drm/amdgpu/hdp7: use memcfg register to post the write for 
HDP flush
  * ca28e80abe421 drm/amdgpu/hdp6: use memcfg register to post the write for 
HDP flush
  * dbc988c689333 drm/amdgpu/hdp5.2: use memcfg register to post the write for 
HDP flush
  * 0e33e0f339b91 drm/amdgpu/hdp5: use memcfg register to post the write for 
HDP flush
  * f690e3974755a drm/amdgpu/hdp4: use memcfg register to post the write for 
HDP flush
  
  This affects following Ubuntu kernels:
  * linux-unstable/questing: latest (currently 6.15.0-2.2)
  * linux/plucky: all (currently 6.14.0-22.22)
  * linux-oem-6.14/noble: all (currently 6.14.0-1005.5)
  * linux/oracular: 6.11.0-20.20 to latest (currently 6.11.0-28.28)
  * linux-oem-6.11/noble: 6.11.0-1017.17 to latest (currently 6.11.0-1024.24)
  * linux/noble: 6.8.0-58.60 to latest (currently 6.8.0-62.65)
  * linux-oem-6.8/noble: 6.8.0-1026.26 to latest (currently 6.8.0-1029.29)

** Description changed:

  [SRU Justification]
  
  [ Impact ]
  
  AMD Strix Halo devices experience unexpected reboots while starting GUI
  session every boot.
  
  This is caused by:
  * 689275140cb8e drm/amdgpu/hdp7.0: do a posting read when flushing HDP
  * abe1cbaec6cfe drm/amdgpu/hdp6.0: do a posting read when flushing HDP
  * f756dbac1ce1d drm/amdgpu/hdp5.2: do a posting read when flushing HDP
  * cf424020e040b drm/amdgpu/hdp5.0: do a posting read when flushing HDP
  * c9b8dcabb52af drm/amdgpu/hdp4.0: do a posting read when flushing HDP
  
  And fixed by:
  * 5a11a27677311 drm/amdgpu/hdp7: use memcfg register to post the write for 
HDP flush
  * ca28e80abe421 drm/amdgpu/hdp6: use memcfg register to post the write for 
HDP flush
  * dbc988c689333 drm/amdgpu/hdp5.2: use memcfg register to post the write for 
HDP flush
  * 0e33e0f339b91 drm/amdgpu/hdp5: use memcfg register to post the write for 
HDP flush
  * f690e3974755a drm/amdgpu/hdp4: use memcfg register to post the write for 
HDP flush
  
  [ Test Plan ]
  
  1. Install -proposed kernel
  2. Reboot to the installed kernel and check if the device may finish boot 
process to GUI session.
  
  [ Where problems could occur ]
  
  No. This fix a problem when reading back the whole mapped
  KFD_MMIO_REMAP_HDP_MEM_FLUSH_CNTL register by reading another register
  that also triggers the flush.
  
  [ Other Info ]
  
  The causing commits were first committed to v6.13-rc1 , and have also been 
backported to v6.12.5, v6.6.69, v6.1.120. The fixes are available in v5.15-rc6, 
also backported to v6.14.7, v6.12.29, v6.6.91, v6.1.139, therefore this affects 
following Ubuntu kernels:
  * linux-unstable/questing: latest (currently 6.15.0-2.2)
  * linux/plucky: all (currently 6.14.0-22.22)
  * linux-oem-6.14/noble: all (currently 6.14.0-1005.5)
  * linux/oracular: 6.11.0-20.20 to latest (currently 6.11.0-28.28)
  * linux-oem-6.11/noble: 6.11.0-1017.17 to latest (currently 6.11.0-1024.24)
  * linux/noble: 6.8.0-58.60 to latest (currently 6.8.0-62.65)
  * linux-oem-6.8/noble: 6.8.0-1026.26 to latest (currently 6.8.0-1029.29)
  
+ Nominate for linux/noble, linux-oem-6.8/noble, linux/oracular, linux-
+ oem-6.11/noble, linux/plucky, linux-oem-6.14/noble, and linux-unstable.
+ 
  ========== original bug report ==========
  
  With 5 v6.13-rc1 commits , also backported to v6.12.5, v6.6.69, v6.1.120, 
some (at least AMD Strix Halo) devices may experience unexpected reboots while 
starting GUI session every boot:
  * 689275140cb8e drm/amdgpu/hdp7.0: do a posting read when flushing HDP
  * abe1cbaec6cfe drm/amdgpu/hdp6.0: do a posting read when flushing HDP
  * f756dbac1ce1d drm/amdgpu/hdp5.2: do a posting read when flushing HDP
  * cf424020e040b drm/amdgpu/hdp5.0: do a posting read when flushing HDP
  * c9b8dcabb52af drm/amdgpu/hdp4.0: do a posting read when flushing HDP
  
  The fixes are available in v5.15-rc6, also backported to v6.14.7, v6.12.29, 
v6.6.91, v6.1.139:
  * 5a11a27677311 drm/amdgpu/hdp7: use memcfg register to post the write for 
HDP flush
  * ca28e80abe421 drm/amdgpu/hdp6: use memcfg register to post the write for 
HDP flush
  * dbc988c689333 drm/amdgpu/hdp5.2: use memcfg register to post the write for 
HDP flush
  * 0e33e0f339b91 drm/amdgpu/hdp5: use memcfg register to post the write for 
HDP flush
  * f690e3974755a drm/amdgpu/hdp4: use memcfg register to post the write for 
HDP flush
  
  This affects following Ubuntu kernels:
  * linux-unstable/questing: latest (currently 6.15.0-2.2)
  * linux/plucky: all (currently 6.14.0-22.22)
  * linux-oem-6.14/noble: all (currently 6.14.0-1005.5)
  * linux/oracular: 6.11.0-20.20 to latest (currently 6.11.0-28.28)
  * linux-oem-6.11/noble: 6.11.0-1017.17 to latest (currently 6.11.0-1024.24)
  * linux/noble: 6.8.0-58.60 to latest (currently 6.8.0-62.65)
  * linux-oem-6.8/noble: 6.8.0-1026.26 to latest (currently 6.8.0-1029.29)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2112462

Title:
  Unexpected system reboot at loading GUI session on some AMD platforms

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2112462/+subscriptions


-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to