[Kernel-packages] [Bug 2056498] Re: Kernel crash in amd gpu driver

2024-03-09 Thread Mario Limonciello
*** This bug is a duplicate of bug 2039926 ***
https://bugs.launchpad.net/bugs/2039926

** This bug has been marked a duplicate of bug 2039926
   Error UBSAN: array-index-out-of-bounds amdgpu 
(drivers/gpu/drm/amd/amdgpu/../pm/powerplay/hwmgr/smu7_hwmgr.c)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-signed-hwe-6.5 in Ubuntu.
https://bugs.launchpad.net/bugs/2056498

Title:
  Kernel crash in amd gpu driver

Status in linux-signed-hwe-6.5 package in Ubuntu:
  New

Bug description:
  Mar  7 19:07:10 ripper kernel: [9.873519] UBSAN: 
array-index-out-of-bounds in 
/build/linux-hwe-6.5-YpKOvT/linux-hwe-6.5-6.5.0/drivers/gpu/drm/amd/amdgpu/../pm/powerplay/hwmgr/smu7_hwmgr.c:3676:4
  Mar  7 19:07:10 ripper kernel: [9.873531] index 7 is out of range for 
type 'ATOM_Polaris_SCLK_Dependency_Record [1]'
  Mar  7 19:07:10 ripper kernel: [9.873538] CPU: 4 PID: 849 Comm: 
systemd-udevd Not tainted 6.5.0-17-generic #17~22.04.1-Ubuntu
  Mar  7 19:07:10 ripper kernel: [9.873542] Hardware name: LENOVO 
30E1S3VV00/1046, BIOS S07KT45A 01/20/2022
  Mar  7 19:07:10 ripper kernel: [9.873544] Call Trace:
  Mar  7 19:07:10 ripper kernel: [9.873545]  
  Mar  7 19:07:10 ripper kernel: [9.873547]  dump_stack_lvl+0x48/0x70
  Mar  7 19:07:10 ripper kernel: [9.873551]  dump_stack+0x10/0x20
  Mar  7 19:07:10 ripper kernel: [9.873554]  
__ubsan_handle_out_of_bounds+0xc6/0x110
  Mar  7 19:07:10 ripper kernel: [9.873560]  
smu7_get_pp_table_entry_callback_func_v1+0x9b7/0xa00 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.873897]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.873900]  ? vi_pcie_rreg+0x6e/0x90 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.874187]  ? 
__pfx_smu7_get_pp_table_entry_callback_func_v1+0x10/0x10 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.874515]  
get_powerplay_table_entry_v1_0+0xf8/0x490 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.874842]  
smu7_get_pp_table_entry_v1+0x41/0x4c0 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.875169]  
smu7_get_pp_table_entry+0x3d/0x50 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.875495]  
psm_init_power_state_table+0x161/0x250 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.875826]  hwmgr_hw_init+0xe3/0x1e0 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.876150]  pp_hw_init+0x16/0x50 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.876484]  
amdgpu_device_ip_init+0x48d/0x960 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.876749]  
amdgpu_device_init+0x9a2/0x1150 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877014]  
amdgpu_driver_load_kms+0x1a/0x1c0 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877278]  amdgpu_pci_probe+0x182/0x450 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877541]  local_pci_probe+0x47/0xb0
  Mar  7 19:07:10 ripper kernel: [9.877545]  pci_call_probe+0x55/0x190
  Mar  7 19:07:10 ripper kernel: [9.877550]  pci_device_probe+0x84/0x120
  Mar  7 19:07:10 ripper kernel: [9.877553]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.877557]  really_probe+0x1cc/0x430
  Mar  7 19:07:10 ripper kernel: [9.877560]  
__driver_probe_device+0x8c/0x190
  Mar  7 19:07:10 ripper kernel: [9.877563]  driver_probe_device+0x24/0xd0
  Mar  7 19:07:10 ripper kernel: [9.877566]  __driver_attach+0x10b/0x210
  Mar  7 19:07:10 ripper kernel: [9.877569]  ? 
__pfx___driver_attach+0x10/0x10
  Mar  7 19:07:10 ripper kernel: [9.877572]  bus_for_each_dev+0x8d/0xf0
  Mar  7 19:07:10 ripper kernel: [9.877576]  driver_attach+0x1e/0x30
  Mar  7 19:07:10 ripper kernel: [9.877579]  bus_add_driver+0x127/0x240
  Mar  7 19:07:10 ripper kernel: [9.877583]  driver_register+0x5e/0x130
  Mar  7 19:07:10 ripper kernel: [9.877586]  ? __pfx_amdgpu_init+0x10/0x10 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877849]  __pci_register_driver+0x62/0x70
  Mar  7 19:07:10 ripper kernel: [9.877852]  amdgpu_init+0x69/0xff0 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.878111]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.878114]  do_one_initcall+0x5e/0x340
  Mar  7 19:07:10 ripper kernel: [9.878120]  do_init_module+0x68/0x260
  Mar  7 19:07:10 ripper kernel: [9.878123]  load_module+0xb85/0xcd0
  Mar  7 19:07:10 ripper kernel: [9.878128]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.878131]  ? 
security_kernel_post_read_file+0x75/0x90
  Mar  7 19:07:10 ripper kernel: [9.878136]  
init_module_from_file+0x96/0x100
  Mar  7 19:07:10 ripper kernel: [9.878139]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.878142]  ? 
init_module_from_file+0x96/0x100
  Mar  7 19:07:10 ripper kernel: [9.878149]  
idempotent_init_module+0x11c/0x2b0
  Mar  7 19:07:10 ripper kernel: [9.878155]  
__x64_sys_finit_module+0x64/0xd0
  Mar  7 19:07:10 ripper kernel: [9.878159]  do_syscall_64+0x5b/0x90
  

[Kernel-packages] [Bug 2056498] Re: Kernel crash in amd gpu driver

2024-03-08 Thread Brad Figg
The above crash was happening with large downloads of img files or git
clones of large repositories (Ubuntu kernels) over wifi. I have changed
to hard wired ethernet and I've not been able to reproduce it. With Wifi
it's been very reproduceable.

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-signed-hwe-6.5 in Ubuntu.
https://bugs.launchpad.net/bugs/2056498

Title:
  Kernel crash in amd gpu driver

Status in linux-signed-hwe-6.5 package in Ubuntu:
  New

Bug description:
  Mar  7 19:07:10 ripper kernel: [9.873519] UBSAN: 
array-index-out-of-bounds in 
/build/linux-hwe-6.5-YpKOvT/linux-hwe-6.5-6.5.0/drivers/gpu/drm/amd/amdgpu/../pm/powerplay/hwmgr/smu7_hwmgr.c:3676:4
  Mar  7 19:07:10 ripper kernel: [9.873531] index 7 is out of range for 
type 'ATOM_Polaris_SCLK_Dependency_Record [1]'
  Mar  7 19:07:10 ripper kernel: [9.873538] CPU: 4 PID: 849 Comm: 
systemd-udevd Not tainted 6.5.0-17-generic #17~22.04.1-Ubuntu
  Mar  7 19:07:10 ripper kernel: [9.873542] Hardware name: LENOVO 
30E1S3VV00/1046, BIOS S07KT45A 01/20/2022
  Mar  7 19:07:10 ripper kernel: [9.873544] Call Trace:
  Mar  7 19:07:10 ripper kernel: [9.873545]  
  Mar  7 19:07:10 ripper kernel: [9.873547]  dump_stack_lvl+0x48/0x70
  Mar  7 19:07:10 ripper kernel: [9.873551]  dump_stack+0x10/0x20
  Mar  7 19:07:10 ripper kernel: [9.873554]  
__ubsan_handle_out_of_bounds+0xc6/0x110
  Mar  7 19:07:10 ripper kernel: [9.873560]  
smu7_get_pp_table_entry_callback_func_v1+0x9b7/0xa00 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.873897]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.873900]  ? vi_pcie_rreg+0x6e/0x90 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.874187]  ? 
__pfx_smu7_get_pp_table_entry_callback_func_v1+0x10/0x10 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.874515]  
get_powerplay_table_entry_v1_0+0xf8/0x490 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.874842]  
smu7_get_pp_table_entry_v1+0x41/0x4c0 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.875169]  
smu7_get_pp_table_entry+0x3d/0x50 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.875495]  
psm_init_power_state_table+0x161/0x250 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.875826]  hwmgr_hw_init+0xe3/0x1e0 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.876150]  pp_hw_init+0x16/0x50 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.876484]  
amdgpu_device_ip_init+0x48d/0x960 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.876749]  
amdgpu_device_init+0x9a2/0x1150 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877014]  
amdgpu_driver_load_kms+0x1a/0x1c0 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877278]  amdgpu_pci_probe+0x182/0x450 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877541]  local_pci_probe+0x47/0xb0
  Mar  7 19:07:10 ripper kernel: [9.877545]  pci_call_probe+0x55/0x190
  Mar  7 19:07:10 ripper kernel: [9.877550]  pci_device_probe+0x84/0x120
  Mar  7 19:07:10 ripper kernel: [9.877553]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.877557]  really_probe+0x1cc/0x430
  Mar  7 19:07:10 ripper kernel: [9.877560]  
__driver_probe_device+0x8c/0x190
  Mar  7 19:07:10 ripper kernel: [9.877563]  driver_probe_device+0x24/0xd0
  Mar  7 19:07:10 ripper kernel: [9.877566]  __driver_attach+0x10b/0x210
  Mar  7 19:07:10 ripper kernel: [9.877569]  ? 
__pfx___driver_attach+0x10/0x10
  Mar  7 19:07:10 ripper kernel: [9.877572]  bus_for_each_dev+0x8d/0xf0
  Mar  7 19:07:10 ripper kernel: [9.877576]  driver_attach+0x1e/0x30
  Mar  7 19:07:10 ripper kernel: [9.877579]  bus_add_driver+0x127/0x240
  Mar  7 19:07:10 ripper kernel: [9.877583]  driver_register+0x5e/0x130
  Mar  7 19:07:10 ripper kernel: [9.877586]  ? __pfx_amdgpu_init+0x10/0x10 
[amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.877849]  __pci_register_driver+0x62/0x70
  Mar  7 19:07:10 ripper kernel: [9.877852]  amdgpu_init+0x69/0xff0 [amdgpu]
  Mar  7 19:07:10 ripper kernel: [9.878111]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.878114]  do_one_initcall+0x5e/0x340
  Mar  7 19:07:10 ripper kernel: [9.878120]  do_init_module+0x68/0x260
  Mar  7 19:07:10 ripper kernel: [9.878123]  load_module+0xb85/0xcd0
  Mar  7 19:07:10 ripper kernel: [9.878128]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.878131]  ? 
security_kernel_post_read_file+0x75/0x90
  Mar  7 19:07:10 ripper kernel: [9.878136]  
init_module_from_file+0x96/0x100
  Mar  7 19:07:10 ripper kernel: [9.878139]  ? srso_return_thunk+0x5/0x10
  Mar  7 19:07:10 ripper kernel: [9.878142]  ? 
init_module_from_file+0x96/0x100
  Mar  7 19:07:10 ripper kernel: [9.878149]  
idempotent_init_module+0x11c/0x2b0
  Mar  7 19:07:10 ripper kernel: [9.878155]  
__x64_sys_finit_module+0x64/0xd0
  Mar  7 19:07:10 ripper kernel: [9.878159]  do_syscall_64+0x5b/0x90
  Mar  7 19:07:10