Targeting series with support to AMD Strix Halo or newer.
** Description changed:
Products containing gfx1151 architecture with multiple microcontrollers
(VPE, PSP, VCN, SDMA, etc.), observed a few page faults during heavy
loading or with stress applications on the CRB. This requires rebasing
these firmware versions to eliminate the risk.
# upstream tag 20250211
* 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
+ # upstream tag 20250109
# upstream tag 20241210
* 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
* 4a172771d ("amdgpu: update psp 14.0.1 firmware")
* d316e650c ("amdgpu: update gc 11.5.1 firmware")
# upstream tag 20241110
# upstream tag 20240811
* f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
# upstream tag 20240709
[ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270433] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
[ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data)
(0xa)
[ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
[ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270454] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270470] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
** Changed in: linux-firmware (Ubuntu Plucky)
Status: New => Fix Released
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-firmware in Ubuntu.
https://bugs.launchpad.net/bugs/2100769
Title:
Update amdgpu FW for GC 11.5.1
Status in HWE Next:
New
Status in linux-firmware package in Ubuntu:
Fix Released
Status in linux-firmware source package in Noble:
New
Status in linux-firmware source package in Oracular:
New
Status in linux-firmware source package in Plucky:
Fix Released
Bug description:
Products containing gfx1151 architecture with multiple
microcontrollers (VPE, PSP, VCN, SDMA, etc.), observed a few page
faults during heavy loading or with stress applications on the CRB.
This requires rebasing these firmware versions to eliminate the risk.
# upstream tag 20250211
* 52d598fe2 ("amdgpu: update vcn 4.0.6 firmware")
# upstream tag 20250109
# upstream tag 20241210
* 5bce792a7 ("amdgpu: update vpe 6.1.1 firmware")
* 4a172771d ("amdgpu: update psp 14.0.1 firmware")
* d316e650c ("amdgpu: update gc 11.5.1 firmware")
# upstream tag 20241110
# upstream tag 20240811
* f4b6b75fc ("amdgpu: update SDMA 6.1.1 firmware")
# upstream tag 20240709
[ 217.270407] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270426] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270430] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270433] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00901431
[ 217.270435] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data)
(0xa)
[ 217.270437] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x1
[ 217.270438] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270440] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x3
[ 217.270441] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270442] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270448] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270450] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270452] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270454] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270455] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270456] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270457] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270458] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270459] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270460] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
[ 217.270466] amdgpu 0000:c5:00.0: amdgpu: [gfxhub] page fault (src_id:0
ring:24 vmid:9 pasid:32771)
[ 217.270468] amdgpu 0000:c5:00.0: amdgpu: in process redshiftCmdLine pid
3362 thread redshiftCmdLine pid 3362)
[ 217.270469] amdgpu 0000:c5:00.0: amdgpu: in page starting at address
0x0000000000000000 from client 10
[ 217.270470] amdgpu 0000:c5:00.0: amdgpu:
GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
[ 217.270472] amdgpu 0000:c5:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
[ 217.270473] amdgpu 0000:c5:00.0: amdgpu: MORE_FAULTS: 0x0
[ 217.270474] amdgpu 0000:c5:00.0: amdgpu: WALKER_ERROR: 0x0
[ 217.270475] amdgpu 0000:c5:00.0: amdgpu: PERMISSION_FAULTS: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: MAPPING_ERROR: 0x0
[ 217.270476] amdgpu 0000:c5:00.0: amdgpu: RW: 0x0
To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2100769/+subscriptions
--
Mailing list: https://launchpad.net/~kernel-packages
Post to : [email protected]
Unsubscribe : https://launchpad.net/~kernel-packages
More help : https://help.launchpad.net/ListHelp