[PATCH] drm/amdgpu: avoid repeatedly executing gpu ras reset

2024-07-09 Thread YiPeng Chai
When a gpu in hive is performing ras reset, other gpus in hive do not need to schedule recovery work to reset the gpu. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 20 +++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm

[PATCH V2 1/2] drm/amdgpu: flush all cached ras bad pages to eeprom

2024-07-08 Thread YiPeng Chai
Before uninstalling gpu driver, flush all cached ras bad pages to eeprom. v2: Put the same code into a function and reuse the function. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 35 - 1 file changed, 29 insertions(+), 6 deletions

[PATCH V2 2/2] drm/amdgpu: timely save bad pages to eeprom after gpu ras reset is completed

2024-07-08 Thread YiPeng Chai
: 1. Add the above description to code comments. 2. Reuse existing function. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 18 ++ 2 files changed, 23 insertions(+), 1 deletion(-) diff --git

[PATCH 2/2] drm/amdgpu: timely save bad pages to eeprom after gpu ras reset is complete

2024-07-03 Thread YiPeng Chai
. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 14 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 6 ++ 2 files changed, 19 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 1/2] drm/amdgpu: flush all cached ras bad pages to eeprom

2024-07-03 Thread YiPeng Chai
Before uninstalling gpu driver, flush all cached ras bad pages to eeprom. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 17 + 1 file changed, 17 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: sysfs node disable query error count during gpu reset

2024-07-01 Thread YiPeng Chai
Sysfs node disable query error count during gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/aldebaran.c | 2 -- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 3 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c| 3 +++ 3 files changed, 5 insertions(+), 3 deletions(-) diff

[PATCH V2] drm/amdgpu: sysfs node disable query error count during gpu reset

2024-06-30 Thread YiPeng Chai
Sysfs node disable query error count during gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 15 +-- 1 file changed, 13 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

[PATCH] drm/amdgpu: sysfs node disable query error count during gpu reset

2024-06-28 Thread YiPeng Chai
Sysfs node disable query error count during gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH] drm/amdgpu: add mutex to protect ras shared memory

2024-06-25 Thread YiPeng Chai
Add mutex to protect ras shared memory. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c| 124 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h| 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 2 + 3 files changed, 87 insertions(+), 40 deletions

[PATCH V2 4/4] drm/amdgpu: add gpu reset check and exception handling

2024-06-19 Thread YiPeng Chai
Add gpu reset check and exception handling for page retirement. v2: Clear poison consumption messages cached in fifo after non mode-1 reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 52 + 1 file changed, 52 insertions(+) diff --git

[PATCH V2 2/4] drm/amdgpu: refine poison creation interrupt handler

2024-06-19 Thread YiPeng Chai
-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 37 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 18 insertions(+), 20 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH V2 3/4] drm/amdgpu: refine poison consumption interrupt handler

2024-06-19 Thread YiPeng Chai
1. The poison fifo is only used for poison consumption requests. 2. Merge reset requests when poison fifo caches multiple poison consumption messages Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 56 - drivers/gpu/drm/amd/amdgpu

[PATCH V2 1/4] drm/amdgpu: add variable to record the deferred error number read by driver

2024-06-19 Thread YiPeng Chai
Add variable to record the deferred error number read by driver. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 62 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 4 +- 3 files changed, 48

[PATCH 4/5] drm/amdgpu: add completion to wait for ras reset to complete

2024-06-18 Thread YiPeng Chai
Add completion to wait for ras reset to complete. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 11 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH 5/5] drm/amdgpu: add gpu reset check and exception handling

2024-06-18 Thread YiPeng Chai
Add gpu reset check and exception handling for page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 + 1 file changed, 43 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu

[PATCH 3/5] drm/amdgpu: refine poison consumption interrupt handler

2024-06-18 Thread YiPeng Chai
1. The poison fifo is only used for poison consumption requests. 2. Merge reset requests when poison fifo caches multiple poison consumption messages Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 58 + drivers/gpu/drm/amd/amdgpu

[PATCH 2/5] drm/amdgpu: refine poison creation interrupt handler

2024-06-18 Thread YiPeng Chai
-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 41 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 21 insertions(+), 21 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 1/5] drm/amdgpu: add variable to record the deferred error number read by driver

2024-06-18 Thread YiPeng Chai
Add variable to record the deferred error number read by driver. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 62 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 4 +- 3 files changed, 48

[PATCH 5/5] drm/amdgpu: add gpu reset check before page retirement thread runs

2024-06-12 Thread YiPeng Chai
If gpu is recovering, clear all message reset flags in fifo and wait for gpu to complete recovery. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 12 1 file changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH 2/5] drm/amdgpu: add threshold to interrupt waiting for DE data to be ready

2024-06-12 Thread YiPeng Chai
If the number of messages to be processed in the fifo exceeds the threshold, it will not continue to wait for the DE data to be ready. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 4 +++- 2 files changed

[PATCH 4/5] drm/amdgpu: wait for gpu to complete reset

2024-06-12 Thread YiPeng Chai
Add completion to wait for gpu to complete reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 12 drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 13 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH 3/5] drm/amdgpu: clear all messages reset flags in fifo before gpu reset

2024-06-12 Thread YiPeng Chai
To avoid resetting the gpu repeatedly, clear all message reset flags in the fifo before the first gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 59 - 1 file changed, 58 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd

[PATCH 1/5] drm/amdgpu: add condition check for waking up thread

2024-06-12 Thread YiPeng Chai
1. Cannot add messages to fifo in gpu reset mode. 2. Only when the message is successfully saved to the fifo, the thread can be awakened. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 16 ++-- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 18

[PATCH] drm/amdgpu: change log level

2024-05-07 Thread YiPeng Chai
Change log level. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 4 ++-- 2 files changed, 4 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

[PATCH] drm/amdgpu: add mutex to protect ras shared memory

2024-04-28 Thread YiPeng Chai
Add mutex to protect ras shared memory. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c| 121 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h| 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 2 + 3 files changed, 84 insertions(+), 40 deletions

[PATCH 2/2] drm/amdgpu: Remove redundant function call

2024-04-27 Thread YiPeng Chai
Remove redundant function call. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 22 ++ 1 file changed, 6 insertions(+), 16 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 1/2] drm/amdgpu: Remove unused code

2024-04-27 Thread YiPeng Chai
Remove unused code. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 69 -- 1 file changed, 69 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c index 8df84feaf046..12bae67be91c 100644

[PATCH V2] drm/amdgpu: Fix ras mode2 reset failure in ras aca mode

2024-04-23 Thread YiPeng Chai
Fix ras mode2 reset failure in ras aca mode. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index edb3cd0cef96..11a70991152c 100644

[PATCH] drm/amdgpu: Fix ras mode2 reset failure in ras aca mode

2024-04-22 Thread YiPeng Chai
Fix ras mode2 reset failure in ras aca mode for sdma v4_4_2 and gfx v9_4_3. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/gfx_v9_4_3.c | 4 drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c | 4 2 files changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v9_4_3.c

[PATCH 15/15] drm/amdgpu: Use new interface to reserve bad page

2024-04-17 Thread YiPeng Chai
Use new interface to reserve bad page. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index d1a2ab944b7d

[PATCH 14/15] drm/amdgpu: Fix address translation defect

2024-04-17 Thread YiPeng Chai
retired_page is page frame and should be expanded to the full address when querying status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd

[PATCH 13/15] drm/amdgpu: support ACA logging ecc errors

2024-04-17 Thread YiPeng Chai
support ACA logging ecc errors. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 5 + 1 file changed, 5 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c index bd917eb6ea24..8df84feaf046 100644 --- a/drivers

[PATCH 12/15] drm/amdgpu: add poison consumption handler

2024-04-17 Thread YiPeng Chai
Add poison consumption handler. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 ++--- 1 file changed, 39 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 11/15] drm/amdgpu: prepare to handle pasid poison consumption

2024-04-17 Thread YiPeng Chai
Prepare to handle pasid poison consumption. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.c| 9 - drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.h| 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 20 --- drivers/gpu/drm/amd/amdgpu

[PATCH 09/15] drm/amdgpu: add condition check for amdgpu_umc_fill_error_record

2024-04-17 Thread YiPeng Chai
Add condition check for amdgpu_umc_fill_error_record. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 20 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 2 +- 3 files changed, 19 insertions(+), 4 deletions

[PATCH 07/15] drm/amdgpu: umc v12_0 logs ecc errors

2024-04-17 Thread YiPeng Chai
1. umc v12_0 logs ecc errors. 2. Reserve newly detected ecc error pages. 3. Add tag for bad pages, so that they can be retired later. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 67 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 7

[PATCH 10/15] drm/amdgpu: retire bad pages for umc v12_0

2024-04-17 Thread YiPeng Chai
Retire bad pages for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 57 +- 1 file changed, 55 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c index

[PATCH 08/15] drm/amdgpu: Add delay work to retire bad pages

2024-04-17 Thread YiPeng Chai
Add delay work to retire bad pages. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 36 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 2 +- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 3 +++ 4 files

[PATCH 06/15] drm/amdgpu: umc v12_0 converts error address

2024-04-17 Thread YiPeng Chai
Umc v12_0 converts error address. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 94 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.h | 12 2 files changed, 105 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b

[PATCH 05/15] drm/amdgpu: add interface to update umc v12_0 ecc status

2024-04-17 Thread YiPeng Chai
Add interface to update umc v12_0 ecc status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 9 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 6 + drivers/gpu/drm/amd/amdgpu/umc_v12_0.c

[PATCH 04/15] drm/amdgpu: add poison creation handler

2024-04-17 Thread YiPeng Chai
Add poison creation handler. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 74 +++-- 1 file changed, 69 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 01/15] drm/amdgpu: Add interface to reserve bad page

2024-04-17 Thread YiPeng Chai
Add interface to reserve bad page. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 19 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 4 2 files changed, 23 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd

[PATCH 03/15] drm/amdgpu: prepare for logging ecc errors

2024-04-17 Thread YiPeng Chai
Prepare for logging ecc errors. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 33 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 23 + 2 files changed, 56 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b

[PATCH 02/15] drm/amdgpu: add message fifo to handle RAS poison events

2024-04-17 Thread YiPeng Chai
Add message fifo to handle RAS poison events. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 32 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 18 ++ 2 files changed, 50 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH V2] drm/amdgpu: Fix incorrect return value

2024-04-12 Thread YiPeng Chai
add new nodes for the addresses that are not in the reserved_pages list and reservations_pending list. V2: Avoid repeated locking/unlocking. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 25 +--- 1 file changed, 16 insertions(+), 9 deletions

[PATCH] drm/amdgpu: Fix incorrect return value

2024-04-03 Thread YiPeng Chai
add new nodes for the addresses that are not in the reserved_pages list and reservations_pending list. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 28 +--- 1 file changed, 19 insertions(+), 9 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: Need to resume ras during gpu reset for gfx v9_4_3 sriov

2024-01-30 Thread YiPeng Chai
Need to resume ras during gpu reset for gfx v9_4_3 sriov Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c index afc0b4eb7f8e

[PATCH] drm/amdgpu: Fix driver uninstallation problem

2024-01-23 Thread YiPeng Chai
/0x80 [ 484.496866] ? exc_page_fault+0x87/0x170 [ 484.496868] ? asm_exc_page_fault+0x8/0x30 [ 484.496871] entry_SYSCALL_64_after_hwframe+0x44/0xae Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git

[PATCH V2 3/5] drm/amdgpu: Use asynchronous polling to handle umc_v12_0 poisoning

2024-01-17 Thread YiPeng Chai
Use asynchronous polling to handle umc_v12_0 poisoning. v2: 1. Change function name. 2. Change the debugging information content. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 139 ++-- drivers

[PATCH V2 4/5] drm/amdgpu: add interface to check mca umc status

2024-01-17 Thread YiPeng Chai
Add interface to check mca umc status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.c | 12 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 4 +++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c| 20

[PATCH V2 5/5] drm/amdgpu:Support retiring multiple MCA error address pages

2024-01-17 Thread YiPeng Chai
Support retiring multiple MCA error address pages in one in-band query for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 8 ++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 66

[PATCH V2 2/5] drm/amdgpu: Prepare for asynchronous processing of umc page retirement

2024-01-17 Thread YiPeng Chai
Preparing for asynchronous processing of umc page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 34 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 5 2 files changed, 39 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH V2 1/5] drm/amdgpu: Add log info for umc_v12_0 and smu_v13_0_6

2024-01-17 Thread YiPeng Chai
Add log info for umc_v12_0 and smu_v13_0_6. v2: Delete redundant logs. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 11 +++ drivers/gpu/drm/amd/amdkfd/kfd_events.c | 6 +- 2 files changed, 16 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm

[PATCH 1/5] drm/amdgpu: Add log info for umc_v12_0 and smu_v13_0_6

2024-01-16 Thread YiPeng Chai
Add log info for umc_v12_0 and smu_v13_0_6. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 11 +++ drivers/gpu/drm/amd/amdkfd/kfd_events.c | 6 +- .../gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c| 13 + 3 files

[PATCH 2/5] drm/amdgpu: Prepare for asynchronous processing of umc page retirement

2024-01-16 Thread YiPeng Chai
Preparing for asynchronous processing of umc page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 34 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 5 2 files changed, 39 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH 5/5] drm/amdgpu:Support retiring multiple MCA error address pages

2024-01-16 Thread YiPeng Chai
Support retiring multiple MCA error address pages in one in-band query for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 8 ++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 66

[PATCH 4/5] drm/amdgpu: add interface to check mca umc status

2024-01-16 Thread YiPeng Chai
Add interface to check mca umc status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.c | 12 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 4 +++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c| 20

[PATCH 3/5] drm/amdgpu: Use asynchronous polling to handle umc_v12_0 poisoning

2024-01-16 Thread YiPeng Chai
Use asynchronous polling to handle umc_v12_0 poisoning. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 143 +++- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 3 + 3 files changed, 120 insertions(+), 31

[PATCH 1/4] drm/amdgpu: MCA supports recording umc address information

2023-12-18 Thread YiPeng Chai
MCA supports recording umc address information. V2: Move err_addr variable from struct ras_err_node to struct ras_err_info. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.c | 13 +++-- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 22

[PATCH 3/4] drm/amdgpu: Add umc page retirement for umc v12_0

2023-12-18 Thread YiPeng Chai
Add umc page retirement for umc v12_0. V2: 1. Changed umc page retirement check condition to call umc_v12_0_is_uncorrectable_error. 2. Use memset to clear the contents of the umc error address structure. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 56

[PATCH 4/4] drm/amd/pm: smu v13_0_6 supports ecc info by default

2023-12-18 Thread YiPeng Chai
smu v13_0_6 supports ecc info by default. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c | 8 1 file changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c b/drivers/gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c

[PATCH 2/4] drm/amdgpu: Add poison mode check error condition for umc v12_0

2023-12-18 Thread YiPeng Chai
Add poison mode check error condition for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c| 20 ++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.h| 4 ++-- .../drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c | 4 ++-- 3 files changed, 19

[PATCH] drm/amdgpu: support saving bad pages after gpu ras reset

2023-10-18 Thread YiPeng Chai
Support saving bad pages after gpu ras reset for umc_v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 40 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 35 ++ drivers/gpu/drm

[PATCH] drm/amdgpu: Enable ras for mp0 v13_0_6 sriov

2023-08-15 Thread YiPeng Chai
Enable ras for mp0 v13_0_6 sriov Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index 7689395e44fd..378478cf9c21 100644 --- a/drivers

[PATCH V2] drm/amdgpu: mode1 reset needs to recover mp1 for mp0 v13_0_10

2023-08-09 Thread YiPeng Chai
Mode1 reset needs to recover mp1 in fatal error case for mp0 v13_0_10. v2: Define a macro to wrap psp function calls. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h | 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 ++ drivers/gpu/drm/amd/amdgpu/psp_v13_0.c

[PATCH] drm/amdgpu: mode1 reset needs to recover mp1 for mp0 v13_0_10

2023-08-08 Thread YiPeng Chai
Mode1 reset needs to recover mp1 in fatal error case for mp0 v13_0_10. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 3 +++ drivers/gpu/drm/amd/amdgpu/psp_v13_0.c | 24 +++- 3 files changed, 27

[PATCH 2/2] drm/amdgpu: fix incorrect vmhub index

2023-07-19 Thread YiPeng Chai
Fix incorrect vmhub index. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c | 5 - 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c index d04fc0f19a29..c0b588e5d6aa 100644

[PATCH 1/2] drm/amdgpu: fix printing empty string array

2023-07-19 Thread YiPeng Chai
Fix printing empty string array. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c index c571f0d95994..d04fc0f19a29

[PATCH 2/2] drm/amdgpu: not update the same version ras ta

2023-07-19 Thread YiPeng Chai
not update the same version ras ta. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 20 +++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c index

[PATCH 1/2] drm/amdgpu: add ta initialization failure check condition

2023-07-19 Thread YiPeng Chai
Add ta initialization failure check condition. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c index

[PATCH] drm/amdgpu: gpu recovers from fatal error in poison mode

2023-06-25 Thread YiPeng Chai
Fatal error occurs in ras poison mode, mode1 reset is used to recover gpu. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 11 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: change reserved vram info print

2023-05-25 Thread YiPeng Chai
The link object of mgr->reserved_pages is the blocks variable in struct amdgpu_vram_reservation, not the link variable in struct drm_buddy_block. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 7 --- 1 file changed, 4 insertions(+), 3 deletions(-) diff --

[PATCH] drm/amdgpu: perform mode2 reset for sdma fed error on gfx v11_0_3

2023-05-16 Thread YiPeng Chai
perform mode2 reset for sdma fed error on gfx v11_0_3. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 8 +++- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 5 + drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c | 14 +- 3 files changed, 25 insertions(+), 2

[PATCH V2] amd/amdgpu: perform soft reset for sdma fed error

2023-05-15 Thread YiPeng Chai
When testing sdma ib ring fails to detect sdma hang for sdma fed error, force to perform soft reset. V2: Add poison mode support check for special code path. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 19 +++ 1 file changed, 19 insertions

[PATCH] amd/amdgpu: perform soft reset for sdma fed error

2023-05-12 Thread YiPeng Chai
When testing sdma ib ring fails to detect sdma hang for sdma fed error, force to perform soft reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 16 1 file changed, 16 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c b/drivers/gpu

[PATCH 3/3] drm/amdgpu: adjust gpu reset sequence for gfx v11_0_3

2023-04-27 Thread YiPeng Chai
When gfx ras poison consumption causes gpu reset on gfx v11_0_3, the sequence of gpu reset is "soft reset -> mode2 reset -> mode1 reset". If the previous reset fails, fall back to the next reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdg

[PATCH 1/3] drm/amdgpu: add variable to record gpu reset reason

2023-04-27 Thread YiPeng Chai
Add variable to record gpu reset reason. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +++ drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c | 6 +- 2 files changed, 8 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h b/drivers/gpu/drm

[PATCH 2/3] drm/amdgpu: don't output mes error message when gfx hang during gpu reset

2023-04-27 Thread YiPeng Chai
: recover vram bo from shadow done [ 390.931067] amdgpu :63:00.0: amdgpu: GPU reset(1) succeeded! Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 2 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 10 ++ drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: add gfx v11_0_3 fed irq handling for sriov

2023-04-11 Thread YiPeng Chai
Add gfx v11_0_3 fed irq handling for sriov. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c | 14 +++--- 1 file changed, 11 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c b/drivers/gpu/drm/amd

[PATCH 2/2] drm/amdgpu: optimize redundant code in umc_v6_7

2023-04-02 Thread YiPeng Chai
Optimize redundant code in umc_v6_7. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v6_7.c | 162 +++--- 1 file changed, 71 insertions(+), 91 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v6_7.c b/drivers/gpu/drm/amd/amdgpu/umc_v6_7.c index

[PATCH 1/2] drm/amdgpu: optimize redundant code in umc_v8_10

2023-04-02 Thread YiPeng Chai
Optimize redundant code in umc_v8_10 Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 31 drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 7 + drivers/gpu/drm/amd/amdgpu/umc_v8_10.c | 197 +--- 3 files changed, 115 insertions(+), 120 deletions

[PATCH 2/3] drm/amdgpu: reinit mes ip block during reset on SRIOV

2023-03-20 Thread YiPeng Chai
Reinit mes ip block during reset on SRIOV. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c index

[PATCH 3/3] drm/amdgpu: resume ras for gfx v11_0_3 during reset on SRIOV

2023-03-20 Thread YiPeng Chai
Gfx v11_0_3 supports ras on SRIOV, so need to resume ras during reset. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b

[PATCH 1/3] drm/amdgpu: enable ras for mp0 v13_0_10 on SRIOV

2023-03-20 Thread YiPeng Chai
Enable ras for mp0 v13_0_10 on SRIOV. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index 63dfcc98152d

[PATCH 2/2] drm/amdgpu: Optimize sdma ras block initialization code for sdma v4_0

2023-01-18 Thread YiPeng Chai
Optimize sdma ras block initialization code for sdma v4_0. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 21 + 1 file changed, 5 insertions(+), 16 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c b/drivers

[PATCH 1/2] drm/amdgpu: Add sdma ras function on sdma v6_0_3

2023-01-18 Thread YiPeng Chai
Add sdma ras function on sdma v6_0_3. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.c | 35 drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.h | 1 + drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 24 3 files changed

[PATCH 2/2] drm/amdgpu: Adjust ras support check condition for special asic

2023-01-12 Thread YiPeng Chai
that the ras block supports ras function. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 17 - 1 file changed, 16 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu

[PATCH 1/2] drm/amdgpu: Remove unnecessary ras block support check

2023-01-12 Thread YiPeng Chai
check. Even if the ras block checked is not in the ras list, it will return a null pointer and will have no effect. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 3 --- 1 file changed, 3 deletions(-) diff --git

[PATCH 5/5] drm/amdgpu: Perform gpu reset after gfx finishes processing ras poison consumption on gfx_v11_0_3

2023-01-12 Thread YiPeng Chai
, the .hw_ops null pointer check in amdgpu_ras_interrupt_poison_consumption_handler needs to be adjusted. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang Reviewed-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c | 9 + drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h | 4

[PATCH 4/5] drm/amdgpu: Add gfx cp ecc error irq handling on gfx v11_0_3

2023-01-12 Thread YiPeng Chai
V2: Optimize gfx_v11_0_set_cp_ecc_error_state function. V3: Define macro constant for me pipe instance address interval. V5: Register and handle gfx cp ecc error irq on gfx v11_0_3. V6: Remove invalid intermediate function call. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang

[PATCH 3/5] drm/amdgpu: Add gfx ras poison consumption irq handling on gfx v11_0_3

2023-01-12 Thread YiPeng Chai
Add gfx ras poison consumption irq handling on gfx v11_0_3. V2: Move ras poison consumption irq handling code of gfx v11_0_3 to gfx_v11_0_3.c. V5: Create dedicated irq handler for RLC_GC_FED_INTERRUPT. V6: Remove invalid function call. Signed-off-by: YiPeng Chai Reviewed-by: Hawking

[PATCH 2/5] amd/amdgpu: Add RLC_RLCS_FED_STATUS_* to gc v11_0_3 ip headers

2023-01-12 Thread YiPeng Chai
V2: Add RLC_RLCS_FED_STATUS_0 and RLC_RLCS_FED_STATUS_1 register offset and shift masks. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang Reviewed-by: Tao Zhou Reviewed-by: Alex Deucher --- .../include/asic_reg/gc/gc_11_0_3_offset.h| 8 +++ .../include/asic_reg/gc

[PATCH 1/5] drm/amdgpu: Add gfx ras function on gfx v11_0_3

2023-01-12 Thread YiPeng Chai
gfx_v11_0_3_ras_ops. V4: Revert changes in amdgpu_ras_interrupt_poison_consumption_handler. V5: 1. Remove invalid include file in gfx_v11_0_3.c. 2. Reduce the number of parameters of amdgpu_gfx_ras_sw_init. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang Reviewed-by: Tao Zhou --- drivers/gpu/drm

[PATCH] drm/amdgpu: Fixed bug on error when uninstalling amdgpu

2022-12-15 Thread YiPeng Chai
853540] __x64_sys_delete_module+0x142/0x260 [ 304.853548] ? exit_to_user_mode_prepare+0x3e/0x190 [ 304.853555] do_syscall_64+0x38/0x90 [ 304.853562] entry_SYSCALL_64_after_hwframe+0x63/0xcd Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 2 +- 1 file chan

[PATCH] drm/amdgpu: Enable mode-1 reset for RAS recovery in fatal error mode

2022-11-14 Thread YiPeng Chai
The patch is enabling mode-1 reset for RAS recovery in fatal error mode. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang Reviewed-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 4 drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c| 7 ++- 2 files changed, 10 insertions

[PATCH] drm/amdgpu: Fixed the problem that ras error can't be queried after gpu recovery is completed

2022-11-13 Thread YiPeng Chai
Amdgpu_ras_set_error_query_ready is called at the start of amdgpu_device_gpu_recover to disable query ras error, but the code behind only enables query ras error in full reset path, but not in soft reset path, emergency restart path and skip the hardware reset path. Signed-off-by: YiPeng Chai

[PATCH] drm/amdgpu: Add umc channel index mapping table for umc_v8_10

2022-11-13 Thread YiPeng Chai
Add umc channel index mapping table for umc_v8_10. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c | 5 - drivers/gpu/drm/amd/amdgpu/umc_v8_10.c | 10 ++ drivers/gpu/drm/amd/amdgpu/umc_v8_10.h | 4 3 files changed, 18

[PATCH] drm/amdgpu: Fixed ras warning when uninstalling amdgpu

2022-09-19 Thread YiPeng Chai
. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index e55f106621ef..3deb716710e6 100644 --- a/drivers/gpu/drm/amd/amdgpu

[PATCH V3 3/3] drm/amd/pm: Remove redundant check condition

2022-09-08 Thread YiPeng Chai
condition checks, so the first conditional check in amdgpu_pm_sysfs_fini can be removed. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/pm/amdgpu_pm.c | 3 --- 1 file changed, 3 deletions(-) diff --git a/drivers/gpu/drm/amd/pm/amdgpu_pm.c b/drivers/gpu/drm/amd/pm/amdgpu_pm.c index 5e318b3f6c0f

[PATCH V3 2/3] drm/amdgpu: Fixed psp fence and memory issues when removing amdgpu device

2022-09-08 Thread YiPeng Chai
pu_device_gpu_recover, then amdgpu_fill_buffer will not be called when psp_suspend is called. 2. Free psp ring memory in psp_sw_fini. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 5 - drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c| 1 + drivers/gpu/drm/am

  1   2   >