[PATCH] ALSA: hda/hdmi - fix atpx_present when CLASS is not VGA

2019-12-20 Thread Alex Deucher
You can't use PCI_BASE_CLASS with pci_get_class(). This happens to work by luck on devices with PCI_CLASS_DISPLAY_VGA, but misses PCI_CLASS_DISPLAY_OTHER. Add a check for those as well. Signed-off-by: Alex Deucher --- sound/pci/hda/hda_intel.c | 12 +++- 1 file changed, 11
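
To illustrate why this only works by luck, here is a small user-space model, not kernel code. It assumes that pci_get_class() compares the full 24-bit class code (base class, subclass, programming interface) and that the earlier code passed the base class shifted into the top byte. The MODEL_ constants and both helper functions are hypothetical stand-ins; the values mirror PCI_BASE_CLASS_DISPLAY, PCI_CLASS_DISPLAY_VGA and PCI_CLASS_DISPLAY_OTHER from include/linux/pci_ids.h.

```c
#include <stdbool.h>
#include <stdint.h>

/* Values mirror include/linux/pci_ids.h; the MODEL_ names are illustrative. */
#define MODEL_BASE_CLASS_DISPLAY  0x03u   /* base class only (8 bits)   */
#define MODEL_CLASS_DISPLAY_VGA   0x0300u /* base class + subclass 0x00 */
#define MODEL_CLASS_DISPLAY_OTHER 0x0380u /* base class + subclass 0x80 */

/* Exact match on the 24-bit class code, modelling what pci_get_class() does. */
static bool class_matches(uint32_t dev_class, uint32_t wanted)
{
    return dev_class == wanted;
}

/* Match on the base class alone, which is what the caller actually wanted. */
static bool base_class_matches(uint32_t dev_class, uint32_t base)
{
    return (dev_class >> 16) == base;
}
```

With these definitions, MODEL_BASE_CLASS_DISPLAY << 16 and MODEL_CLASS_DISPLAY_VGA << 8 are both 0x030000, so the exact match succeeds for VGA devices, while a PCI_CLASS_DISPLAY_OTHER device (0x038000) is only found by the base-class comparison.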

[PATCH] Revert "drm/amdgpu: simplify ATPX detection"

2019-12-20 Thread Alex Deucher
This reverts commit f5fda6d89afe6e9cedaa1c3303903c905262f6e8. You can't use BASE_CLASS in pci_get_class. Bug: https://gitlab.freedesktop.org/drm/amd/issues/995 Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_atpx_handler.c | 12 +++- 1 file changed, 11 insertions(+),

Re: [PATCH] drm/dp_mst: clear time slots for ports invalid

2019-12-20 Thread Lyude Paul
Mhh-I think I understand the problem you're trying to solve here but I think this solution might be a bit overkill. When I did the rework of topology references for ports, I made it so that we can guarantee memory access to a port without it needing to be a valid part of the topology. As well, all

Re: [PATCH 3/5] drm/amdgpu: GPU TLB flush API moved to amdgpu_amdkfd

2019-12-20 Thread Yong Zhao
On 2019-12-20 6:50 p.m., Yong Zhao wrote: Inline. On 2019-12-20 4:35 p.m., Felix Kuehling wrote: On 2019-12-20 1:24, Alex Sierra wrote: [Why] TLB flush method has been deprecated using kfd2kgd interface. This implementation is now on the amdgpu_amdkfd API. [How] TLB flush functions now

Re: [PATCH 2/5] drm/amdgpu: export function to flush TLB via pasid

2019-12-20 Thread Yong Zhao
On 2019-12-20 1:24 a.m., Alex Sierra wrote: This can be used directly from amdgpu and amdkfd to invalidate TLB through pasid. It supports gmc v7, v8, v9 and v10. Change-Id: I6563a8eba2e42d1a67fa2547156c20da41d1e490 Signed-off-by: Alex Sierra --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 6

Re: [PATCH 3/5] drm/amdgpu: GPU TLB flush API moved to amdgpu_amdkfd

2019-12-20 Thread Yong Zhao
Inline. On 2019-12-20 4:35 p.m., Felix Kuehling wrote: On 2019-12-20 1:24, Alex Sierra wrote: [Why] TLB flush method has been deprecated using kfd2kgd interface. This implementation is now on the amdgpu_amdkfd API. [How] TLB flush functions now implemented in amdgpu_amdkfd. Change-Id:

Re: [PATCH 1/5] drm/amdgpu: Avoid reclaim fs while eviction lock

2019-12-20 Thread Yong Zhao
One style comment inline. Yong On 2019-12-20 1:24 a.m., Alex Sierra wrote: [Why] Avoid reclaim filesystem while eviction lock is held called from MMU notifier. [How] Setting PF_MEMALLOC_NOFS flags while eviction mutex is locked. Using memalloc_nofs_save / memalloc_nofs_restore API.

Re: [PATCH v9 14/18] drm/amd/display: Add PBN per slot calculation for DSC

2019-12-20 Thread Leo
On 2019-12-13 3:08 p.m., mikita.lip...@amd.com wrote: > From: Mikita Lipski > > [why] > Need to calculate VCPI slots differently for DSC > to take in account current link rate, link count > and FEC. > [how] > Add helper to get pbn_div from dc_link > > Cc: Harry Wentland > Cc: Lyude Paul >

Re: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread shaoyunl
I agree this patch is a big improvement. I think we need this patch so SRIOV can put the amdkfd_pre_reset in the right place, as in bare metal mode. Further improvements can be made in a separate change. This series is reviewed by shaoyun.liu <shaoyun@amd.com> Regards shaoyun.liu On

Re: [PATCH 1/2] drm/amdgpu: update the method to get fb_loc of memory training(V4)

2019-12-20 Thread Luben Tuikov
On 2019-12-20 10:27 a.m., Alex Deucher wrote: > On Thu, Dec 19, 2019 at 9:00 PM Yin, Tianci (Rico) wrote: >> >> [AMD Official Use Only - Internal Distribution Only] >> >> >> Hi Luben, >> >> May I have your Review-by? >> If you'd like--it's completely up to you. If you choose to, like Alex's

Re: [PATCH v9 10/18] drm/dp_mst: Manually overwrite PBN divider for calculating timeslots

2019-12-20 Thread Lyude Paul
Reviewed-by: Lyude Paul On Fri, 2019-12-13 at 15:08 -0500, mikita.lip...@amd.com wrote: > From: Mikita Lipski > > [why] > For DSC case we cannot use topology manager's PBN divider > variable. The default divider does not take FEC into account. > Therefore the driver has to calculate its own

Re: [PATCH v9 14/18] drm/amd/display: Add PBN per slot calculation for DSC

2019-12-20 Thread Lyude Paul
Acked-by: Lyude Paul On Fri, 2019-12-13 at 15:08 -0500, mikita.lip...@amd.com wrote: > From: Mikita Lipski > > [why] > Need to calculate VCPI slots differently for DSC > to take in account current link rate, link count > and FEC. > [how] > Add helper to get pbn_div from dc_link > > Cc: Harry

Re: [PATCH v9 15/18] drm/amd/display: MST DSC compute fair share

2019-12-20 Thread Lyude Paul
Acked-by: Lyude Paul On Fri, 2019-12-13 at 15:08 -0500, mikita.lip...@amd.com wrote: > From: David Francis > > If there is limited link bandwidth on a MST network, > it must be divided fairly between the streams on that network > > Implement an algorithm to determine the correct DSC config >

Re: [PATCH v9 16/18] drm/amd/display: Recalculate VCPI slots for new DSC connectors

2019-12-20 Thread Lyude Paul
So I reviewed this already but realized I made a very silly mistake, comments down below: On Fri, 2019-12-13 at 15:08 -0500, mikita.lip...@amd.com wrote: > From: Mikita Lipski > > [why] > Since for DSC MST connector's PBN is calculated differently > due to compression, we have to recalculate

Re: [PATCH 5/5] drm/amdgpu: invalidate BO during userptr mapping

2019-12-20 Thread Felix Kuehling
I think this patch is just a proof of concept for now. It should not be submitted because there are still some known locking issues that need to be solved, and we don't have the code yet that handles the recoverable page faults resulting from this. Regards, Felix On 2019-12-20 1:24, Alex

Re: [PATCH 4/5] drm/amdgpu: flush TLB functions removal from kfd2kgd interface

2019-12-20 Thread Felix Kuehling
On 2019-12-20 1:24, Alex Sierra wrote: [Why] kfd2kgd interface will be deprecated. This removal only covers TLB invalidation for now. They have been replaced in amdgpu_amdkfd API. [How] TLB invalidate functions removed from the different amdkfd_gfx_v* versions. Change-Id:

Re: [PATCH 1/5] drm/amdgpu: Avoid reclaim fs while eviction lock

2019-12-20 Thread Felix Kuehling
On 2019-12-20 1:24, Alex Sierra wrote: [Why] Avoid reclaim filesystem while eviction lock is held called from MMU notifier. [How] Setting PF_MEMALLOC_NOFS flags while eviction mutex is locked. Using memalloc_nofs_save / memalloc_nofs_restore API. Change-Id:

Re: [PATCH v9 01/18] drm/dp_mst: Add PBN calculation for DSC modes

2019-12-20 Thread Lyude Paul
Actually, one comment on this that should be very simple to add On Fri, 2019-12-13 at 15:08 -0500, mikita.lip...@amd.com wrote: > From: David Francis > > With DSC, bpp can be fractional in multiples of 1/16. > > Change drm_dp_calc_pbn_mode to reflect this, adding a new > parameter bool dsc.
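
One way to picture a bpp expressed in sixteenths is the sketch below. It is a user-space model, not the patch's code: pbn_from_bpp_x16() is a hypothetical helper, and the constants (a PBN unit of 54/64 MB/s and a 0.6% margin, i.e. a factor of 1.006) are assumptions taken from the usual DisplayPort MST bandwidth convention rather than from the excerpt above.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical helper: PBN for a pixel clock in kHz and a bits-per-pixel
 * value given in units of 1/16 bpp (so 24 bpp is passed as 24 * 16).
 * Assumes 1 PBN = 54/64 MB/s and a 1.006 margin factor; rounds up.
 */
static uint64_t pbn_from_bpp_x16(uint64_t clock_khz, uint64_t bpp_x16)
{
    /* Keep the 64-bit intermediate product well clear of overflow. */
    assert(clock_khz <= 100000000ull && bpp_x16 <= 16ull * 64);

    /* kHz * bits -> kbit/s; * 64/54 for PBN units; * 1006/1000 margin. */
    uint64_t num = clock_khz * bpp_x16 * 64 * 1006;
    /* 16 undoes the x16 scaling, 8 is bits per byte, 1000 is kB/s -> MB/s. */
    uint64_t den = 16ull * 8 * 54 * 1000 * 1000;

    return (num + den - 1) / den;
}
```

Keeping bpp in sixteenths until the final division avoids truncating the fractional part before the rounding step.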

Re: [PATCH 3/5] drm/amdgpu: GPU TLB flush API moved to amdgpu_amdkfd

2019-12-20 Thread Felix Kuehling
On 2019-12-20 1:24, Alex Sierra wrote: [Why] TLB flush method has been deprecated using kfd2kgd interface. This implementation is now on the amdgpu_amdkfd API. [How] TLB flush functions now implemented in amdgpu_amdkfd. Change-Id: Ic51cccdfe6e71288d78da772b6e1b6ced72f8ef7 Signed-off-by: Alex

Re: [PATCH 2/5] drm/amdgpu: export function to flush TLB via pasid

2019-12-20 Thread Felix Kuehling
On 2019-12-20 1:24, Alex Sierra wrote: This can be used directly from amdgpu and amdkfd to invalidate TLB through pasid. It supports gmc v7, v8, v9 and v10. Two small corrections inline to make the behaviour between KIQ and MMIO-based flushing consistent. Looks good otherwise. Change-Id:

[PATCH 1/2] drm/amdgpu/smu/navi: Adjust default behavior for peak sclk profile

2019-12-20 Thread Alex Deucher
Fetch the sclk from the pptable if there is no specified sclk for the board. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/powerplay/navi10_ppt.c | 10 -- 1 file changed, 8 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/powerplay/navi10_ppt.c

[PATCH 2/2] drm/amdgpu/smu: add peak profile support for navi12

2019-12-20 Thread Alex Deucher
Add defined peak sclk for navi12 peak profile mode. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/powerplay/navi10_ppt.c | 3 +++ drivers/gpu/drm/amd/powerplay/navi10_ppt.h | 2 ++ 2 files changed, 5 insertions(+) diff --git a/drivers/gpu/drm/amd/powerplay/navi10_ppt.c

[PATCH] drm/amdgpu/smu/navi: Adjust default behavior for peak sclk profile

2019-12-20 Thread Alex Deucher
Fetch the sclk from the pptable if there is no specified sclk for the board. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/powerplay/navi10_ppt.c | 10 -- 1 file changed, 8 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/powerplay/navi10_ppt.c

Re: ✗ Fi.CI.BAT: failure for drm: Add support for DP 1.4 Compliance edid corruption test (rev3)

2019-12-20 Thread Harry Wentland
On 2019-12-20 2:40 p.m., Zuo, Jerry wrote: > Hi All: > > I got CI check failures. Among those, hdmi-hpd-fast seems related, but I > am not sure why. Please take a brief review and help to determine if it is a > real false-positive again. > It looks like the hdmi-hpd-fast failures are

Re: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Felix Kuehling
On 2019-12-20 14:31, shaoyunl wrote: Can we use the dqm_lock when we try to get the dqm->is_hw_hang and dqm->is_resetting inside function kq_uninitialize ? Spreading the DQM lock around is probably not a good idea. Then I'd rather do more refactoring to move hqd_load and hqd_destroy out of

Re: [PATCH v2] drm/amd/display: Reduce HDMI pixel encoding if max clock is exceeded

2019-12-20 Thread Alex Deucher
On Fri, Dec 20, 2019 at 10:10 AM Tom Anderson wrote: > > Ping. Is there any action required to get this landed? Looks good to me, but I'd like to hear from the display guys. Alex > > On Tue, Dec 10, 2019 at 10:59:24AM -0800, Tom Anderson wrote: > > Friendly ping. > > > > On Mon, Dec 02, 2019

RE: ✗ Fi.CI.BAT: failure for drm: Add support for DP 1.4 Compliance edid corruption test (rev3)

2019-12-20 Thread Zuo, Jerry
Hi All: I got CI check failures. Among those, hdmi-hpd-fast seems related, but I am not sure why. Please take a brief review and help to determine if it is a real false-positive again. Thanks a lot. Regards, Jerry -Original Message- From: Patchwork Sent: December 9, 2019

Re: [PATCH] drm/amdgpu: Remove unneeded variable 'ret' in navi10_ih.c

2019-12-20 Thread Alex Deucher
On Fri, Dec 20, 2019 at 10:10 AM Ma Feng wrote: > > Fixes coccicheck warning: > > drivers/gpu/drm/amd/amdgpu/navi10_ih.c:113:5-8: Unneeded variable: "ret". > Return "0" on line 182 > > Reported-by: Hulk Robot > Signed-off-by: Ma Feng Applied. Thanks! Alex > --- >

Re: [PATCH] drm/amdgpu: Remove unneeded variable 'ret' in amdgpu_device.c

2019-12-20 Thread Alex Deucher
On Fri, Dec 20, 2019 at 10:10 AM Ma Feng wrote: > > Fixes coccicheck warning: > > drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:1036:5-8: Unneeded variable: > "ret". Return "0" on line 1079 > > Reported-by: Hulk Robot > Signed-off-by: Ma Feng Applied. thanks! Alex > --- >

Re: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread shaoyunl
Can we use the dqm_lock when we try to get the dqm->is_hw_hang and dqm->is_resetting inside function kq_uninitialize? I think the closer to hqd_destroy we check the status, the more accurate it will be. It does look better with this logic if the status is changed after the dqm unmap_queue call

Re: [PATCH 3/4] drm/amdkfd: Improve HWS hang detection and handling

2019-12-20 Thread Felix Kuehling
On 2019-12-20 12:18, Zeng, Oak wrote: [AMD Official Use Only - Internal Distribution Only] With this improvement, it is still possible for two resets to be scheduled. There is a period of time after HWS hang and before kfd pre-reset is called, during which, if a thread already passed the

Re: [PATCH] drm/dp_mst: clear time slots for ports invalid

2019-12-20 Thread Lyude Paul
Hi! I will try to review this patch today, must have gotten lost in the noise On Fri, 2019-12-20 at 01:46 +, Lin, Wayne wrote: > [AMD Official Use Only - Internal Distribution Only] > > Pinged. > Hi, can someone help to review please. > > Thanks a lot. > > Regards, > Wayne > >

RE: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Zeng, Oak
[AMD Official Use Only - Internal Distribution Only] I see. Thank you Felix for the explanation. Regards, Oak -Original Message- From: Kuehling, Felix Sent: Friday, December 20, 2019 12:28 PM To: Zeng, Oak ; amd-gfx@lists.freedesktop.org Subject: Re: [PATCH 4/4] drm/amdkfd: Avoid

Re: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Felix Kuehling
On 2019-12-20 12:22, Zeng, Oak wrote: [AMD Official Use Only - Internal Distribution Only] Regards, Oak -Original Message- From: amd-gfx On Behalf Of Felix Kuehling Sent: Friday, December 20, 2019 3:30 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH 4/4] drm/amdkfd: Avoid

RE: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Zeng, Oak
[AMD Official Use Only - Internal Distribution Only] Regards, Oak -Original Message- From: amd-gfx On Behalf Of Felix Kuehling Sent: Friday, December 20, 2019 3:30 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch Don't use

RE: [PATCH 3/4] drm/amdkfd: Improve HWS hang detection and handling

2019-12-20 Thread Zeng, Oak
[AMD Official Use Only - Internal Distribution Only] With this improvement, it is still possible for two resets to be scheduled. There is a period of time after HWS hang and before kfd pre-reset is called, during which, if a thread already passed the is_hws_hang check but was scheduled out, then

Re: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Felix Kuehling
dqm->is_hws_hang is protected by the DQM lock. kq_uninitialize runs outside that lock protection. Therefore I opted to pass in the hanging flag as a parameter. It also keeps the logic that decides all of that inside the device queue manager, which I think is cleaner. I was trying to clean

Re: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread shaoyunl
Looks like patch 2 is not related to this series, but anyway. Patches 1, 2 and 3 are reviewed by shaoyunl. For patch 4, is it possible to directly check dqm->is_hws_hang || dqm->is_resetting inside function kq_uninitialize, so we don't need any other interface change? I think even inside that

Re: [PATCH 1/2] drm/amdgpu: update the method to get fb_loc of memory training(V4)

2019-12-20 Thread Alex Deucher
On Thu, Dec 19, 2019 at 9:00 PM Yin, Tianci (Rico) wrote: > > [AMD Official Use Only - Internal Distribution Only] > > > Hi Luben, > > May I have your Review-by? > Series is: Reviewed-by: Alex Deucher > Thanks a lot! > Rico > > From: Tuikov, Luben > Sent:

[PATCH] drm/amdgpu: Remove unneeded variable 'ret' in navi10_ih.c

2019-12-20 Thread Ma Feng
Fixes coccicheck warning: drivers/gpu/drm/amd/amdgpu/navi10_ih.c:113:5-8: Unneeded variable: "ret". Return "0" on line 182 Reported-by: Hulk Robot Signed-off-by: Ma Feng --- drivers/gpu/drm/amd/amdgpu/navi10_ih.c | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git

[PATCH] drm/amdgpu: Remove unneeded variable 'ret' in amdgpu_device.c

2019-12-20 Thread Ma Feng
Fixes coccicheck warning: drivers/gpu/drm/amd/amdgpu/amdgpu_device.c:1036:5-8: Unneeded variable: "ret". Return "0" on line 1079 Reported-by: Hulk Robot Signed-off-by: Ma Feng --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git

[PATCH ] drm/radeon: Fix potential buffer overflow in ci_set_mc_special_registers()

2019-12-20 Thread chenmaodong
Hi, The length of table->mc_reg_address is SMU7_DISCRETE_MC_REGISTER_ARRAY_SIZE. In ci_set_mc_special_registers(), the boundary check here ("if (j > SMU7_DISCRETE_MC_REGISTER_ARRAY_SIZE)") allows 'j' to equal SMU7_DISCRETE_MC_REGISTER_ARRAY_SIZE, which can easily cause the
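
The off-by-one described here can be sketched in isolation. This is a minimal user-space model, not the radeon code: MODEL_ARRAY_SIZE is a hypothetical stand-in for SMU7_DISCRETE_MC_REGISTER_ARRAY_SIZE (its value is arbitrary), and the three helpers are invented for illustration. Using ">=" as the corrected comparison is the conventional fix for this pattern and is an assumption about the patch, whose diff is not shown above.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical size; stands in for SMU7_DISCRETE_MC_REGISTER_ARRAY_SIZE. */
#define MODEL_ARRAY_SIZE 16

/* The check as quoted: rejects only j > size, so j == size slips through. */
static bool loose_check_rejects(size_t j)
{
    return j > MODEL_ARRAY_SIZE;
}

/* The conventional check: valid indices are 0 .. size - 1. */
static bool strict_check_rejects(size_t j)
{
    return j >= MODEL_ARRAY_SIZE;
}

/* Store guarded by the strict check; returns -1 instead of writing past the end. */
static int checked_store(unsigned short *table, size_t j, unsigned short value)
{
    if (strict_check_rejects(j))
        return -1;
    table[j] = value;
    return 0;
}
```

An index equal to MODEL_ARRAY_SIZE passes the loose check but would write one element past the end of the table; the strict check rejects it.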

[PATCH AUTOSEL 4.19 03/34] drm/amdgpu: add check before enabling/disabling broadcast mode

2019-12-20 Thread Sasha Levin
From: Guchun Chen [ Upstream commit 6e807535dae5dbbd53bcc5e81047a20bf5eb08ea ] When a security violation from the new vbios happens, the data fabric risks no longer working. So prevent direct access to DF mmFabricConfigAccessControl from the new vbios onwards. Signed-off-by: Guchun Chen

[PATCH AUTOSEL 4.19 04/34] drm/amdgpu: add cache flush workaround to gfx8 emit_fence

2019-12-20 Thread Sasha Levin
From: Pierre-Eric Pelloux-Prayer [ Upstream commit bf26da927a1cd57c9deb2db29ae8cf276ba8b17b ] The same workaround is used for gfx7. Both PAL and Mesa use it for gfx8 too, so port this commit to gfx_v8_0_ring_emit_fence_gfx. Signed-off-by: Pierre-Eric Pelloux-Prayer Reviewed-by: Alex Deucher

[PATCH AUTOSEL 4.19 05/34] drm/amd/display: Fixed kernel panic when booting with DP-to-HDMI dongle

2019-12-20 Thread Sasha Levin
From: David Galiffi [ Upstream commit a51d9f8fe756beac51ce26ef54195da00a260d13 ] [Why] In dc_link_is_dp_sink_present, if dal_ddc_open fails, then dal_gpio_destroy_ddc is called, destroying pin_data and pin_clock. They are created only on dc_construct, and next aux access will cause a panic.

[PATCH AUTOSEL 5.4 09/52] drm/amd/display: Change the delay time before enabling FEC

2019-12-20 Thread Sasha Levin
From: "Leo (Hanghong) Ma" [ Upstream commit 28fa24ad14e8f7d23c62283eaf9c79b4fd165c16 ] [why] DP spec requires a delay of 1000 symbols between the end of link training and enabling FEC in the stream. Currently we are using a 1 millisecond delay, which is not accurate. [how] One lane RBR should have the
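
For a sense of scale, the time needed to send 1000 symbols can be worked out directly. The sketch below is illustrative only: symbols_to_ns() is a hypothetical helper, and the figures used with it (RBR at 1.62 Gbit/s per lane, 10 bits per symbol under 8b/10b encoding) are assumptions that do not appear in the excerpt above.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical helper: time in nanoseconds, rounded up, to transmit
 * `symbols` link symbols on one lane running at `lane_bit_rate_hz`.
 */
static uint64_t symbols_to_ns(uint64_t symbols, uint64_t lane_bit_rate_hz,
                              unsigned bits_per_symbol)
{
    assert(lane_bit_rate_hz > 0);

    uint64_t bits = symbols * bits_per_symbol;

    /* bits / rate gives seconds; scale to ns first and round up. */
    return (bits * 1000000000ull + lane_bit_rate_hz - 1) / lane_bit_rate_hz;
}
```

Under those assumptions, symbols_to_ns(1000, 1620000000ull, 10) gives 6173 ns, roughly 6.2 microseconds, which is far shorter than a 1 millisecond delay.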

[PATCH AUTOSEL 5.4 11/52] drm/amd/display: update dispclk and dppclk vco frequency

2019-12-20 Thread Sasha Levin
From: Eric Yang [ Upstream commit 44ce6c3dc8479bb3ed68df13b502b0901675e7d6 ] The value obtained from DV does not allow the 8k60 CTA mode with DSC to pass. After checking the real value being used in hw, we found that the correct value is 3600, which allows that mode. Signed-off-by: Eric Yang

[PATCH AUTOSEL 5.4 05/52] drm/amdgpu: add header line for power profile on Arcturus

2019-12-20 Thread Sasha Levin
From: Alex Deucher [ Upstream commit 14891c316ca7e15d81dba78f30fb630e3f9ee2c9 ] So the output is consistent with other asics. Reviewed-by: Evan Quan Signed-off-by: Alex Deucher Signed-off-by: Sasha Levin --- drivers/gpu/drm/amd/powerplay/arcturus_ppt.c | 5 + 1 file changed, 5

[PATCH AUTOSEL 5.4 07/52] drm/amd/display: Map DSC resources 1-to-1 if numbers of OPPs and DSCs are equal

2019-12-20 Thread Sasha Levin
From: Nikola Cornij [ Upstream commit a1fc44b609b4e9c0941f0e4a1fc69d367af5ab69 ] [why] On ASICs where the number of DSCs is the same as the number of OPPs there's no need for DSC resource management. Mapping 1-to-1 fixes mode-set- or S3-related issues for such platforms. [how] Map DSC resources 1-to-1 to

[PATCH AUTOSEL 5.4 10/52] drm/amd/display: Reset steer fifo before unblanking the stream

2019-12-20 Thread Sasha Levin
From: Nikola Cornij [ Upstream commit 87de6cb2f28153bc74d0a001ca099c29453e145f ] [why] During mode transition steer fifo could overflow. Quite often it recovers by itself, but sometimes it doesn't. [how] Add steer fifo reset before unblanking the stream. Also add a short delay when resetting

[PATCH AUTOSEL 5.4 04/52] drm/amdgpu: add check before enabling/disabling broadcast mode

2019-12-20 Thread Sasha Levin
From: Guchun Chen [ Upstream commit 6e807535dae5dbbd53bcc5e81047a20bf5eb08ea ] When a security violation from the new vbios happens, the data fabric risks no longer working. So prevent direct access to DF mmFabricConfigAccessControl from the new vbios onwards. Signed-off-by: Guchun Chen

[PATCH AUTOSEL 5.4 08/52] drm/amd/display: Fixed kernel panic when booting with DP-to-HDMI dongle

2019-12-20 Thread Sasha Levin
From: David Galiffi [ Upstream commit a51d9f8fe756beac51ce26ef54195da00a260d13 ] [Why] In dc_link_is_dp_sink_present, if dal_ddc_open fails, then dal_gpio_destroy_ddc is called, destroying pin_data and pin_clock. They are created only on dc_construct, and next aux access will cause a panic.

[PATCH AUTOSEL 5.4 06/52] drm/amdgpu: add cache flush workaround to gfx8 emit_fence

2019-12-20 Thread Sasha Levin
From: Pierre-Eric Pelloux-Prayer [ Upstream commit bf26da927a1cd57c9deb2db29ae8cf276ba8b17b ] The same workaround is used for gfx7. Both PAL and Mesa use it for gfx8 too, so port this commit to gfx_v8_0_ring_emit_fence_gfx. Signed-off-by: Pierre-Eric Pelloux-Prayer Reviewed-by: Alex Deucher

RE: [PATCH 1/2] drm/amdgpu: remove FB location config for sriov

2019-12-20 Thread Deng, Emily
[AMD Official Use Only - Internal Distribution Only] Reviewed-by: Emily Deng Best wishes Emily Deng >-Original Message- >From: amd-gfx On Behalf Of >Frank.Min >Sent: Thursday, December 19, 2019 7:44 PM >To: amd-gfx@lists.freedesktop.org >Cc: Min, Frank >Subject: [PATCH 1/2]

RE: [PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Deng, Emily
[AMD Official Use Only - Internal Distribution Only] Series Tested-by: Emily Deng on sriov environment with vega10 for the TDR-1, TDR-2 and TDR-3 test cases. Best wishes Emily Deng >-Original Message- >From: amd-gfx On Behalf Of Felix >Kuehling >Sent: Friday, December 20, 2019 4:30

[PATCH 3/4] drm/amdkfd: Improve HWS hang detection and handling

2019-12-20 Thread Felix Kuehling
Move HWS hang detection into unmap_queues_cpsch to catch hangs in all cases. If this happens during a reset, don't schedule another reset because the reset already in progress is expected to take care of it. Signed-off-by: Felix Kuehling --- drivers/gpu/drm/amd/amdkfd/kfd_device.c | 3

[PATCH 1/4] drm/amdkfd: Fix permissions of hang_hws

2019-12-20 Thread Felix Kuehling
Reading from /sys/kernel/debug/kfd/hang_hws would cause a kernel oops because we didn't implement a read callback. Set the permission to write-only to prevent that. Signed-off-by: Felix Kuehling --- drivers/gpu/drm/amd/amdkfd/kfd_debugfs.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-)

[PATCH 4/4] drm/amdkfd: Avoid hanging hardware in stop_cpsch

2019-12-20 Thread Felix Kuehling
Don't use the HWS if it's known to be hanging. In a reset also don't try to destroy the HIQ because that may hang on SRIOV if the KIQ is unresponsive. Signed-off-by: Felix Kuehling --- .../gpu/drm/amd/amdkfd/kfd_device_queue_manager.c| 12

[PATCH 2/4] drm/amdkfd: Remove unused variable

2019-12-20 Thread Felix Kuehling
dqm->pipeline_mem wasn't used anywhere. Signed-off-by: Felix Kuehling --- drivers/gpu/drm/amd/amdkfd/kfd_device_queue_manager.c | 1 - drivers/gpu/drm/amd/amdkfd/kfd_device_queue_manager.h | 1 - 2 files changed, 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_device_queue_manager.c

RE: [PATCH 1/1] drm/amdkfd: Don't touch the hardware in pre_reset callback

2019-12-20 Thread Liu, Monk
Okay. Can you send me the patch (as an attachment) once you have finished it? I need to verify it on SRIOV. Thanks _ Monk Liu|GPU Virtualization Team |AMD -Original Message- From: Kuehling, Felix Sent: Friday, December 20, 2019 3:56 PM To: Liu, Monk ;