[PATCH v8] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did not signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is
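
The fallback described in this patch summary can be sketched in standalone C. This is only an illustration of the decision, not driver code: `pick_compute_timeout`, `TIMEOUT_INFINITE`, `TIMEOUT_UNSET`, and the `virtualized` flag are hypothetical names, and the sketch assumes the timeout is a millisecond count with "infinite" encoded as a sentinel.

```c
#include <limits.h>
#include <stdbool.h>

/* Hypothetical sentinel meaning "never time out". */
#define TIMEOUT_INFINITE LONG_MAX
/* Hypothetical marker meaning "the user did not set a value". */
#define TIMEOUT_UNSET (-1L)

/*
 * Choose the compute ring timeout in milliseconds.
 * - A user-supplied value always wins.
 * - Otherwise, under SR-IOV or passthrough (virtualized == true), fall
 *   back to the gfx ring timeout so a stuck compute fence is detected.
 * - Otherwise keep the infinite default.
 */
long pick_compute_timeout(long user_compute_ms, long gfx_ms, bool virtualized)
{
    if (user_compute_ms != TIMEOUT_UNSET)
        return user_compute_ms;
    if (virtualized)
        return gfx_ms;
    return TIMEOUT_INFINITE;
}
```

With these assumptions, an unset compute timeout on a virtualized device inherits the gfx value, while bare metal keeps the infinite default.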

RE: [PATCH] drm/amdgpu: remove gfx9 NGG

2019-09-19 Thread Yan, Alex
These registers are used in GFX9 NGG and the addresses are managed by KMD; GFX10 has changed the design; UMD does not need to care about these values any more. I can confirm UMD does not read these values from KMD. Alex -Original Message- From: Zhou, David(ChunMing) Sent: Friday, Sept

RE: [PATCH] drm/amdgpu: remove gfx9 NGG

2019-09-19 Thread Zhou, David(ChunMing)
+Alex Yan to confirm which doesn't affect us. -Original Message- From: amd-gfx On Behalf Of Marek Olšák Sent: Friday, September 20, 2019 10:16 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH] drm/amdgpu: remove gfx9 NGG From: Marek Olšák Never used. Signed-off-by: Marek Olšák

[PATCH v7] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did not signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is

[PATCH v6] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is a t

[PATCH] drm/amdgpu: remove gfx9 NGG

2019-09-19 Thread Marek Olšák
From: Marek Olšák Never used. Signed-off-by: Marek Olšák --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 5 - drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 41 - drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h | 25 --- drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c | 11 -- drivers/gpu/drm/amd/amdgpu/gfx

Re: [PATCH 15/15] drm/amd/display: Trigger modesets on MST DSC connectors

2019-09-19 Thread Lyude Paul
This still needs to be moved into an atomic helper so it can be reused by other drivers ... ... however, I've had this patch series on my mind for a while and something occurred to me that would be a lot easier. Why exactly are we not just enabling DSC wherever it's supported, regardless of whethe

Re: [PATCH 09/15] drm/dp_mst: Add new quirk for Synaptics MST hubs

2019-09-19 Thread Lyude Paul
Great work! Reviewed-by: Lyude Paul On Wed, 2019-09-18 at 16:26 -0400, mikita.lip...@amd.com wrote: > From: David Francis > > Synaptics DP1.4 hubs (BRANCH_ID 0x90CC24) do not > support virtual DPCD registers, but do support DSC. > The DSC caps can be read from the physical aux, > like in SST D

Re: [PATCH 08/15] drm/dp_mst: Add helpers for MST DSC and virtual DPCD aux

2019-09-19 Thread Lyude Paul
Reviewed-by: Lyude Paul On Wed, 2019-09-18 at 16:26 -0400, mikita.lip...@amd.com wrote: > From: David Francis > > Add drm_dp_mst_dsc_aux_for_port. To enable DSC, the DSC_ENABLED > register might have to be written on the leaf port's DPCD, > its parent's DPCD, or the MST manager's DPCD. This fun

Re: [PATCH 03/15] drm/amdgpu: validate mst topology in atomic check

2019-09-19 Thread Lyude Paul
This also needs to be squashed into the previous two patches. There's no point in using drm_dp_atomic_find_vcpi_slots() or drm_dp_atomic_release_vcpi_slots() without drm_dp_mst_atomic_check(), since the VCPI allocations setup by the two functions aren't validated until then. On Wed, 2019-09-18 at

Re: [PATCH 02/15] drm/amdgpu: Add connector atomic check

2019-09-19 Thread Lyude Paul
On Wed, 2019-09-18 at 16:26 -0400, mikita.lip...@amd.com wrote: > From: Mikita Lipski > > [why] > Complying with new MST atomic check requirements. > The driver needs to call this function on every > atomic check to reset the VCPI slots if new state > disables > [how] > - Verify that it is a MST

Re: [PATCH 01/15] drm/amdgpu: Add encoder atomic check

2019-09-19 Thread Lyude Paul
Ok, so reviewing this is kind of difficult because this series doesn't apply to drm-tip, and also doesn't make any mention of what branch it's supposed to apply to. So there's no way for me to apply any of these changes in my tree to get an idea of how things look overall with these patches applied

Re: [PATCH v2] drm/amd/display: fix struct init in update_bounding_box

2019-09-19 Thread Alex Deucher
On Wed, Aug 28, 2019 at 2:51 PM Raul E Rangel wrote: > > dcn20_resource.c:2636:9: error: missing braces around initializer > [-Werror=missing-braces] > struct _vcs_dpi_voltage_scaling_st > calculated_states[MAX_CLOCK_LIMIT_STATES] = {0}; > ^ > Fixes: 7ed4e6352c16f ("drm/amd/display: A
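
The quoted compiler error comes from `-Wmissing-braces`, which GCC can raise when an array of structs is initialized with `= {0}`, because the braces of the inner struct are elided. The preview is truncated, so the fix the patch actually chose is not visible here. One common way to sidestep the warning is to zero the array explicitly. A minimal sketch, where `struct voltage_state`, its fields, and `MAX_STATES` are hypothetical stand-ins for the real `_vcs_dpi_voltage_scaling_st` and `MAX_CLOCK_LIMIT_STATES`:

```c
#include <string.h>

#define MAX_STATES 8 /* hypothetical stand-in for MAX_CLOCK_LIMIT_STATES */

/* Hypothetical struct; the real one has different fields. */
struct voltage_state {
    int state;
    double dcfclk_mhz;
    double fabricclk_mhz;
};

/*
 * Zero an array of structs without a brace initializer, then return the
 * sum of the 'state' fields, which is 0 once the array is cleared.
 */
int zeroed_state_sum(void)
{
    struct voltage_state states[MAX_STATES];
    int sum = 0;

    memset(states, 0, sizeof(states));
    for (int i = 0; i < MAX_STATES; i++)
        sum += states[i].state;
    return sum;
}
```

Writing the initializer fully braced, as `= {{0}}`, is another option that keeps the declaration and the zeroing on one line.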

Re: linux-next: Tree for Sep 19 (amdgpu)

2019-09-19 Thread Randy Dunlap
On 9/19/19 9:06 AM, Mark Brown wrote: > Hi all, > > Changes since 20190918: > ../drivers/gpu/drm/amd/amdgpu/../display/dc/dml/Makefile:70: *** missing 'endif'. Stop. -- ~Randy

[PATCH 1/6] drm/amdgpu: fix documentation for amdgpu_gem_prime_export

2019-09-19 Thread Alex Deucher
Drop extra function parameter. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_dma_buf.c | 1 - 1 file changed, 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_dma_buf.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_dma_buf.c index 61f108ec2b5c..4917b548b7f2 100644 --- a/d

[PATCH 5/6] drm/amdgpu: fix documentation for amdgpu_pm.c

2019-09-19 Thread Alex Deucher
Fix DOC link name, clean up formatting in pp_dpm_* section. Signed-off-by: Alex Deucher --- Documentation/gpu/amdgpu.rst | 6 +++--- drivers/gpu/drm/amd/amdgpu/amdgpu_pm.c | 13 + 2 files changed, 12 insertions(+), 7 deletions(-) diff --git a/Documentation/gpu/amdgpu.rst

[PATCH 2/6] drm/amdgpu/mn: fix documentation for amdgpu_mn_read_lock

2019-09-19 Thread Alex Deucher
Document the new parameter. Fixes: 93065ac753e4 ("mm, oom: distinguish blockable mode for mmu notifiers") Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_mn.c b/drivers/gpu/drm/amd/amdgp

[PATCH 3/6] drm/amdgpu/vm: fix up documentation in amdgpu_vm.c

2019-09-19 Thread Alex Deucher
Missing parameters, wrong comment type, etc. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c | 17 ++--- 1 file changed, 10 insertions(+), 7 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c index 2bb531

[PATCH 6/6] drm/amdgpu/ras: fix and update the documentation for RAS

2019-09-19 Thread Alex Deucher
Add new sections to amdgpu.rst, fix up formatting issues, add additional documentation to each section. Signed-off-by: Alex Deucher --- Documentation/gpu/amdgpu.rst| 24 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 53 + 2 files changed, 68 insertions

[PATCH 4/6] drm/amdgpu/ih: fix documentation in amdgpu_irq_dispatch

2019-09-19 Thread Alex Deucher
Fix parameters. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_irq.c index 1f2628f445c4..8f49d8131d63 100644 --- a/drivers/gpu

Re: [PATCH v2] drm/amd/display: fix struct init in update_bounding_box

2019-09-19 Thread Raul Rangel
Friendly ping for review. Thanks

[PATCH] drm/amdgpu: Enable gfx cache probing on HDP write for arcturus

2019-09-19 Thread Zeng, Oak
This allows gfx cache to be probed and invalidated (for non-dirty cache lines) on an HDP write (from either another GPU or CPU). This should work only for the memory mapped as RW memory type newly added for arcturus, to achieve some cache coherence between multiple memory clients. Change-Id: I0a69d000

Re: [PATCH] drm/amdkfd: fix a potential NULL pointer dereference

2019-09-19 Thread Kuehling, Felix
On 2019-09-18 12:30 p.m., Allen Pais wrote: > alloc_workqueue is not checked for errors and as a result, > a potential NULL dereference could occur. > > Signed-off-by: Allen Pais > --- > drivers/gpu/drm/amd/amdkfd/kfd_interrupt.c | 5 + > 1 file changed, 5 insertions(+) > > diff --git a/dri

Help building the UMS branch of xf86-video-ati on a modern Linux distro

2019-09-19 Thread Nicolás Colla
Hello everyone, I desperately need to bring back userland mode setting (UMS) support on my machine for hardware compatibility reasons. As you probably know, UMS support was removed some years ago, and a new git branch was created with the UMS-compatible source code of the driver. I cloned the

Re: [PATCH] drm/amd/display; Fix kernel doc warnings

2019-09-19 Thread Alex Deucher
On Thu, Sep 19, 2019 at 1:52 PM Harry Wentland wrote: > > We had a couple of missing definitions and formatting errors. > > v2: Fix 'notifying' type > > Signed-off-by: Harry Wentland > Reviewed-by: Nicholas Kazlauskas Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/display/amdgpu_dm/am

[PATCH] drm/amd/display; Fix kernel doc warnings

2019-09-19 Thread Harry Wentland
We had a couple of missing definitions and formatting errors. v2: Fix 'notifying' type Signed-off-by: Harry Wentland Reviewed-by: Nicholas Kazlauskas --- drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c | 14 ++ drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.h | 10 +- 2

Re: [PATCH] drm/amdgpu: remove excess function parameter description

2019-09-19 Thread Alex Deucher
On Thu, Sep 19, 2019 at 10:03 AM yu kuai wrote: > > Fixes gcc warning: > > drivers/gpu/drm/amd/amdgpu/vcn_v2_5.c:431: warning: Excess function > parameter 'sw' description in 'vcn_v2_5_disable_clock_gating' > drivers/gpu/drm/amd/amdgpu/vcn_v2_5.c:550: warning: Excess function > parameter 'sw' desc

Re: [PATCH] drm/amd/display; Fix kernel doc warnings

2019-09-19 Thread Kazlauskas, Nicholas
On 2019-09-19 1:42 p.m., Harry Wentland wrote: > We had a couple of missing definitions and formatting errors. > > Signed-off-by: Harry Wentland > --- > drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c | 14 ++ > drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.h | 10 +- >

[PATCH] drm/amd/display; Fix kernel doc warnings

2019-09-19 Thread Harry Wentland
We had a couple of missing definitions and formatting errors. Signed-off-by: Harry Wentland --- drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c | 14 ++ drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.h | 10 +- 2 files changed, 23 insertions(+), 1 deletion(-) diff --git a

Re: [PATCH] drm/radeon: fix a potential NULL pointer dereference

2019-09-19 Thread Michel Dänzer
On 2019-09-18 6:31 p.m., Allen Pais wrote: > alloc_workqueue is not checked for errors and as a result, > a potential NULL dereference could occur. > > Signed-off-by: Allen Pais > --- > drivers/gpu/drm/radeon/radeon_display.c | 4 > 1 file changed, 4 insertions(+) > > diff --git a/drivers/

Re: [PATCH] drm/amdgpu: fix potential VM faults

2019-09-19 Thread Christian König
If the page tables are reserved or fenced while you allocate a new one, they would not be evicted. And exactly that's not correct. The TTM_OPT_FLAG_ALLOW_RES_EVICT flag allows evicting of reserved objects. This is useful for allocating per VM BOs, but is of course completely fatal in all othe

Re: [PATCH] drm/amd/display: hide an unused variable

2019-09-19 Thread Alex Deucher
On Thu, Sep 19, 2019 at 9:45 AM Harry Wentland wrote: > > On 2019-09-18 3:53 p.m., Arnd Bergmann wrote: > > Without CONFIG_DEBUG_FS, we get a warning for an unused > > variable: > > > > drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:6020:33: error: > > unused variable 'source' [-Werr

Re: [PATCH v2] drm/amdgpu: fix multiple memory leaks

2019-09-19 Thread Koenig, Christian
On 19.09.19 at 16:28, Sven Van Asbroeck wrote: > Hi Christian, > > On Thu, Sep 19, 2019 at 4:05 AM Koenig, Christian > wrote: >>> +out4: >>> + kfree(i2s_pdata); >>> +out3: >>> + kfree(adev->acp.acp_res); >>> +out2: >>> + kfree(adev->acp.acp_cell); >>> +out1: >>> + kfree(adev->acp.

[pull] amdgpu drm-fixes-5.4

2019-09-19 Thread Alex Deucher
Hi Dave, Daniel, A few fixes for 5.4. The following changes since commit 945b584c94f8c665b2df3834a8a6a8faf256cd5f: Merge branch 'linux-5.4' of git://github.com/skeggsb/linux into drm-next (2019-09-17 16:31:34 +1000) are available in the Git repository at: git://people.freedesktop.org/~agd

[PATCH] drm/amdgpu/gfx10: add workaround for register aliasing bug

2019-09-19 Thread Alex Deucher
We need a special programming sequence for updating mmUTCL1_CGTT_CLK_CTRL golden settings. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gfx_v10_0.c | 96 +- 1 file changed, 78 insertions(+), 18 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v10_0.

Re: [PATCH v2] drm/amdgpu: fix multiple memory leaks

2019-09-19 Thread Sven Van Asbroeck
Hi Christian, On Thu, Sep 19, 2019 at 4:05 AM Koenig, Christian wrote: > > > +out4: > > + kfree(i2s_pdata); > > +out3: > > + kfree(adev->acp.acp_res); > > +out2: > > + kfree(adev->acp.acp_cell); > > +out1: > > + kfree(adev->acp.acp_genpd); > > kfree on a NULL pointer is harmless,
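
The remark quoted above, that kfree on a NULL pointer is harmless, means an error path can free every pointer unconditionally rather than unwinding through one label per allocation. A minimal userspace sketch, using `free` as a stand-in for `kfree` (the C standard defines `free(NULL)` as a no-op); `struct acp_like`, `acp_like_init`, and the `fail_at` test hook are hypothetical and not taken from the driver:

```c
#include <stdlib.h>

/* Hypothetical container mirroring the three allocations in the thread. */
struct acp_like {
    void *genpd;
    void *cell;
    void *res;
};

/*
 * Allocate three buffers. fail_at (1..3) forces that allocation to fail;
 * 0 forces nothing. Returns 0 on success, -1 on failure with all three
 * pointers freed and reset to NULL.
 */
int acp_like_init(struct acp_like *a, int fail_at)
{
    a->genpd = a->cell = a->res = NULL;

    a->genpd = (fail_at == 1) ? NULL : malloc(16);
    if (!a->genpd)
        goto failure;
    a->cell = (fail_at == 2) ? NULL : malloc(16);
    if (!a->cell)
        goto failure;
    a->res = (fail_at == 3) ? NULL : malloc(16);
    if (!a->res)
        goto failure;
    return 0;

failure:
    /* Pointers that were never allocated are still NULL; freeing is safe. */
    free(a->res);
    free(a->cell);
    free(a->genpd);
    a->genpd = a->cell = a->res = NULL;
    return -1;
}
```

The single label only works because every pointer is set to NULL before the first allocation; without that, the error path would free uninitialized values.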

RE: [PATCH 05/21] drm/amdgpu: refine sdma4 ras_data_cb

2019-09-19 Thread Zhang, Hawking
Let's add comments clarifying why we check the GFX IP block for the SDMA interrupt, so people will not be confused here. Regards, Hawking -Original Message- From: Chen, Guchun Sent: September 19, 2019 21:59 To: Zhou1, Tao ; amd-gfx@lists.freedesktop.org; Zhang, Hawking Subject: RE: [PATCH 05/21]

Re: [PATCH] drm/amdgpu: fix potential VM faults

2019-09-19 Thread Kuehling, Felix
I'm not disagreeing with the change. Just trying to understand how this could have caused a VM fault. If the page tables are reserved or fenced while you allocate a new one, they would not be evicted. If they are not reserved or fenced, there should be no expectation that they stay resident. Is

RE: [PATCH 00/21] RAS code refinements

2019-09-19 Thread Chen, Guchun
Two comments in patch 5 and patch 11. Apart from that, the series is: Reviewed-by: Guchun Chen Regards, Guchun -Original Message- From: Zhou1, Tao Sent: Thursday, September 19, 2019 3:12 PM To: amd-gfx@lists.freedesktop.org; Chen, Guchun ; Zhang, Hawking Cc: Zhou1, Tao Subject: [PAT

RE: [PATCH 11/21] drm/amdgpu: add common gfx_ras_fini function

2019-09-19 Thread Chen, Guchun
-Original Message- From: Zhou1, Tao Sent: Thursday, September 19, 2019 3:13 PM To: amd-gfx@lists.freedesktop.org; Chen, Guchun ; Zhang, Hawking Cc: Zhou1, Tao Subject: [PATCH 11/21] drm/amdgpu: add common gfx_ras_fini function gfx_ras_fini can be shared among all generations of gfx

[PATCH] drm/amdgpu: remove excess function parameter description

2019-09-19 Thread yu kuai
Fixes gcc warning: drivers/gpu/drm/amd/amdgpu/vcn_v2_5.c:431: warning: Excess function parameter 'sw' description in 'vcn_v2_5_disable_clock_gating' drivers/gpu/drm/amd/amdgpu/vcn_v2_5.c:550: warning: Excess function parameter 'sw' description in 'vcn_v2_5_enable_clock_gating' Fixes: cbead2bdfcf1

RE: [PATCH 05/21] drm/amdgpu: refine sdma4 ras_data_cb

2019-09-19 Thread Chen, Guchun
Regards, Guchun -Original Message- From: Zhou1, Tao Sent: Thursday, September 19, 2019 3:13 PM To: amd-gfx@lists.freedesktop.org; Chen, Guchun ; Zhang, Hawking Cc: Zhou1, Tao Subject: [PATCH 05/21] drm/amdgpu: refine sdma4 ras_data_cb simplify code logic and refine return value S

Re: [PATCH] drm/amd/display: hide an unused variable

2019-09-19 Thread Harry Wentland
On 2019-09-18 3:53 p.m., Arnd Bergmann wrote: > Without CONFIG_DEBUG_FS, we get a warning for an unused > variable: > > drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:6020:33: error: > unused variable 'source' [-Werror,-Wunused-variable] > > Hide the variable in an #ifdef like its o

Re: [PATCH] drm/amdgpu: restrict hotplug error message

2019-09-19 Thread Deucher, Alexander
Acked-by: Alex Deucher From: amd-gfx on behalf of Christian König Sent: Thursday, September 19, 2019 9:16 AM To: amd-gfx@lists.freedesktop.org Cc: Deng, Emily ; Zhang, Jack (Jian) Subject: [PATCH] drm/amdgpu: restrict hotplug error message We should print the

[PATCH] drm/amdgpu: restrict hotplug error message

2019-09-19 Thread Christian König
We should print the error only when we are hotplugged and crash basically all userspace applications. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 5 - 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/driv

Re: [PATCH] drm/amdgpu: fix potential VM faults

2019-09-19 Thread Deucher, Alexander
Acked-by: Alex Deucher From: amd-gfx on behalf of Christian König Sent: Thursday, September 19, 2019 4:41 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH] drm/amdgpu: fix potential VM faults When we allocate new page tables under memory pressure we should

Re: [PATCH v2] drm/amd/powerplay [Vega10]Vega10 BKM0.83 AVFS parameters patch for Linux Driver

2019-09-19 Thread Alex Deucher
On Thu, Sep 19, 2019 at 3:14 AM Jesse Zhang wrote: > > From: zhexzhan > > Issue: DROOP coef read by HDT appear to be mismatch with requirement of > BKM0.83 > > Root cause: These values are supposed to be overwritten by PPLIB. > However, driver missed code of this part. > > Solution: Add overwrit

Re: [PATCH v5] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Christian König
On 19.09.19 at 12:09, Jesse Zhang wrote: When compute fence did signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout

Re: [PATCH v3 07/11] drm/ttm: rename ttm_fbdev_mmap

2019-09-19 Thread Thomas Zimmermann
Hi On 19.09.19 at 12:02, Gerd Hoffmann wrote: > Rename ttm_fbdev_mmap to ttm_bo_mmap_obj. Move the vm_pgoff sanity > check to amdgpu_bo_fbdev_mmap (only ttm_fbdev_mmap user in tree). > > The ttm_bo_mmap_obj function can now be used to map any buffer object. > This allows to implement &drm_gem_o

[PATCH v5] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is a t

[PATCH v3 07/11] drm/ttm: rename ttm_fbdev_mmap

2019-09-19 Thread Gerd Hoffmann
Rename ttm_fbdev_mmap to ttm_bo_mmap_obj. Move the vm_pgoff sanity check to amdgpu_bo_fbdev_mmap (only ttm_fbdev_mmap user in tree). The ttm_bo_mmap_obj function can now be used to map any buffer object. This allows to implement &drm_gem_object_funcs.mmap in gem ttm helpers. Signed-off-by: Gerd

RE: [PATCH] drm/amdgpu/sriov: omit fbcon error under sriov or passthrough

2019-09-19 Thread Deng, Emily
Ok, thanks very much. Best wishes Emily Deng From: Koenig, Christian Sent: Thursday, September 19, 2019 5:06 PM To: Deng, Emily Cc: Zhang, Jack (Jian) ; amd-gfx@lists.freedesktop.org; Teng, Rui ; Cui, Flora Subject: RE: [PATCH] drm/amdgpu/sriov: omit fbcon error under sriov or passthrough I

RE: [PATCH] drm/amdgpu/sriov: omit fbcon error under sriov or passthrough

2019-09-19 Thread Koenig, Christian
I can create a patch based on this today and push it on Monday. Christian. On 19.09.2019 11:05, "Deng, Emily" wrote: Hi Christian, Could you please help to push the code? Best wishes Emily Deng From: Zhang, Jack (Jian) Sent: Thursday, September 19, 2019 11:33 AM To: Deng, Emily ; Koenig, Chri

RE: [PATCH] drm/amdgpu/sriov: omit fbcon error under sriov or passthrough

2019-09-19 Thread Deng, Emily
Hi Christian, Could you please help to push the code? Best wishes Emily Deng From: Zhang, Jack (Jian) Sent: Thursday, September 19, 2019 11:33 AM To: Deng, Emily ; Koenig, Christian Cc: amd-gfx@lists.freedesktop.org; Teng, Rui ; Cui, Flora Subject: RE: [PATCH] drm/amdgpu/sriov: omit fbcon err

[PATCH] drm/amdgpu: fix potential VM faults

2019-09-19 Thread Christian König
When we allocate new page tables under memory pressure we should not evict old ones. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_object.c | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c b/drivers/gpu/drm/a

Re: [PATCH v4] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Christian König
On 19.09.19 at 10:00, Jesse Zhang wrote: When compute fence did not signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring tim

Re: [PATCH v2] drm/amdgpu: fix multiple memory leaks

2019-09-19 Thread Koenig, Christian
On 18.09.19 at 21:05, Navid Emamdoost wrote: > In acp_hw_init there are some allocations that need to be released in > case of failure: > > 1- adev->acp.acp_genpd should be released if any allocation attempt for > adev->acp.acp_cell, adev->acp.acp_res or i2s_pdata fails. > 2- all of those allocati

[PATCH v4] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did not signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is

[PATCH v3] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is a t

[PATCH v2] drm/amd/powerplay [Vega10]Vega10 BKM0.83 AVFS parameters patch for Linux Driver

2019-09-19 Thread Jesse Zhang
From: zhexzhan Issue: DROOP coef read by HDT appear to be mismatch with requirement of BKM0.83 Root cause: These values are supposed to be overwritten by PPLIB. However, driver missed code of this part. Solution: Add overwriting process when reading pptable from vBIOS Hardcode specific coef wit

[PATCH 16/21] drm/amdgpu: add ras fini for nbio

2019-09-19 Thread Zhou1, Tao
add a common nbio ras fini implementation to cleanup nbio ras framework Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_nbio.c | 14 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_nbio.h | 2 +- drivers/gpu/drm/amd/amdgpu/soc15.c | 1 + 3 files changed, 16 insertions(+),

[PATCH 13/21] drm/amdgpu: remove ih_info parameter of umc_ras_late_init

2019-09-19 Thread Zhou1, Tao
umc_ras_late_init can get the info by itself Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 15 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 4 ++-- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 5 + 3 files changed, 10 insertions(+), 14 deletions(-) diff -

[PATCH 12/21] drm/amdgpu: add common sdma_ras_fini function

2019-09-19 Thread Zhou1, Tao
sdma_ras_fini can be shared among all generations of sdma Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.c | 19 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.h | 1 + drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 16 +--- 3 files changed, 21 insertions

[PATCH 18/21] drm/amdgpu: move umc ras fini to umc block

2019-09-19 Thread Zhou1, Tao
it's more suitable to put umc ras fini in umc block Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 12 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 15 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 1 + 3 files changed, 17 insertions(+), 11 deletions

[PATCH 15/21] drm/amdgpu: simplify the access to eeprom_control struct

2019-09-19 Thread Zhou1, Tao
simplify the code of accessing to eeprom_control struct Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index f72c9f011

[PATCH 20/21] drm/amdgpu: move xgmi ras fini to xgmi block

2019-09-19 Thread Zhou1, Tao
it's more suitable to put xgmi ras fini in xgmi block Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 13 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c | 14 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.h | 1 + 3 files changed, 17 insertions(+), 11 dele

[PATCH 14/21] drm/amdgpu: remove ih_info parameter of gfx_ras_late_init

2019-09-19 Thread Zhou1, Tao
gfx_ras_late_init can get the info by itself Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c | 16 +++- drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h | 3 +-- drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 5 + 3 files changed, 9 insertions(+), 15 deletions(-) diff --

[PATCH 21/21] drm/amdgpu: implement common gmc_ecc_late_init

2019-09-19 Thread Zhou1, Tao
common gmc_ecc_late_init can be shared among all generations of gmc Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 19 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 + drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 22 +- 3 files changed,

[PATCH 17/21] drm/amdgpu: add ras fini for xgmi

2019-09-19 Thread Zhou1, Tao
add ras fini for xgmi to cleanup xgmi ras framework Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 11 +++ 1 file changed, 11 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c index 58a95a67336a..4cd206ee

[PATCH 09/21] drm/amdgpu: move mmhub_ras_if from gmc to mmhub block

2019-09-19 Thread Zhou1, Tao
mmhub_ras_if is relevant to mmhub Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 - drivers/gpu/drm/amd/amdgpu/amdgpu_mmhub.c | 24 +++ drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 4 ++-- 3 files changed, 14 insertions(+), 15 deletions(-) diff --

[PATCH 19/21] drm/amdgpu: move mmhub ras fini to mmhub block

2019-09-19 Thread Zhou1, Tao
it's more suitable to put mmhub ras fini in mmhub block Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 12 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_mmhub.c | 14 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_mmhub.h | 2 +- 3 files changed, 16 insertions(+), 12

[PATCH 05/21] drm/amdgpu: refine sdma4 ras_data_cb

2019-09-19 Thread Zhou1, Tao
simplify code logic and refine return value Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 32 ++ 1 file changed, 17 insertions(+), 15 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c b/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c index a

[PATCH 11/21] drm/amdgpu: add common gfx_ras_fini function

2019-09-19 Thread Zhou1, Tao
gfx_ras_fini can be shared among all generations of gfx Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c | 15 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h | 1 + drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 14 +- 3 files changed, 17 insertions(+), 13 del

[PATCH 08/21] drm/amdgpu: replace mmhub_funcs with mmhub.funcs

2019-09-19 Thread Zhou1, Tao
remove mmhub_funcs in adev Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 1 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 6 +++--- 3 files changed, 5 insertions(+), 6 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/am

[PATCH 10/21] drm/amdgpu: add common gmc_ras_fini function

2019-09-19 Thread Zhou1, Tao
gmc_ras_fini can be shared among all generations of gmc Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 26 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 + drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 28 + 3 files changed, 28 in

[PATCH 07/21] drm/amdgpu: add common mmhub member for adev

2019-09-19 Thread Zhou1, Tao
put mmhub_funcs and ras_if pointer into mmhub struct Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 3 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_mmhub.h | 5 + 2 files changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu.h b/drivers/gpu/drm/amd/amdgp

[PATCH 06/21] drm/amdgpu: move umc_ras_if from gmc to umc block

2019-09-19 Thread Zhou1, Tao
umc_ras_if is relevant to umc Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.h | 1 - drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 28 - drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 1 + drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 4 ++-- 4 files changed, 17

[PATCH v2] drm/amd/amdgpu:Fix compute ring unable to detect hang.

2019-09-19 Thread Jesse Zhang
When compute fence did signal, compute ring cannot detect hardware hang because its timeout value is set to be infinite by default. In SR-IOV and passthrough mode, if user does not declare custom timeout value for compute ring, then use gfx ring timeout value as default. So that when there is a t

[PATCH 02/21] drm/amdgpu: move umc ras irq functions to umc block

2019-09-19 Thread Zhou1, Tao
move umc ras irq functions from gmc v9 to generic umc block Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 65 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 6 +++ drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 68 + 3 files changed,

[PATCH 03/21] drm/amdgpu: move gfx ecc functions to generic gfx file

2019-09-19 Thread Zhou1, Tao
gfx ras ecc common functions could be reused among all gfx generations Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.c | 33 drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h | 6 drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 41 ++--- 3 files

[PATCH 04/21] drm/amdgpu: move sdma ecc functions to generic sdma file

2019-09-19 Thread Zhou1, Tao
sdma ras ecc functions can be reused among all sdma generations Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.c | 28 drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.h | 6 + drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 24 ++-- 3 files cha

[PATCH 01/21] drm/amdgpu: update parameter of ras_ih_cb

2019-09-19 Thread Zhou1, Tao
change struct ras_err_data *err_data to void *err_data, align with the implementation of umc code and the callback's declaration in each ras block could pay no attention to the structure type Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 2 +- drivers/gpu/drm/amd/amdgpu/g
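
The signature change described in this patch summary can be illustrated with a small standalone sketch. Only the type name `ras_err_data` and the idea of a `void *err_data` parameter come from the summary; the struct fields, `ras_cb_t`, `umc_like_cb`, and `dispatch` are hypothetical and are not the driver's definitions.

```c
/* Field names here are assumptions made for the example. */
struct ras_err_data {
    unsigned long ue_count;
    unsigned long ce_count;
};

/* Generic callback type: the declaration no longer names the struct. */
typedef int (*ras_cb_t)(void *err_data);

/* A block-specific callback recovers the concrete type internally. */
int umc_like_cb(void *err_data)
{
    struct ras_err_data *d = err_data; /* void * converts implicitly in C */

    d->ue_count += 1;
    return 0;
}

/* Common code passes the pointer through without knowing its type. */
int dispatch(ras_cb_t cb, void *err_data)
{
    return cb(err_data);
}
```

The trade-off of an opaque pointer is that the compiler can no longer check that caller and callback agree on the structure type, so that agreement has to be kept by convention.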

[PATCH 00/21] RAS code refinements

2019-09-19 Thread Zhou1, Tao
some refinements for RAS, no functional change: 1. make more ras code can be reusable among different generations of ras block; 2. make some ras code simpler; Tao Zhou (21): drm/amdgpu: update parameter of ras_ih_cb drm/amdgpu: move umc ras irq functions to umc block drm/amdgpu: move gfx ec