Hi Christian ,
I just wonder when encounter ENOMEM error during pin amdgpu BOs can we retry
validate again as below.
With the following simply patch the Abaqus pinned issue not observed.
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
b/drivers/gpu/drm/amd/amdgpu/amdgpu_object.c
index 1
On Tue, May 14, 2019 at 5:12 PM Kuehling, Felix wrote:
>
>
> On 2019-05-13 4:21 p.m., Deucher, Alexander wrote:
> > [CAUTION: External Email]
> > I reverted all the amdgpu HMM patches for 5.2 because they also
> > depended on this patch:
> > https://cgit.freedesktop.org/~agd5f/linux/commit/?h=drm-
On 2019-05-13 5:27 p.m., Andrew Morton wrote:
> [CAUTION: External Email]
>
> On Fri, 10 May 2019 19:53:23 + "Kuehling, Felix"
> wrote:
>
>> From: Philip Yang
>>
>> While the page is migrating by NUMA balancing, HMM failed to detect this
>> condition and still return the old page. Applicatio
On 2019-05-13 4:21 p.m., Deucher, Alexander wrote:
> [CAUTION: External Email]
> I reverted all the amdgpu HMM patches for 5.2 because they also
> depended on this patch:
> https://cgit.freedesktop.org/~agd5f/linux/commit/?h=drm-next-5.2-wip&id=ce05ef71564f7cbe270cd4337c36ee720ea534db
> which did
On 2019-05-14 2:20 p.m., Kazlauskas, Nicholas wrote:
> [CAUTION: External Email]
>
> On 5/14/19 1:49 PM, Harry Wentland wrote:
>>
>> [WHY]
>> We only want to load DMCU FW on Picasso and Raven 2, not on Raven 1.
>>
>> Signed-off-by: Harry Wentland
>> ---
>> drivers/gpu/drm/amd/display/include/da
Hi Roman,
Thanks for your feedback. I will rework and send new patch soon.
Best Regards,
Harish
From: Roman Gushchin
Sent: Tuesday, May 14, 2019 1:37 PM
To: Kasiviswanathan, Harish
Cc: cgro...@vger.kernel.org; amd-gfx@lists.freedesktop.org
Subject: Re: [PATCH 4/4] drm/amdkfd: Check against
This series fixes the OOM errors. However, if I torture the kernel driver
more, I can get it to deadlock and end up with unkillable processes. I can
also get an OOM error. I just ran the test 5 times:
AMD_DEBUG=testgdsmm glxgears & AMD_DEBUG=testgdsmm glxgears &
AMD_DEBUG=testgdsmm glxgears & AMD_
On Tue, May 14, 2019 at 01:58:40AM +, Roman Gushchin wrote:
> On Wed, May 01, 2019 at 02:59:29PM +, Kasiviswanathan, Harish wrote:
> > Participate in device cgroup. All kfd devices are exposed via /dev/kfd.
> > So use /dev/dri/renderN node.
> >
> > Before exposing the device to a task chec
On 5/14/19 1:49 PM, Harry Wentland wrote:
>
> [WHY]
> We only want to load DMCU FW on Picasso and Raven 2, not on Raven 1.
>
> Signed-off-by: Harry Wentland
> ---
> drivers/gpu/drm/amd/display/include/dal_asic_id.h | 7 ---
> 1 file changed, 4 insertions(+), 3 deletions(-)
>
> diff --git
On 2019-05-14 2:02 p.m., Kazlauskas, Nicholas wrote:
> On 5/14/19 1:49 PM, Harry Wentland wrote:
>>
>> [WHY]
>> These were only needed for bringup. They're not needed anymore.
>>
>> Signed-off-by: Harry Wentland
>
> Series is:
>
> Reviewed-by: Nicholas Kazlauskas
>
> I think a lot of those DCN
On 5/14/19 1:49 PM, Harry Wentland wrote:
>
> [WHY]
> These were only needed for bringup. They're not needed anymore.
>
> Signed-off-by: Harry Wentland
Series is:
Reviewed-by: Nicholas Kazlauskas
I think a lot of those DCN guards around checking ASIC revision aren't
strictly necessary (like
[WHY]
We only want to load DMCU FW on Picasso and Raven 2, not on Raven 1.
Signed-off-by: Harry Wentland
---
drivers/gpu/drm/amd/display/include/dal_asic_id.h | 7 ---
1 file changed, 4 insertions(+), 3 deletions(-)
diff --git a/drivers/gpu/drm/amd/display/include/dal_asic_id.h
b/drivers/g
[WHY]
These were only needed for bringup. They're not needed anymore.
Signed-off-by: Harry Wentland
---
drivers/gpu/drm/amd/display/Kconfig | 6 --
.../gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c | 2 --
.../display/dc/bios/command_table_helper2.c | 5 -
.../gpu/drm/amd/disp
[WHY]
Some early Raven boards had a bad SBIOS that doesn't play nicely with
the DMCU FW. We thought the issues were fixed by ignoring errors on DMCU
load but that doesn't seem to be the case. We've still seen reports of
users unable to boot their systems at all.
[HOW]
Disable DMCU load on Raven 1.
how to refresh LRU to keep the order align with bo list passed from user space?
you can verify it by some games, performance could be different much between
multiple runnings.
-David
Original Message
Subject: Re: [PATCH 11/11] drm/amdgpu: stop removing BOs from the LRU during
Hi Jack,
Let's hold on patch #2 for now. I'd prefer to gracefully deal with dpm_disabled
in amdgpu_pm interface level (or at least amdgpu_smu level) so that we don't
need to maintain the case for each ASIC since ppt is asic specific ones.
Regards,
Hawking
-Original Message-
From: Gui,
Series is:
Reviewed-by: Alex Deucher
From: amd-gfx on behalf of Evan Quan
Sent: Tuesday, May 14, 2019 7:19 AM
To: amd-gfx@lists.freedesktop.org
Cc: Quan, Evan
Subject: [PATCH 2/2] drm/amd/powerplay: support sw smu hotspot and memory
temperature retrieval
[CAUT
> -Original Message-
> From: amd-gfx On Behalf Of
> Christian König
> Sent: Tuesday, May 14, 2019 6:34 AM
> To: Zhou, Tiecheng ; amd-
> g...@lists.freedesktop.org
> Cc: Deng, Emily
> Subject: Re: [PATCH] drm/amdgpu/sriov: Need to initialize the
> HDP_NONSURFACE_BAStE
>
> [CAUTION: Extern
Reviewed-by: Alex Deucher
From: amd-gfx on behalf of Tiecheng
Zhou
Sent: Monday, May 13, 2019 11:34 PM
To: amd-gfx@lists.freedesktop.org
Cc: Zhou, Tiecheng; Deng, Emily
Subject: [PATCH] drm/amdgpu/sriov: Need to initialize the HDP_NONSURFACE_BAStE
[CAUTION: Ext
Reviewed-by: Hawking Zhang
Regards,
Hawking
_
From: Gui, Jack
Sent: 2019年5月13日 11:32
To: Zhang, Hawking ; amd-gfx@lists.freedesktop.org
Subject: RE: [PATCH 1/2] drm/amd/powerplay: Enable "disable dpm" feature to
support swSMU debug
<< File: 0001-dr
Hui? What do you mean with that?
Christian.
Am 14.05.19 um 15:12 schrieb Zhou, David(ChunMing):
my only concern is how to fresh LRU when bo is from bo list.
-David
Original Message
Subject: [PATCH 11/11] drm/amdgpu: stop removing BOs from the LRU
during CS
From: Christian
I don't think that anyone understands all of the internals of FW version
handling 😉 Right now we're just returning the same values as what debugfs
returns. The SMI tool will do the version interpretation (like turning
0x00282b00 into 40.43 for the SMU, for example). Or knowing that the CP
firmw
my only concern is how to fresh LRU when bo is from bo list.
-David
Original Message
Subject: [PATCH 11/11] drm/amdgpu: stop removing BOs from the LRU during CS
From: Christian König
To: "Olsak, Marek" ,"Zhou, David(ChunMing)" ,"Liang, Prike"
,dri-de...@lists.freedesktop.org,am
Move BOs which are currently in the system domain to
the new LRU before allocating backing space.
This makes sure that we always have enough entries on the
LRU to allow for other processes to wait for an operation
to complete.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 45
Let userspace try again if we really run into a deadlock during eviction.
This has a low chance of live locking, but with guaranteed forward process.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 2 ++
1 file changed, 2 insertions(+)
diff --git a/drivers/gpu/drm/ttm/ttm_bo.
From: Chunming Zhou
add ticket for display bo, so that it can preempt busy bo.
v2: fix stupid rebase error
Change-Id: I9f031cdcc8267de00e819ae303baa0a52df8ebb9
Signed-off-by: Chunming Zhou
Reviewed-by: Christian König
---
.../gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c | 21 ++-
This avoids OOM situations when we have lots of threads
submitting at the same time.
Signed-off-by: Christian König
---
drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c
b/drivers/gpu/drm/amd/amdgpu/
If drivers don't prefer a system memory placement
they should not but it into the placement list first.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 18 +-
1 file changed, 9 insertions(+), 9 deletions(-)
diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gp
We tried this once before, but that turned out to be more
complicated than thought. With all the right prerequisites
it looks like we can do this now.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 128 ++-
1 file changed, 67 insertions(+), 61 d
This way they are available for eviction immediately.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 13 +
1 file changed, 5 insertions(+), 8 deletions(-)
diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gpu/drm/ttm/ttm_bo.c
index 233bfb86068b..a301c876ae31 100
When a signal arrives we should return immediately for
handling it and not try other placements first.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 7 +++
1 file changed, 3 insertions(+), 4 deletions(-)
diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gpu/drm/ttm/ttm
Pipeline removal of the BOs backing store when no placement is given
during validation.
Signed-off-by: Christian König
---
drivers/gpu/drm/ttm/ttm_bo.c | 12
1 file changed, 12 insertions(+)
diff --git a/drivers/gpu/drm/ttm/ttm_bo.c b/drivers/gpu/drm/ttm/ttm_bo.c
index e634d3a36923
We are already doing this for DMA-buf imports and also for
amdgpu VM BOs for quite a while now.
If this doesn't run into any problems we are probably going
to stop removing BOs from the LRU altogether.
Signed-off-by: Christian König
---
.../gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c | 9 +--
From: Chunming Zhou
heavy gpu job could occupy memory long time, which lead other user fail to get
memory.
basically pick up Christian idea:
1. Reserve the BO in DC using a ww_mutex ticket (trivial).
2. If we then run into this EBUSY condition in TTM check if the BO we need
memory for (or rat
Support hotspot and memory temperature retrieval on sw smu routine.
Change-Id: If2ed1e2835f4b158a4a6d93aee8b358af18b9bfc
Signed-off-by: Evan Quan
---
drivers/gpu/drm/amd/powerplay/inc/smu_v11_0.h | 3 +
drivers/gpu/drm/amd/powerplay/smu_v11_0.c | 74 ---
2 files changed, 66
Support realtime uclk activity report.
Change-Id: I89cf7c95233060ee106e9fcef3b8e6707cd60466
Signed-off-by: Evan Quan
---
drivers/gpu/drm/amd/powerplay/smu_v11_0.c | 15 ++-
1 file changed, 14 insertions(+), 1 deletion(-)
diff --git a/drivers/gpu/drm/amd/powerplay/smu_v11_0.c
b/driv
Am 14.05.19 um 12:59 schrieb Tiecheng Zhou:
it requires to initialize HDP_NONSURFACE_BASE, so as to avoid
using the value left by a previous VM under sriov scenario.
v2: it should not hurt baremetal, generalize it for both sriov
and baremetal
Signed-off-by: Emily Deng
Signed-off-by: Tiecheng Z
it requires to initialize HDP_NONSURFACE_BASE, so as to avoid
using the value left by a previous VM under sriov scenario.
v2: it should not hurt baremetal, generalize it for both sriov
and baremetal
Signed-off-by: Emily Deng
Signed-off-by: Tiecheng Zhou
---
drivers/gpu/drm/amd/amdgpu/gmc_v9_0.
Am 14.05.19 um 12:24 schrieb Tiecheng Zhou:
it requires to initialize HDP_NONSURFACE_BASE, so as to avoid
using the value left by a previous VM under sriov scenario.
Signed-off-by: Emily Deng
Signed-off-by: Tiecheng Zhou
Would it hurt us to also do this on bare metal?
Apart from that looks
I don't claim that I understand all of the internals of fw version
handling, but this looks really nice to me from a kernel perspective.
Feel free to add my Reviewed-by: Christian König .
Regards,
Christian.
Am 14.05.19 um 12:19 schrieb Russell, Kent:
Looks fine to me, but hoping to get an RB
it requires to initialize HDP_NONSURFACE_BASE, so as to avoid
using the value left by a previous VM under sriov scenario.
Signed-off-by: Emily Deng
Signed-off-by: Tiecheng Zhou
---
drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 5 +
1 file changed, 5 insertions(+)
diff --git a/drivers/gpu/drm/amd
Looks fine to me, but hoping to get an RB from Christian now that we're moving
them into a subfolder.
Kent
-Original Message-
From: amd-gfx On Behalf Of Messinger,
Ori
Sent: Thursday, May 9, 2019 4:35 PM
To: amd-gfx@lists.freedesktop.org
Cc: Messinger, Ori
Subject: [PATCH] drm/amdgpu
Thank you, Lionel.
-David
On 2019年05月14日 17:49, Lionel Landwerlin wrote:
[CAUTION: External Email]
With the small nits, patches 2 & 4 are : Reviewed-by: Lionel Landwerlin
The other patches are a bit amdgpu specific so maybe you might want
someone more familiar with amdgpu to review them.
Sti
With the small nits, patches 2 & 4 are : Reviewed-by: Lionel Landwerlin
The other patches are a bit amdgpu specific so maybe you might want
someone more familiar with amdgpu to review them.
Still I didn't see anything wrong with them so remaining patches are :
Acked-by: Lionel Landwerlin
I'l
On 13/05/2019 10:53, Chunming Zhou wrote:
v2: use one transfer ioctl
Signed-off-by: Chunming Zhou
---
xf86drm.c | 33 +
xf86drm.h | 6 ++
2 files changed, 39 insertions(+)
diff --git a/xf86drm.c b/xf86drm.c
index 17e3d880..acd16fab 100644
--- a/xf86drm.
On 13/05/2019 10:53, Chunming Zhou wrote:
v2: drop export/import
Signed-off-by: Chunming Zhou
---
xf86drm.c | 44
xf86drm.h | 6 ++
2 files changed, 50 insertions(+)
diff --git a/xf86drm.c b/xf86drm.c
index 2c19376b..17e3d880 100644
--- a/x
46 matches
Mail list logo