[linux-next:master] BUILD REGRESSION 8a11187eb62b8b910d2c5484e1f5d160e8b11eb4

2022-03-16 Thread kernel test robot
tree/branch: https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master branch HEAD: 8a11187eb62b8b910d2c5484e1f5d160e8b11eb4 Add linux-next specific files for 20220316 Error/Warning reports: https://lore.kernel.org/linux-doc/202202240704.pqd40a9l-...@intel.com https

Re: [PATCH] drm/amdgpu/pm: fix the Stable pstate Test in amdgpu_test

2022-03-16 Thread Huang Rui
On Thu, Mar 17, 2022 at 11:14:53AM +0800, Zhang, Yifan wrote: > [AMD Official Use Only] > > Since it is a generic swsmu issue, shall we merge it first as temporary fix > to unblock amdgpu_test ? As most of swsmu_pm_funcs have an assumption that > pm.dpm_enabled is true, if allow this value is be

Re: [PATCH v1 1/3] mm: split vm_normal_pages for LRU and non-LRU handling

2022-03-16 Thread Alistair Popple
Felix Kuehling writes: > On 2022-03-11 04:16, David Hildenbrand wrote: >> On 10.03.22 18:26, Alex Sierra wrote: >>> DEVICE_COHERENT pages introduce a subtle distinction in the way >>> "normal" pages can be used by various callers throughout the kernel. >>> They behave like normal pages for purpos

Re: [PATCH v1 1/3] mm: split vm_normal_pages for LRU and non-LRU handling

2022-03-16 Thread Alistair Popple
Felix Kuehling writes: > Am 2022-03-10 um 14:25 schrieb Matthew Wilcox: >> On Thu, Mar 10, 2022 at 11:26:31AM -0600, Alex Sierra wrote: >>> @@ -606,7 +606,7 @@ static void print_bad_pte(struct vm_area_struct *vma, >>> unsigned long addr, >>>* PFNMAP mappings in order to support COWable mappi

Re: radeon ring 0 test failed on arm64

2022-03-16 Thread Peter Geis
Good Evening, I apologize for raising this email chain from the dead, but there have been some developments that have introduced even more questions. I've looped the Rockchip mailing list into this too, as this affects rk356x, and likely the upcoming rk3588 if [1] is to be believed. TLDR for thos

Re: radeon ring 0 test failed on arm64

2022-03-16 Thread Kever Yang
Hi Peter, On 2022/3/17 08:14, Peter Geis wrote: Good Evening, I apologize for raising this email chain from the dead, but there have been some developments that have introduced even more questions. I've looped the Rockchip mailing list into this too, as this affects rk356x, and likely the upcom

RE: [PATCH] drm/amdgpu/pm: fix the Stable pstate Test in amdgpu_test

2022-03-16 Thread Zhang, Yifan
[AMD Official Use Only] Since it is a generic swsmu issue, shall we merge it first as temporary fix to unblock amdgpu_test ? As most of swsmu_pm_funcs have an assumption that pm.dpm_enabled is true, if allow this value is be false, Lots of swsmu pm logic needs to be changed. -Original Mess

RE: [PATCH 1/4] drm/amdkfd: refine event_interrupt_poison_consumption

2022-03-16 Thread Zhou1, Tao
[AMD Official Use Only] > -Original Message- > From: Kuehling, Felix > Sent: Wednesday, March 16, 2022 10:04 PM > To: Zhou1, Tao ; amd-gfx@lists.freedesktop.org; Zhang, > Hawking ; Yang, Stanley > ; Chai, Thomas > Subject: Re: [PATCH 1/4] drm/amdkfd: refine > event_interrupt_poison_con

RE: amd-gfx Digest, Vol 70, Issue 199

2022-03-16 Thread Yat Sin, David
> > Use proper amdgpu_gem_prime_import function to handle all kinds of > imports. Remember the dmabuf reference to enable proper multi-GPU > attachment to multiple VMs without erroneously re-exporting the underlying > BO multiple times. > > Signed-off-by: Felix Kuehling > --- > .../gpu/drm/a

RE: [PATCH 1/1] drm/amdgpu: set noretry=1 for GFX 10.3.4

2022-03-16 Thread Chen, Guchun
Shall we sort it? Anyway, the patch is: Reviewed-by: Guchun Chen Regards, Guchun -Original Message- From: amd-gfx On Behalf Of Felix Kuehling Sent: Thursday, March 17, 2022 7:33 AM To: amd-gfx@lists.freedesktop.org Subject: Re: [PATCH 1/1] drm/amdgpu: set noretry=1 for GFX 10.3.4 Pin

[RFC PATCH 0/4] Enable render node VA mapping API for KFD BOs

2022-03-16 Thread Felix Kuehling
The KFD API is quite inflexible in that it allows only mapping entire BOs at the same virtual address on all GPUs. This is incompatible with newer CUDA memory management APIs. (see https://docs.nvidia.com/cuda/cuda-driver-api/group__CUDA__VA.html) Instead of inventing more KFD APIs to fix this, th

[RFC PATCH 4/4] drm/amdgpu: Do bo_va ref counting for KFD BOs

2022-03-16 Thread Felix Kuehling
This is needed to correctly handle BOs imported into the GEM API, which would otherwise get added twice to the same VM. Signed-off-by: Felix Kuehling --- .../gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c | 26 +++ 1 file changed, 21 insertions(+), 5 deletions(-) diff --git a/drivers

[RFC PATCH 3/4] drm/amdgpu: update mappings not managed by KFD

2022-03-16 Thread Felix Kuehling
When restoring after an eviction, use amdgpu_vm_handle_moved to update BO VA mappings in KFD VMs that are not managed through the KFD API. This should allow using the render node API to create more flexible memory mappings in KFD VMs. Signed-off-by: Felix Kuehling --- drivers/gpu/drm/amd/amdgpu/

[RFC PATCH 1/4] drm/amdkfd: Improve amdgpu_vm_handle_moved

2022-03-16 Thread Felix Kuehling
Let amdgpu_vm_handle_moved update all BO VA mappings of BOs reserved by the caller. This will be useful for handling extra BO VA mappings in KFD VMs that are managed through the render node API. TODO: This may also allow simplification of amdgpu_cs_vm_handling. See the TODO comment in the code. S

[RFC PATCH 2/4] drm/amdgpu: Attach eviction fence on alloc

2022-03-16 Thread Felix Kuehling
Instead of attaching the eviction fence when a KFD BO is first mapped, attach it when it is allocated or imported. This in preparation to allow KFD BOs to be mapped using the render node API. Signed-off-by: Felix Kuehling --- .../gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c | 62 +--

[PATCH v2 2/2] drm/amdkfd: Implement DMA buf fd export for RDMA

2022-03-16 Thread Felix Kuehling
Exports a DMA buf fd of a given KFD buffer handle. This is intended for the new upstreamable RDMA solution coming to UCX and libfabric. The corresponding user mode change (Thunk API and kfdtest) is here: https://github.com/RadeonOpenCompute/ROCT-Thunk-Interface/commits/fxkamd/dmabuf Signed-off-by

[PATCH v2 1/2] drm/amdgpu: Generalize KFD dmabuf import

2022-03-16 Thread Felix Kuehling
Use proper amdgpu_gem_prime_import function to handle all kinds of imports. Remember the dmabuf reference to enable proper multi-GPU attachment to multiple VMs without erroneously re-exporting the underlying BO multiple times. Signed-off-by: Felix Kuehling --- .../gpu/drm/amd/amdgpu/amdgpu_amdkf

Re: [PATCH 1/1] drm/amdgpu: set noretry=1 for GFX 10.3.4

2022-03-16 Thread Felix Kuehling
Ping? On 2022-03-02 15:20, Felix Kuehling wrote: Retry faults are not supported on GFX 10.3.4. Signed-off-by: Felix Kuehling --- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c b/drivers/gpu/drm/amd/amdgpu

Re: [PATCH v2 1/2] drm: Add GPU reset sysfs event

2022-03-16 Thread Rob Clark
On Tue, Mar 8, 2022 at 11:40 PM Shashank Sharma wrote: > > From: Shashank Sharma > > This patch adds a new sysfs event, which will indicate > the userland about a GPU reset, and can also provide > some information like: > - process ID of the process involved with the GPU reset > - process name of

2022 X.Org Foundation Membership deadline for voting in the election

2022-03-16 Thread Lyude Paul
The 2022 X.Org Foundation elections are rapidly approaching. We will be forwarding the election schedule and nominating process to the membership shortly. Please note that only current members can vote in the upcoming election, and that the deadline for new memberships or renewals to vote in the u

2022 X.Org Board of Directors Elections Nomination period is NOW

2022-03-16 Thread Lyude Paul
The Board consists of directors elected from the membership. Each year, an election is held to bring the total number of directors to eight. The four members receiving the highest vote totals will serve as directors for two year terms. The directors who received two year terms starting in 2021 we

Re: [PATCH v2 1/2] drm: Add GPU reset sysfs event

2022-03-16 Thread Rob Clark
On Wed, Mar 16, 2022 at 8:48 AM Alex Deucher wrote: > > On Wed, Mar 16, 2022 at 11:35 AM Rob Clark wrote: > > > > On Wed, Mar 16, 2022 at 7:12 AM Alex Deucher wrote: > > > > > > On Wed, Mar 16, 2022 at 4:48 AM Pekka Paalanen > > > wrote: > > > > > > [snip] > > > > With new UAPI comes the deman

Re: [PATCH] drm/amdgpu/gmc: use PCI BARs for APUs in passthrough

2022-03-16 Thread Michel Dänzer
On 2022-03-16 08:18, Christian König wrote: > Am 15.03.22 um 19:13 schrieb Alex Deucher: >> If the GPU is passed through to a guest VM, use the PCI >> BAR for CPU FB access rather than the physical address of >> carve out.  The physical address is not valid in a guest. >> >> v2: Fix HDP handing as

Re: [PATCH] drm/amdgpu/gmc: use PCI BARs for APUs in passthrough

2022-03-16 Thread Michel Dänzer
On 2022-03-16 16:55, Alex Deucher wrote: > On Wed, Mar 16, 2022 at 11:52 AM Michel Dänzer > wrote: >> >> On 2022-03-11 18:43, Michel Dänzer wrote: >>> On 2022-03-11 11:32, Christian König wrote: Am 11.03.22 um 11:24 schrieb Michel Dänzer: > On 2022-03-10 19:06, Alex Deucher wrote: >>

Re: [PATCH] drm/amdgpu/gmc: use PCI BARs for APUs in passthrough

2022-03-16 Thread Alex Deucher
On Wed, Mar 16, 2022 at 11:52 AM Michel Dänzer wrote: > > On 2022-03-11 18:43, Michel Dänzer wrote: > > On 2022-03-11 11:32, Christian König wrote: > >> Am 11.03.22 um 11:24 schrieb Michel Dänzer: > >>> On 2022-03-10 19:06, Alex Deucher wrote: > If the GPU is passed through to a guest VM, use

Re: [PATCH] drm/amdgpu/gmc: use PCI BARs for APUs in passthrough

2022-03-16 Thread Michel Dänzer
On 2022-03-11 18:43, Michel Dänzer wrote: > On 2022-03-11 11:32, Christian König wrote: >> Am 11.03.22 um 11:24 schrieb Michel Dänzer: >>> On 2022-03-10 19:06, Alex Deucher wrote: If the GPU is passed through to a guest VM, use the PCI BAR for CPU FB access rather than the physical addres

Re: [PATCH v2 1/2] drm: Add GPU reset sysfs event

2022-03-16 Thread Alex Deucher
On Wed, Mar 16, 2022 at 11:35 AM Rob Clark wrote: > > On Wed, Mar 16, 2022 at 7:12 AM Alex Deucher wrote: > > > > On Wed, Mar 16, 2022 at 4:48 AM Pekka Paalanen wrote: > > > > [snip] > > > With new UAPI comes the demand of userspace proof, not hand-waving. You > > > would not be proposing this n

Re: [PATCH v2 1/2] drm: Add GPU reset sysfs event

2022-03-16 Thread Rob Clark
On Wed, Mar 16, 2022 at 7:12 AM Alex Deucher wrote: > > On Wed, Mar 16, 2022 at 4:48 AM Pekka Paalanen wrote: > > [snip] > > With new UAPI comes the demand of userspace proof, not hand-waving. You > > would not be proposing this new interface if you didn't have use cases > > in mind, even just on

Re: [PATCH v2 1/2] drm: Add GPU reset sysfs event

2022-03-16 Thread Alex Deucher
On Wed, Mar 16, 2022 at 4:48 AM Pekka Paalanen wrote: > > On Tue, 15 Mar 2022 10:54:38 -0400 > Alex Deucher wrote: > > > On Mon, Mar 14, 2022 at 11:26 AM Pekka Paalanen wrote: > > > > > > On Mon, 14 Mar 2022 10:23:27 -0400 > > > Alex Deucher wrote: > > > > > > > On Fri, Mar 11, 2022 at 3:30 AM

Re: [PATCH 1/4] drm/amdkfd: refine event_interrupt_poison_consumption

2022-03-16 Thread Felix Kuehling
Am 2022-03-16 um 05:26 schrieb Tao Zhou: Combine reading and setting poison flag as one atomic operation and add print message for the function. Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c | 11 +-- 1 file changed, 5 insertions(+), 6 deletions(-) diff

RE: [PATCH 4/4] drm/amdkfd: add RAS poison consumption handling for UTCL2 (v2)

2022-03-16 Thread Zhang, Hawking
[AMD Official Use Only] Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: Zhou1, Tao Sent: Wednesday, March 16, 2022 17:26 To: amd-gfx@lists.freedesktop.org; Zhang, Hawking ; Kuehling, Felix ; Yang, Stanley ; Chai, Thomas Cc: Zhou1, Tao Subject: [PATCH 4/4] drm/am

RE: [PATCH 1/4] drm/amdkfd: refine event_interrupt_poison_consumption

2022-03-16 Thread Zhang, Hawking
[AMD Official Use Only] Hit send too quickly. With below addressed, the patch is Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: amd-gfx On Behalf Of Zhang, Hawking Sent: Wednesday, March 16, 2022 21:49 To: Zhou1, Tao ; amd-gfx@lists.freedesktop.org; Kuehling, Fe

RE: [PATCH 3/4] drm/amdgpu: add UTCL2 RAS poison query for Aldebaran (v2)

2022-03-16 Thread Zhang, Hawking
[AMD Official Use Only] V2 looks good to me Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: Zhou1, Tao Sent: Wednesday, March 16, 2022 17:26 To: amd-gfx@lists.freedesktop.org; Zhang, Hawking ; Kuehling, Felix ; Yang, Stanley ; Chai, Thomas Cc: Zhou1, Tao Subje

RE: [PATCH 2/4] drm/amdkfd: replace source_id with client_id for RAS poison consumption

2022-03-16 Thread Zhang, Hawking
[AMD Official Use Only] + case SOC15_IH_CLIENTID_SDMA5: + case SOC15_IH_CLIENTID_SDMA6: + case SOC15_IH_CLIENTID_SDMA7: Please drop the unnecessary case here since the feature is not available on all the ASICs. + case SOC15_IH_CLIENTID_GRBM_CP: + case SOC15_IH_CLIE

RE: [PATCH 1/4] drm/amdkfd: refine event_interrupt_poison_consumption

2022-03-16 Thread Zhang, Hawking
[AMD Official Use Only] + pr_warn("RAS poison consumption handling\n"); Given you already print out this, might be better add some more information to help identify which blocks issued this poison consumption interrupt. e.g. add client_id or source_id Regards, Hawking -Original Mess

Re: [PATCH] drm/amdgpu: drop redundant check of harvest info

2022-03-16 Thread Deucher, Alexander
[Public] Reviewed-by: Alex Deucher From: Chen, Guchun Sent: Wednesday, March 16, 2022 2:53 AM To: amd-gfx@lists.freedesktop.org ; Zhang, Hawking ; Koenig, Christian ; Pan, Xinhui ; Deucher, Alexander Cc: Chen, Guchun Subject: [PATCH] drm/amdgpu: drop redundan

Re: [PATCH 0/3] Move FPU related code from DCN3.1x drivers to DML folder

2022-03-16 Thread Rodrigo Siqueira Jordao
On 2022-03-07 10:47, Melissa Wen wrote: This series moves FPU code from DCN 3.1x drivers to dml/dcn31 folder to isolate FPU operations. For this, it creates dcn31_fpu files to centralize FPU operations and structs from dcn31x drivers, that include: - _vcs_dpi_ip_params_st and _vcs_dpi_soc_boun

Re: [PATCH v2] drm: Fix a infinite loop condition when order becomes 0

2022-03-16 Thread Christian König
Am 16.03.22 um 12:31 schrieb Matthew Auld: On 16/03/2022 06:34, Arunpravin Paneer Selvam wrote: handle a situation in the condition order-- == min_order, when order = 0 and min_order = 0, leading to order = -1, it now won't exit the loop. To avoid this problem, added a order check in the same co

Re: [PATCH v2] drm: Fix a infinite loop condition when order becomes 0

2022-03-16 Thread Matthew Auld
On 16/03/2022 06:34, Arunpravin Paneer Selvam wrote: handle a situation in the condition order-- == min_order, when order = 0 and min_order = 0, leading to order = -1, it now won't exit the loop. To avoid this problem, added a order check in the same condition, (i.e) when order is 0, we return -E

RE: [PATCH] WIP: drm/dp_mst: Add support for dumping topology ref histories from debugfs

2022-03-16 Thread Lin, Wayne
[Public] > -Original Message- > From: Lyude Paul > Sent: Wednesday, March 16, 2022 8:48 AM > To: Lin, Wayne > Cc: dri-de...@lists.freedesktop.org; amd-gfx@lists.freedesktop.org > Subject: Re: [PATCH] WIP: drm/dp_mst: Add support for dumping topology ref > histories from debugfs > > (Add

Re: [PATCH v3] drm/amdgpu: add workarounds for VCN TMZ issue on CHIP_RAVEN

2022-03-16 Thread Christian König
Am 16.03.22 um 10:57 schrieb Paul Menzel: Dear Christian, Am 16.03.22 um 10:41 schrieb Christian König: Am 16.03.22 um 07:21 schrieb Lang Yu: On 03/16/ , Paul Menzel wrote: Am 16.03.22 um 02:27 schrieb Lang Yu: On 03/15/ , Paul Menzel wrote: Am 14.03.22 um 03:45 schrieb Lang Yu: Than

Re: [PATCH v3] drm/amdgpu: add workarounds for VCN TMZ issue on CHIP_RAVEN

2022-03-16 Thread Paul Menzel
Dear Christian, Am 16.03.22 um 10:41 schrieb Christian König: Am 16.03.22 um 07:21 schrieb Lang Yu: On 03/16/ , Paul Menzel wrote: Am 16.03.22 um 02:27 schrieb Lang Yu: On 03/15/ , Paul Menzel wrote: Am 14.03.22 um 03:45 schrieb Lang Yu: Thank you for your patch. A shorter commit message

Re: [PATCH v3] drm/amdgpu: add workarounds for VCN TMZ issue on CHIP_RAVEN

2022-03-16 Thread Lang Yu
On 03/16/ , Christian König wrote: > Am 16.03.22 um 07:21 schrieb Lang Yu: > > On 03/16/ , Paul Menzel wrote: > > > Dear Lang, > > > > > > > > > Am 16.03.22 um 02:27 schrieb Lang Yu: > > > > On 03/15/ , Paul Menzel wrote: > > > > > Am 14.03.22 um 03:45 schrieb Lang Yu: > > > > > > > > > > Thank

Re: [PATCH v2] drm: Fix a infinite loop condition when order becomes 0

2022-03-16 Thread Christian König
Am 16.03.22 um 07:34 schrieb Arunpravin Paneer Selvam: handle a situation in the condition order-- == min_order, when order = 0 and min_order = 0, leading to order = -1, it now won't exit the loop. To avoid this problem, added a order check in the same condition, (i.e) when order is 0, we return

Re: [PATCH v3] drm/amdgpu: add workarounds for VCN TMZ issue on CHIP_RAVEN

2022-03-16 Thread Christian König
Am 16.03.22 um 07:21 schrieb Lang Yu: On 03/16/ , Paul Menzel wrote: Dear Lang, Am 16.03.22 um 02:27 schrieb Lang Yu: On 03/15/ , Paul Menzel wrote: Am 14.03.22 um 03:45 schrieb Lang Yu: Thank you for your patch. A shorter commit message summary would be: drm/amdgpu: Work around VNC TMZ i

[PATCH 4/4] drm/amdkfd: add RAS poison consumption handling for UTCL2 (v2)

2022-03-16 Thread Tao Zhou
Do RAS page retirement and use gpu reset as fallback in UTCL2 fault handler. v2: replace vm fault event with posion consumed event in UTCL2 poison consumption. Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c | 6 ++ 1 file changed, 6 insertions(+) diff --git a/d

[PATCH 3/4] drm/amdgpu: add UTCL2 RAS poison query for Aldebaran (v2)

2022-03-16 Thread Tao Zhou
Add help functions to query and reset RAS UTCL2 poison status. v2: implement it on amdgpu side and kfd only calls it. Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.c | 8 drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_gfx.h

[PATCH 2/4] drm/amdkfd: replace source_id with client_id for RAS poison consumption

2022-03-16 Thread Tao Zhou
client_id is more accruate here and we can deal with more different cases. Signed-off-by: Tao Zhou --- .../gpu/drm/amd/amdkfd/kfd_int_process_v9.c | 25 ++- 1 file changed, 19 insertions(+), 6 deletions(-) diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c b/drivers

[PATCH 1/4] drm/amdkfd: refine event_interrupt_poison_consumption

2022-03-16 Thread Tao Zhou
Combine reading and setting poison flag as one atomic operation and add print message for the function. Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c | 11 +-- 1 file changed, 5 insertions(+), 6 deletions(-) diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_int_p

Re: [PATCH v2 1/2] drm: Add GPU reset sysfs event

2022-03-16 Thread Pekka Paalanen
On Tue, 15 Mar 2022 10:54:38 -0400 Alex Deucher wrote: > On Mon, Mar 14, 2022 at 11:26 AM Pekka Paalanen wrote: > > > > On Mon, 14 Mar 2022 10:23:27 -0400 > > Alex Deucher wrote: > > > > > On Fri, Mar 11, 2022 at 3:30 AM Pekka Paalanen > > > wrote: > > > > > > > > On Thu, 10 Mar 2022 11:5

[PATCH] drm/amdgpu: fix off by one in amdgpu_gfx_kiq_acquire()

2022-03-16 Thread Dan Carpenter
This post-op should be a pre-op so that we do not pass -1 as the bit number to test_bit(). The current code will loop downwards from 63 to -1. After changing to a pre-op, it loops from 63 to 0. Fixes: 71c37505e7ea ("drm/amdgpu/gfx: move more common KIQ code to amdgpu_gfx.c") Signed-off-by: Dan

Re: [PATCH] drm/amdgpu/gmc: use PCI BARs for APUs in passthrough

2022-03-16 Thread Christian König
Am 15.03.22 um 19:13 schrieb Alex Deucher: If the GPU is passed through to a guest VM, use the PCI BAR for CPU FB access rather than the physical address of carve out. The physical address is not valid in a guest. v2: Fix HDP handing as suggested by Michel Signed-off-by: Alex Deucher Might