Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-26 Thread Christian König
On 25.08.22 at 19:48, Bjorn Helgaas wrote: On Thu, Aug 25, 2022 at 10:18:28AM +0200, Christian König wrote: On 25.08.22 at 09:54, Lazar, Lijo wrote: On 8/25/2022 1:04 PM, Christian König wrote: On 25.08.22 at 08:40, Stefan Roese wrote: On 24.08.22 16:45, Tom Seewald wrote: On Wed, Aug 24,

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Bjorn Helgaas
On Thu, Aug 25, 2022 at 10:18:28AM +0200, Christian König wrote: > On 25.08.22 at 09:54, Lazar, Lijo wrote: > > On 8/25/2022 1:04 PM, Christian König wrote: > > > On 25.08.22 at 08:40, Stefan Roese wrote: > > > > On 24.08.22 16:45, Tom Seewald wrote: > > > > > On Wed, Aug 24, 2022 at 12:11 AM Laz

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Felix Kuehling
On 2022-08-24 at 01:10, Lazar, Lijo wrote: On 8/23/2022 10:34 PM, Tom Seewald wrote: On Sat, Aug 20, 2022 at 2:53 AM Lazar, Lijo wrote: Missed the remap part, the offset is here - https://elixir.bootlin.com/linux/v6.0-rc1/source/drivers/gpu/drm/amd/amdgpu/nv.c#L680 The trace is com

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Christian König
On 25.08.22 at 09:54, Lazar, Lijo wrote: On 8/25/2022 1:04 PM, Christian König wrote: On 25.08.22 at 08:40, Stefan Roese wrote: On 24.08.22 16:45, Tom Seewald wrote: On Wed, Aug 24, 2022 at 12:11 AM Lazar, Lijo wrote: Unfortunately, I don't have any NV platforms to test. Attached is an '

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Lazar, Lijo
On 8/25/2022 1:04 PM, Christian König wrote: On 25.08.22 at 08:40, Stefan Roese wrote: On 24.08.22 16:45, Tom Seewald wrote: On Wed, Aug 24, 2022 at 12:11 AM Lazar, Lijo wrote: Unfortunately, I don't have any NV platforms to test. Attached is an 'untested-patch' based on your trace logs.

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Christian König
On 25.08.22 at 08:40, Stefan Roese wrote: On 24.08.22 16:45, Tom Seewald wrote: On Wed, Aug 24, 2022 at 12:11 AM Lazar, Lijo wrote: Unfortunately, I don't have any NV platforms to test. Attached is an 'untested-patch' based on your trace logs. Thanks, Lijo Thank you for the patch. It appli

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Tom Seewald
On Wed, Aug 24, 2022 at 12:11 AM Lazar, Lijo wrote: > Unfortunately, I don't have any NV platforms to test. Attached is an > 'untested-patch' based on your trace logs. > > Thanks, > Lijo Thank you for the patch. It applied cleanly to v6.0-rc2 and after booting that kernel I no longer see any mess

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-25 Thread Stefan Roese
On 24.08.22 16:45, Tom Seewald wrote: On Wed, Aug 24, 2022 at 12:11 AM Lazar, Lijo wrote: Unfortunately, I don't have any NV platforms to test. Attached is an 'untested-patch' based on your trace logs. Thanks, Lijo Thank you for the patch. It applied cleanly to v6.0-rc2 and after booting tha

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-24 Thread Bjorn Helgaas
[Adding amdgpu folks] On Wed, Aug 17, 2022 at 11:45:15PM +, bugzilla-dae...@kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=216373 > > Bug ID: 216373 >Summary: Uncorrected errors reported for AMD GPU > Kernel Version: v6.0-rc1 > Regression:

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-24 Thread Tom Seewald
On Sat, Aug 20, 2022 at 2:53 AM Lazar, Lijo wrote: > > Missed the remap part, the offset is here - > > https://elixir.bootlin.com/linux/v6.0-rc1/source/drivers/gpu/drm/amd/amdgpu/nv.c#L680 > > > The trace is coming from *_flush_hdp. > > You may also check if *_remap_hdp_registers() is getting call

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU #forregzbot

2022-08-24 Thread Thorsten Leemhuis
TWIMC: this mail is primarily sent for documentation purposes and for regzbot, my Linux kernel regression tracking bot. These mails usually contain '#forregzbot' in the subject, to make them easy to spot and filter. [TLDR: I'm adding this regression report to the list of tracked regressions; all t

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-23 Thread Lazar, Lijo
On 8/23/2022 10:34 PM, Tom Seewald wrote: On Sat, Aug 20, 2022 at 2:53 AM Lazar, Lijo wrote: Missed the remap part, the offset is here - https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Felixir.bootlin.com%2Flinux%2Fv6.0-rc1%2Fsource%2Fdrivers%2Fgpu%2Fdrm%2Famd%2Famdgpu%2Fnv

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-20 Thread Lazar, Lijo
On 8/20/2022 12:37 AM, Bjorn Helgaas wrote: On Fri, Aug 19, 2022 at 12:13:03PM -0500, Bjorn Helgaas wrote: On Thu, Aug 18, 2022 at 03:38:12PM -0500, Bjorn Helgaas wrote: [Adding amdgpu folks] On Wed, Aug 17, 2022 at 11:45:15PM +, bugzilla-dae...@kernel.org wrote: https://nam11.safelink

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-19 Thread Bjorn Helgaas
On Fri, Aug 19, 2022 at 12:13:03PM -0500, Bjorn Helgaas wrote: > On Thu, Aug 18, 2022 at 03:38:12PM -0500, Bjorn Helgaas wrote: > > [Adding amdgpu folks] > > > > On Wed, Aug 17, 2022 at 11:45:15PM +, bugzilla-dae...@kernel.org wrote: > > > https://bugzilla.kernel.org/show_bug.cgi?id=216373 > >

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-19 Thread Bjorn Helgaas
On Thu, Aug 18, 2022 at 03:38:12PM -0500, Bjorn Helgaas wrote: > [Adding amdgpu folks] > > On Wed, Aug 17, 2022 at 11:45:15PM +, bugzilla-dae...@kernel.org wrote: > > https://bugzilla.kernel.org/show_bug.cgi?id=216373 > > > > Bug ID: 216373 > >Summary: Uncorrected erro

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-19 Thread Bjorn Helgaas
On Fri, Aug 19, 2022 at 02:03:59PM +0530, Lazar, Lijo wrote: > Or, it could be amdgpu or some other software component - > > register mmio base: 0x95E0 > Address 0x95e7f000 > > 0x95e7f000 indicates access from CPU to a register offset 0x7FE000. This > doesn't look like a valid register

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-19 Thread Lazar, Lijo
On 8/19/2022 12:35 PM, Christian König wrote: Hi Bjorn, On 18.08.22 at 22:38, Bjorn Helgaas wrote: [Adding amdgpu folks] On Wed, Aug 17, 2022 at 11:45:15PM +, bugzilla-dae...@kernel.org wrote: https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugzilla.kernel.org%2Fshow

Re: [Bug 216373] New: Uncorrected errors reported for AMD GPU

2022-08-19 Thread Christian König
Hi Bjorn, On 18.08.22 at 22:38, Bjorn Helgaas wrote: [Adding amdgpu folks] On Wed, Aug 17, 2022 at 11:45:15PM +, bugzilla-dae...@kernel.org wrote: https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fbugzilla.kernel.org%2Fshow_bug.cgi%3Fid%3D216373&data=05%7C01%7Cchristian.ko