[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-29 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 Timothy Pearson (tpear...@raptorengineering.com) changed: What|Removed |Added Status|NEW

[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-29 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 --- Comment #6 from Timothy Pearson (tpear...@raptorengineering.com) --- Yes, my fault, sorry about that -- different box, unbeknownst to me had a different GPU (note to self, check lspci next time before decoding trace). To top it off, this

[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-29 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 --- Comment #5 from Alex Deucher (alexdeuc...@gmail.com) --- This doesn't look related to the first one. The first one is a vega10 asic according to the description; the second one is from an older VI asic. mmHDP_MEM_COHERENCY_FLUSH_CNTL is a

[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-29 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 --- Comment #4 from Timothy Pearson (tpear...@raptorengineering.com) --- Stack decodes to: arch/powerpc/include/asm/eeh.h:403 [if (EEH_POSSIBLE_ERROR(val, u32))] drivers/gpu/drm/amd/amdgpu/vi.c:913 [RREG32(mmHDP_MEM_COHERENCY_FLUSH_CNTL)]
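
For readers unfamiliar with the check named in the decoded stack, here is a minimal userspace sketch of the idea behind EEH_POSSIBLE_ERROR(val, u32): an MMIO read that returns all ones is treated as a possible EEH event and triggers a follow-up check. The names eeh_enabled_sketch, EEH_POSSIBLE_ERROR_SKETCH and read_needs_eeh_check are hypothetical stand-ins invented for illustration, and the sketch assumes the kernel macro compares the value against an all-ones pattern of the given type while EEH is enabled. It is not the kernel code itself and does not touch real hardware.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the kernel's "is EEH enabled" query.
 * Assumption: always enabled in this sketch. */
static bool eeh_enabled_sketch(void)
{
    return true;
}

/* Assumed shape of the check: the value equals all ones for its type,
 * and EEH is enabled. */
#define EEH_POSSIBLE_ERROR_SKETCH(val, type) \
    ((val) == (type)~0 && eeh_enabled_sketch())

/* Returns true when a 32-bit register read result looks like the
 * all-ones pattern that would warrant a follow-up EEH check. */
static bool read_needs_eeh_check(uint32_t val)
{
    return EEH_POSSIBLE_ERROR_SKETCH(val, uint32_t);
}

/* Small self-check showing the intended behaviour. */
static void self_check(void)
{
    assert(read_needs_eeh_check(0xFFFFFFFFu));   /* all ones: suspicious */
    assert(!read_needs_eeh_check(0x00000000u));  /* ordinary value */
    assert(!read_needs_eeh_check(0xFFFFFFFEu));  /* one bit clear */
}
```

In this sketch only the exact all-ones value is flagged, which is why a register read such as RREG32(mmHDP_MEM_COHERENCY_FLUSH_CNTL) can end up in the EEH path when the device stops responding.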

[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-28 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 --- Comment #3 from Timothy Pearson (tpear...@raptorengineering.com) --- Just had a chance to test on 5.4.0, still fails (haven't had a chance to bisect yet; I suspect it's more related to the 64-bit enablement on POWER in 5.4 than anything

[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-20 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 --- Comment #2 from Timothy Pearson (tpear...@raptorengineering.com) --- I am travelling now but can bisect when back at the lab next week.

[Bug 205585] [Regression] [amdgpu] AMD Vega 64 GPU invalid access and EEH under load

2019-11-20 Thread bugzilla-daemon
https://bugzilla.kernel.org/show_bug.cgi?id=205585 Alex Deucher (alexdeuc...@gmail.com) changed: What|Removed |Added CC|