Re: [PATCH 1/2] drm/amd/powerplay: Add UMD PState Renoir Msg Parameters

2019-09-03 Thread Wang, Kevin(Yang)
i think it's better to merge these two patches into one patch. after fixed: Reviewed-by: Kevin Wang Best Regards, Kevin From: amd-gfx on behalf of Liang, Prike Sent: Wednesday, September 4, 2019 2:01 PM To: amd-gfx@lists.freedesktop.org Cc: Deucher, Alexander

[PATCH 2/2] drm/amd/display: update renoir_ip_offset.h

2019-09-03 Thread Liu, Aaron
This patch updates MP1_BASE in renoir_ip_offset.h Signed-off-by: Aaron Liu --- drivers/gpu/drm/amd/include/renoir_ip_offset.h | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/include/renoir_ip_offset.h b/drivers/gpu/drm/amd/include/renoir_ip_offset.h index

[PATCH 1/2] drm/amd/display: bypass stereo programming

2019-09-03 Thread Liu, Aaron
From: Roman Li hack to avoid crash on renoir should be resolved after upcoming gpio refactoring promotion Signed-off-by: Roman Li --- drivers/gpu/drm/amd/display/dc/dcn10/dcn10_hw_sequencer.c | 6 ++ 1 file changed, 6 insertions(+) diff --git a/drivers/gpu/drm/amd/display/dc/dcn10/dcn10_h

[PATCH 2/2] drm/amd/powerplay: implement sysfs for getting dpm clock

2019-09-03 Thread Liang, Prike
With the common interface print_clk_levels can get the following dpm clock: -pp_dpm_dcefclk -pp_dpm_fclk -pp_dpm_mclk -pp_dpm_sclk -pp_dpm_socclk Signed-off-by: Prike Liang Reviewed-by: Evan Quan --- drivers/gpu/drm/amd/powerplay/renoir_ppt.c | 70 ++ drivers/gpu/dr

[PATCH 1/2] drm/amd/powerplay: Add UMD PState Renoir Msg Parameters

2019-09-03 Thread Liang, Prike
Add UMD PState macro definition for PState update. Signed-off-by: Prike Liang reviewed-by: Evan Quan --- drivers/gpu/drm/amd/powerplay/renoir_ppt.h | 5 + 1 file changed, 5 insertions(+) diff --git a/drivers/gpu/drm/amd/powerplay/renoir_ppt.h b/drivers/gpu/drm/amd/powerplay/renoir_ppt.h i

RE: libdrm patch merge request

2019-09-03 Thread Chen, Guchun
Resend in case of missing. Hi Alex, I need your help to merge libdrm patches from my personal repository below to drm master branch. Thanks. Regards, Guchun From: Chen, Guchun Sent: Friday, August 30, 2019 2:19 PM To: 'Michel Dänzer' ; 'Alex Deucher' Cc: Zhou1, Tao ; 'amd-gfx@lists.freedeskto

Re: [PATCH] drm/amd/powerplay: replace smu->table_count with SMU_TABLE_COUNT in smu

2019-09-03 Thread Wang, Kevin(Yang)
From: Quan, Evan Sent: Wednesday, September 4, 2019 9:17 AM To: Wang, Kevin(Yang) ; amd-gfx@lists.freedesktop.org ; Huang, Ray Cc: Wang, Kevin(Yang) Subject: RE: [PATCH] drm/amd/powerplay: replace smu->table_count with SMU_TABLE_COUNT in smu Please drop the t

RE: [PATCH] drm/amd/powerplay: replace smu->table_count with SMU_TABLE_COUNT in smu

2019-09-03 Thread Quan, Evan
Please drop the table_count member from the smu_table structure as it's totally unused. With that fixed, the patch is reviewed-by: Evan Quan > -Original Message- > From: amd-gfx On Behalf Of > Wang, Kevin(Yang) > Sent: 2019年9月3日 16:17 > To: amd-gfx@lists.freedesktop.org; Huang, Ray > C

Re: [PATCH v2 15/27] drm/dp_mst: Cleanup drm_dp_send_link_address() a bit

2019-09-03 Thread Dave Airlie
On Wed, 4 Sep 2019 at 06:48, Lyude Paul wrote: > > Declare local pointer to the drm_dp_link_address_ack_reply struct > instead of constantly dereferencing it through the union in > txmsg->reply. Then, invert the order of conditionals so we don't have to > do the bulk of the work inside them, and c

Re: [PATCH v2 11/27] drm/dp_mst: Constify guid in drm_dp_get_mst_branch_by_guid()

2019-09-03 Thread Dave Airlie
On Wed, 4 Sep 2019 at 06:48, Lyude Paul wrote: > > And it's helper, we'll be using this in just a moment. > Reviewed-by: Dave Airlie > Cc: Juston Li > Cc: Imre Deak > Cc: Ville Syrjälä > Cc: Harry Wentland > Cc: Daniel Vetter > Signed-off-by: Lyude Paul > --- > drivers/gpu/drm/drm_dp_mst

[PATCH] drm: add drm device name

2019-09-03 Thread Jiang, Sonny
Add DRM device name and use DRM_IOCTL_VERSION ioctl drmVersion::desc passing it to user space instead of unused DRM driver name descriptor. Change-Id: I809f6d3e057111417efbe8fa7cab8f0113ba4b21 Signed-off-by: Sonny Jiang --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 2 ++ drivers/gpu/drm/drm

Re: [PATCH v2 06/27] drm/dp_mst: Combine redundant cases in drm_dp_encode_sideband_req()

2019-09-03 Thread Dave Airlie
On Wed, 4 Sep 2019 at 06:48, Lyude Paul wrote: > > Noticed this while working on adding a drm_dp_decode_sideband_req(). > DP_POWER_DOWN_PHY/DP_POWER_UP_PHY both use the same struct, so we can > just combine their cases. both use the same struct as enum path resources? Since otherwise the patch d

Re: [PATCH 2/2] drm/amdgpu: cleanup PTE flag generation

2019-09-03 Thread Liu, Shaoyun
Maybe  it's not necessary to separate the mtype from the get_vm_pte function , so we just need one  asic specific functions (get_vm_pte) . What make me confusing originally is  the  the  logic  to setup the  PTE flag.   We first map the  UAPI flags to HW specific PTE flag , save it into mapping

Re: [PATCH 2/3] drm/amdgpu: reserve at least 4MB of VRAM for page tables v2

2019-09-03 Thread Kuehling, Felix
On 2019-09-03 5:09 a.m., Christian König wrote: > This hopefully helps reduce the contention for page tables. > > v2: adjust maximum reported VRAM size as well > > Signed-off-by: Christian König I'll need to do something similar (and also take the vram_pin_size into account) in amdgpu_amdkfd_res

RE: [PATCH] drm/amd/powerplay: replace smu->table_count with SMU_TABLE_COUNT in smu

2019-09-03 Thread Huang, Ray
Acked-by: Huang Rui -Original Message- From: Wang, Kevin(Yang) Sent: Tuesday, September 3, 2019 4:17 AM To: amd-gfx@lists.freedesktop.org; Huang, Ray Cc: Wang, Kevin(Yang) Subject: [PATCH] drm/amd/powerplay: replace smu->table_count with SMU_TABLE_COUNT in smu fix bellow patch issue

[PATCH v2 26/27] drm/dp_mst: Also print unhashed pointers for malloc/topology references

2019-09-03 Thread Lyude Paul
Currently we only print mstb/port pointer addresses in our malloc and topology refcount functions using the hashed-by-default %p, but unfortunately if you're trying to debug a use-after-free error caused by a refcounting error then this really isn't terribly useful. On the other hand though, everyt

[PATCH v2 27/27] drm/dp_mst: Add topology ref history tracking for debugging

2019-09-03 Thread Lyude Paul
For very subtle mistakes with topology refs, it can be rather difficult to trace them down with the debugging info that we already have. I had one such issue recently while trying to implement suspend/resume reprobing for MST, and ended up coming up with this. Inspired by Chris Wilson's wakeref tr

[PATCH v2 25/27] drm/dp_mst: Add basic topology reprobing when resuming

2019-09-03 Thread Lyude Paul
Finally! For a very long time, our MST helpers have had one very annoying issue: They don't know how to reprobe the topology state when coming out of suspend. This means that if a user has a machine connected to an MST topology and decides to suspend their machine, we lose all topology changes that

[PATCH v2 23/27] drm/amdgpu: Iterate through DRM connectors correctly

2019-09-03 Thread Lyude Paul
Currently, every single piece of code in amdgpu that loops through connectors does it incorrectly and doesn't use the proper list iteration helpers, drm_connector_list_iter_begin() and drm_connector_list_iter_end(). Yeesh. So, do that. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry We

[PATCH v2 24/27] drm/amdgpu/dm: Resume short HPD IRQs before resuming MST topology

2019-09-03 Thread Lyude Paul
Since we're going to be reprobing the entire topology state on resume now using sideband transactions, we need to ensure that we actually have short HPD irqs enabled before calling drm_dp_mst_topology_mgr_resume(). So, do that. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland C

[PATCH v2 17/27] drm/dp_mst: Rename drm_dp_add_port and drm_dp_update_port

2019-09-03 Thread Lyude Paul
The names for these functions are rather confusing. drm_dp_add_port() sounds like a function that would simply create a port and add it to a topology, and do nothing more. Similarly, drm_dp_update_port() would be assumed to be the function that should be used to update port information after initia

[PATCH v2 21/27] drm/dp_mst: Don't forget to update port->input in drm_dp_mst_handle_conn_stat()

2019-09-03 Thread Lyude Paul
This probably hasn't caused any problems up until now since it's probably nearly impossible to encounter this in the wild, however if we were to receive a connection status notification from the MST hub after resume while we're in the middle of reprobing the link addresses for a topology then there

[PATCH v2 15/27] drm/dp_mst: Cleanup drm_dp_send_link_address() a bit

2019-09-03 Thread Lyude Paul
Declare local pointer to the drm_dp_link_address_ack_reply struct instead of constantly dereferencing it through the union in txmsg->reply. Then, invert the order of conditionals so we don't have to do the bulk of the work inside them, and can wrap lines even less. Then finally, rearrange variable

[PATCH v2 19/27] drm/dp_mst: Handle UP requests asynchronously

2019-09-03 Thread Lyude Paul
Once upon a time, hotplugging devices on MST branches actually worked in DRM. Now, it only works in amdgpu (likely because of how it's hotplug handlers are implemented). On both i915 and nouveau, hotplug notifications from MST branches are noticed - but trying to respond to them causes messaging ti

[PATCH v2 22/27] drm/nouveau: Don't grab runtime PM refs for HPD IRQs

2019-09-03 Thread Lyude Paul
In order for suspend/resume reprobing to work, we need to be able to perform sideband communications during suspend/resume, along with runtime PM suspend/resume. In order to do so, we also need to make sure that nouveau doesn't bother grabbing a runtime PM reference to do so, since otherwise we'll

[PATCH v2 18/27] drm/dp_mst: Remove lies in {up,down}_rep_recv documentation

2019-09-03 Thread Lyude Paul
These are most certainly accessed from far more than the mgr work. In fact, up_req_recv is -only- ever accessed from outside the mgr work. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Cc: Daniel Vetter Signed-off-by: Lyude Paul --- include/drm/drm_dp_mst_helper.h | 8 ++-

[PATCH v2 14/27] drm/dp_mst: Destroy topology_mgr mutexes

2019-09-03 Thread Lyude Paul
Turns out we've been forgetting for a while now to actually destroy any of the mutexes that we create in drm_dp_mst_topology_mgr. So, let's do that. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Cc: Daniel Vetter Signed-off-by: Lyude Paul --- drivers/gpu/drm/drm_dp_mst_to

[PATCH v2 16/27] drm/dp_mst: Refactor pdt setup/teardown, add more locking

2019-09-03 Thread Lyude Paul
Since we're going to be implementing suspend/resume reprobing very soon, we need to make sure we are extra careful to ensure that our locking actually protects the topology state where we expect it to. Turns out this isn't the case with drm_dp_port_setup_pdt() and drm_dp_port_teardown_pdt(), both o

[PATCH v2 20/27] drm/dp_mst: Protect drm_dp_mst_port members with connection_mutex

2019-09-03 Thread Lyude Paul
Yes-you read that right. Currently there is literally no locking in place for any of the drm_dp_mst_port struct members that can be modified in response to a link address response, or a connection status response. Which literally means if we're unlucky enough to have any sort of hotplugging event h

[PATCH v2 09/27] drm/dp_mst: Refactor drm_dp_send_enum_path_resources

2019-09-03 Thread Lyude Paul
Use more pointers so we don't have to write out txmsg->reply.u.path_resources each time. Also, fix line wrapping + rearrange local variables. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Reviewed-by: Daniel Vetter Signed-off-by: Lyude Paul --- drivers/gpu/drm/drm_dp_mst_

[PATCH v2 13/27] drm/dp_mst: Refactor drm_dp_mst_handle_down_rep()

2019-09-03 Thread Lyude Paul
* Remove the big ugly have_eomt conditional * Store &mgr->down_rep_recv.initial_hdr in a var to make line wrapping easier * Remove duplicate memset() calls * Actually wrap lines Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Reviewed-by: Daniel Vetter Signed-off-by: Lyude

[PATCH v2 06/27] drm/dp_mst: Combine redundant cases in drm_dp_encode_sideband_req()

2019-09-03 Thread Lyude Paul
Noticed this while working on adding a drm_dp_decode_sideband_req(). DP_POWER_DOWN_PHY/DP_POWER_UP_PHY both use the same struct, so we can just combine their cases. Signed-off-by: Lyude Paul Cc: Daniel Vetter Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland --- drivers/gpu/d

[PATCH v2 07/27] drm/dp_mst: Add sideband down request tracing + selftests

2019-09-03 Thread Lyude Paul
Unfortunately the DP MST helpers do not have much in the way of debugging utilities. So, let's add some! This adds basic debugging output for down sideband requests that we send from the driver, so that we can actually discern what's happening when sideband requests timeout. Since there wasn't re

[PATCH v2 12/27] drm/dp_mst: Refactor drm_dp_mst_handle_up_req()

2019-09-03 Thread Lyude Paul
There's a couple of changes here, so to summarize: * Remove the big ugly mgr->up_req_recv.have_eomt conditional to save on indenting * Store &mgr->up_req_recv.initial_hdr in a variable so we don't keep going over 80 character long lines * De-duplicate code for calling drm_dp_send_up_ack_reply(

[PATCH v2 04/27] drm/dp_mst: Move test_calc_pbn_mode() into an actual selftest

2019-09-03 Thread Lyude Paul
Yes, apparently we've been testing this for every single driver load for quite a long time now. At least that means our PBN calculation is solid! Anyway, introduce self tests for MST and move this into there. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Reviewed-by: Daniel

[PATCH v2 10/27] drm/dp_mst: Remove huge conditional in drm_dp_mst_handle_up_req()

2019-09-03 Thread Lyude Paul
Which reduces indentation and makes this function more legible. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Cc: Daniel Vetter Signed-off-by: Lyude Paul --- drivers/gpu/drm/drm_dp_mst_topology.c | 90 +-- 1 file changed, 45 insertions(+), 45 delet

[PATCH v2 08/27] drm/dp_mst: Remove PDT teardown in drm_dp_destroy_port() and refactor

2019-09-03 Thread Lyude Paul
This will allow us to add some locking for port->* members, in particular the PDT and ->connector, which can't be done from drm_dp_destroy_port() since we don't know what locks the caller might be holding. Changes since v2: * Clarify commit message Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä

[PATCH v2 11/27] drm/dp_mst: Constify guid in drm_dp_get_mst_branch_by_guid()

2019-09-03 Thread Lyude Paul
And it's helper, we'll be using this in just a moment. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Cc: Daniel Vetter Signed-off-by: Lyude Paul --- drivers/gpu/drm/drm_dp_mst_topology.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/d

[PATCH v2 05/27] drm/print: Add drm_err_printer()

2019-09-03 Thread Lyude Paul
A simple convienence function that returns a drm_printer which prints using pr_err() Changes since v1: * Make __drm_printfn_err() more consistent with DRM_ERROR() - danvet Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Reviewed-by: Daniel Vetter Signed-off-by: Lyude Paul -

[PATCH v2 00/27] DP MST Refactors + debugging tools + suspend/resume reprobing

2019-09-03 Thread Lyude Paul
This is the large series for adding MST suspend/resume reprobing that I've been working on for quite a while now. In addition, I: - Refactored and cleaned up any code I ended up digging through in the process of understanding how some parts of these helpers worked. - Added some debugging tools a

[PATCH v2 01/27] drm/dp_mst: Move link address dumping into a function

2019-09-03 Thread Lyude Paul
Makes things easier to read. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Reviewed-by: Daniel Vetter Signed-off-by: Lyude Paul --- drivers/gpu/drm/drm_dp_mst_topology.c | 35 ++- 1 file changed, 23 insertions(+), 12 deletions(-) diff --git a/driv

[PATCH v2 02/27] drm/dp_mst: Get rid of list clear in destroy_connector_work

2019-09-03 Thread Lyude Paul
This seems to be some leftover detritus from before the port/mstb kref cleanup and doesn't do anything anymore, so get rid of it. Cc: Juston Li Cc: Imre Deak Cc: Ville Syrjälä Cc: Harry Wentland Reviewed-by: Daniel Vetter Signed-off-by: Lyude Paul --- drivers/gpu/drm/drm_dp_mst_topology.c |

[PATCH v2 03/27] drm/dp_mst: Destroy MSTBs asynchronously

2019-09-03 Thread Lyude Paul
When reprobing an MST topology during resume, we have to account for the fact that while we were suspended it's possible that mstbs may have been removed from any ports in the topology. Since iterating downwards in the topology requires that we hold &mgr->lock, destroying MSTBs from this context wo

Re: [PATCH RFC v4 01/16] drm: Add drm_minor_for_each

2019-09-03 Thread Kenny Ho
On Tue, Sep 3, 2019 at 4:12 PM Daniel Vetter wrote: > On Tue, Sep 3, 2019 at 9:45 PM Kenny Ho wrote: > > On Tue, Sep 3, 2019 at 3:57 AM Daniel Vetter wrote: > > > Iterating over minors for cgroups sounds very, very wrong. Why do we care > > > whether a buffer was allocated through kms dumb vs re

Re: [PATCH 1/3] drm/amdgpu: use moving fence instead of exclusive for VM updates

2019-09-03 Thread Kuehling, Felix
On 2019-09-02 6:52 a.m., Christian König wrote: > Make VM updates depend on the moving fence instead of the exclusive one. In effect, this makes the page table update depend on the last move of the BO, rather than the last change of the buffer contents. Makes sense. Reviewed-by: Felix Kuehling

Re: [PATCH 2/2] drm/amdgpu: cleanup PTE flag generation

2019-09-03 Thread Kuehling, Felix
On 2019-09-02 10:57 a.m., Christian König wrote: > Move the ASIC specific code into a new callback function. NAK. I believe the BUG_ONs you're adding will trigger with ROCm on Hawaii and Kaveri. See inline ... ROCm user mode doesn't care that the page table doesn't have an executable bit on tho

Re: gnome-shell stuck because of amdgpu driver [5.3 RC5]

2019-09-03 Thread Daniel Vetter
On Tue, Sep 3, 2019 at 8:07 PM Mikhail Gavrilov wrote: > > On Tue, 3 Sep 2019 at 13:21, Hillf Danton wrote: > > > > Describe the problems you are experiencing please. > > Say is the screen locked up? Machine lockedup? > > Anything unnormal after you see the warning? > > > > According to my observ

Re: [PATCH AUTOSEL 4.19 044/167] drm/amdgpu: validate user pitch alignment

2019-09-03 Thread Daniel Vetter
On Tue, Sep 3, 2019 at 10:01 PM Sasha Levin wrote: > > On Tue, Sep 03, 2019 at 07:03:47PM +0200, Greg KH wrote: > >On Tue, Sep 03, 2019 at 06:40:43PM +0200, Michel Dänzer wrote: > >> On 2019-09-03 6:23 p.m., Sasha Levin wrote: > >> > From: Yu Zhao > >> > > >> > [ Upstream commit 89f23b6efef554766

Re: [PATCH RFC v4 01/16] drm: Add drm_minor_for_each

2019-09-03 Thread Daniel Vetter
On Tue, Sep 3, 2019 at 9:45 PM Kenny Ho wrote: > > On Tue, Sep 3, 2019 at 3:57 AM Daniel Vetter wrote: > > > > On Thu, Aug 29, 2019 at 02:05:18AM -0400, Kenny Ho wrote: > > > To allow other subsystems to iterate through all stored DRM minors and > > > act upon them. > > > > > > Also exposes drm_m

Re: [PATCH AUTOSEL 4.19 044/167] drm/amdgpu: validate user pitch alignment

2019-09-03 Thread Sasha Levin
On Tue, Sep 03, 2019 at 07:03:47PM +0200, Greg KH wrote: On Tue, Sep 03, 2019 at 06:40:43PM +0200, Michel Dänzer wrote: On 2019-09-03 6:23 p.m., Sasha Levin wrote: > From: Yu Zhao > > [ Upstream commit 89f23b6efef554766177bf51aa754bce14c3e7da ] Hold your horses! This commit and c4a32b266da7bb

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Daniel Vetter
On Tue, Sep 3, 2019 at 8:50 PM Tejun Heo wrote: > > Hello, Daniel. > > On Tue, Sep 03, 2019 at 09:55:50AM +0200, Daniel Vetter wrote: > > > * While breaking up and applying control to different types of > > > internal objects may seem attractive to folks who work day in and > > > day out with

Re: [PATCH RFC v4 01/16] drm: Add drm_minor_for_each

2019-09-03 Thread Kenny Ho
On Tue, Sep 3, 2019 at 3:57 AM Daniel Vetter wrote: > > On Thu, Aug 29, 2019 at 02:05:18AM -0400, Kenny Ho wrote: > > To allow other subsystems to iterate through all stored DRM minors and > > act upon them. > > > > Also exposes drm_minor_acquire and drm_minor_release for other subsystem > > to ha

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Kenny Ho
On Tue, Sep 3, 2019 at 5:20 AM Daniel Vetter wrote: > > On Tue, Sep 3, 2019 at 10:24 AM Koenig, Christian > wrote: > > > > Am 03.09.19 um 10:02 schrieb Daniel Vetter: > > > On Thu, Aug 29, 2019 at 02:05:17AM -0400, Kenny Ho wrote: > > >> With this RFC v4, I am hoping to have some consensus on a m

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Kenny Ho
Hi Tejun, Thanks for looking into this. I can definitely help where I can and I am sure other experts will jump in if I start misrepresenting the reality :) (as Daniel already have done.) Regarding your points, my understanding is that there isn't really a TTM vs GEM situation anymore (there is

RE: [PATCH v4 3/3] dmr/amdgpu: Add system auto reboot to RAS.

2019-09-03 Thread Zhang, Hawking
Series is Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: amd-gfx On Behalf Of Andrey Grodzovsky Sent: 2019年9月4日 0:44 To: amd-gfx@lists.freedesktop.org Cc: alexdeuc...@gmail.com; ckoenig.leichtzumer...@gmail.com; Zhou1, Tao ; Grodzovsky, Andrey ; Zhang, Hawking S

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Tejun Heo
Hello, Daniel. On Tue, Sep 03, 2019 at 09:55:50AM +0200, Daniel Vetter wrote: > > * While breaking up and applying control to different types of > > internal objects may seem attractive to folks who work day in and > > day out with the subsystem, they aren't all that useful to users and > >

Re: gnome-shell stuck because of amdgpu driver [5.3 RC5]

2019-09-03 Thread Mikhail Gavrilov
On Tue, 3 Sep 2019 at 13:21, Hillf Danton wrote: > > Describe the problems you are experiencing please. > Say is the screen locked up? Machine lockedup? > Anything unnormal after you see the warning? > According to my observations, all "gnome shell stuck warning" happened when me not sitting on t

Re: [PATCH AUTOSEL 4.19 044/167] drm/amdgpu: validate user pitch alignment

2019-09-03 Thread Greg KH
On Tue, Sep 03, 2019 at 06:40:43PM +0200, Michel Dänzer wrote: > On 2019-09-03 6:23 p.m., Sasha Levin wrote: > > From: Yu Zhao > > > > [ Upstream commit 89f23b6efef554766177bf51aa754bce14c3e7da ] > > Hold your horses! > > This commit and c4a32b266da7bb702e60381ca0c35eaddbc89a6c had to be > reve

[PATCH v4 1/3] drm/amdgpu: Fix bugs in amdgpu_device_gpu_recover in XGMI case.

2019-09-03 Thread Andrey Grodzovsky
Issue 1: In XGMI case amdgpu_device_lock_adev for other devices in hive was called to late, after access to their repsective schedulers. So relocate the lock to the begining of accessing the other devs. Issue 2: Using amdgpu_device_ip_need_full_reset to switch the device list from all devices in

[PATCH v4 2/3] dmr/amdgpu: Avoid HW GPU reset for RAS.

2019-09-03 Thread Andrey Grodzovsky
Problem: Under certain conditions, when some IP bocks take a RAS error, we can get into a situation where a GPU reset is not possible due to issues in RAS in SMU/PSP. Temporary fix until proper solution in PSP/SMU is ready: When uncorrectable error happens the DF will unconditionally broadcast err

[PATCH v4 3/3] dmr/amdgpu: Add system auto reboot to RAS.

2019-09-03 Thread Andrey Grodzovsky
In case of RAS error allow user configure auto system reboot through ras_ctrl. This is also part of the temproray work around for the RAS hang problem. v4: Use latest kernel API for disk sync. Signed-off-by: Andrey Grodzovsky --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 14 ++ d

Re: [PATCH AUTOSEL 4.19 044/167] drm/amdgpu: validate user pitch alignment

2019-09-03 Thread Michel Dänzer
On 2019-09-03 6:23 p.m., Sasha Levin wrote: > From: Yu Zhao > > [ Upstream commit 89f23b6efef554766177bf51aa754bce14c3e7da ] Hold your horses! This commit and c4a32b266da7bb702e60381ca0c35eaddbc89a6c had to be reverted, as they caused regressions. See commits 25ec429e86bb790e40387a550f0501d0ac5

[PATCH AUTOSEL 4.19 117/167] drm/amdgpu/{uvd, vcn}: fetch ring's read_ptr after alloc

2019-09-03 Thread Sasha Levin
From: Shirish S [ Upstream commit 517b91f4cde3043d77b2178548473e8545ef07cb ] [What] readptr read always returns zero, since most likely these blocks are either power or clock gated. [How] fetch rptr after amdgpu_ring_alloc() which informs the power management code that the block is about to be

[PATCH AUTOSEL 4.19 114/167] drm/amdkfd: Add missing Polaris10 ID

2019-09-03 Thread Sasha Levin
From: Kent Russell [ Upstream commit 0a5a9c276c335870a1cecc4f02b76d6d6f663c8b ] This was added to amdgpu but was missed in amdkfd Signed-off-by: Kent Russell Reviewed-by: Alex Deucher Signed-off-by: Alex Deucher Cc: sta...@vger.kernel.rg Signed-off-by: Sasha Levin --- drivers/gpu/drm/amd/a

[PATCH AUTOSEL 4.19 116/167] drm/amdgpu: fix ring test failure issue during s3 in vce 3.0 (V2)

2019-09-03 Thread Sasha Levin
From: Louis Li [ Upstream commit ce0e22f5d886d1b56c7ab4347c45b9ac5fcc058d ] [What] vce ring test fails consistently during resume in s3 cycle, due to mismatch read & write pointers. On debug/analysis its found that rptr to be compared is not being correctly updated/read, which leads to this fail

[PATCH AUTOSEL 4.19 045/167] drm/amdgpu: validate user GEM object size

2019-09-03 Thread Sasha Levin
From: Yu Zhao [ Upstream commit c4a32b266da7bb702e60381ca0c35eaddbc89a6c ] When creating frame buffer, userspace may request to attach to a previously allocated GEM object that is smaller than what GPU requires. Validation must be done to prevent out-of-bound DMA, otherwise it could be exploited

[PATCH AUTOSEL 4.19 024/167] drm/amd/dm: Understand why attaching path/tile properties are needed

2019-09-03 Thread Sasha Levin
From: Lyude Paul [ Upstream commit 04ac4b0ed412f65230b456fcd9aa07e13befff89 ] Path property is used for userspace to know what MST connector goes to what actual DRM DisplayPort connector, the tiling property is for tiling configurations. Not sure what else there is to figure out. Signed-off-b

[PATCH AUTOSEL 4.19 022/167] powerplay: Respect units on max dcfclk watermark

2019-09-03 Thread Sasha Levin
From: David Francis [ Upstream commit f191415b24a3ad3fa22088af7cd7fc328a2f469f ] In a refactor, the watermark clock inputs to powerplay from DC were changed from units of 10kHz to kHz clocks. One division by 100 was not converted into a division by 1000. Signed-off-by: David Francis Reviewed-

[PATCH AUTOSEL 4.19 023/167] drm/amd/pp: Fix truncated clock value when set watermark

2019-09-03 Thread Sasha Levin
From: Rex Zhu [ Upstream commit 4d454e9ffdb1ef5a51ebc147b5389c96048db683 ] the clk value should be tranferred to MHz first and then transfer to uint16. otherwise, the clock value will be truncated. Reviewed-by: Alex Deucher Reported-by: Hersen Wu Signed-off-by: Rex Zhu Signed-off-by: Alex De

[PATCH AUTOSEL 4.19 044/167] drm/amdgpu: validate user pitch alignment

2019-09-03 Thread Sasha Levin
From: Yu Zhao [ Upstream commit 89f23b6efef554766177bf51aa754bce14c3e7da ] Userspace may request pitch alignment that is not supported by GPU. Some requests 32, but GPU ignores it and uses default 64 when cpp is 4. If GEM object is allocated based on the smaller alignment, GPU DMA will go out of

[PATCH AUTOSEL 4.19 010/167] drm/amdgpu: Update gc_9_0 golden settings.

2019-09-03 Thread Sasha Levin
From: Feifei Xu [ Upstream commit c55045adf7210d246a016c961916f078ed31a951 ] Add mmDB_DEBUG3 settings. Signed-off-by: Feifei Xu Reviewed-by: Evan Quan Signed-off-by: Alex Deucher Cc: sta...@vger.kernel.org Signed-off-by: Sasha Levin --- drivers/gpu/drm/amd/amdgpu/gfx_v9_0.c | 1 + 1 file c

[PATCH AUTOSEL 4.19 009/167] drm/amdgpu/gfx9: Update gfx9 golden settings.

2019-09-03 Thread Sasha Levin
From: Feifei Xu [ Upstream commit 54d682d9a5b357eb711994fa94ef1bc44d7ce9d9 ] Update the goldensettings for vega20. Signed-off-by: Feifei Xu Signed-off-by: Evan Quan Reviewed-by: Hawking Zhang Reviewed-by: Alex Deucher Signed-off-by: Alex Deucher Signed-off-by: Sasha Levin --- drivers/gpu

RE: [PATCH 2/2] drm/amdgpu: Disable page faults while reading user wptrs

2019-09-03 Thread Zeng, Oak
Reviewed-by: Oak Zeng Regards, Oak -Original Message- From: amd-gfx On Behalf Of Kuehling, Felix Sent: Friday, August 30, 2019 1:15 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH 2/2] drm/amdgpu: Disable page faults while reading user wptrs These wptrs must be pinned and GPU acc

Re: linux-next: Tree for Sep 3 (gpu/drm/amd/display/)

2019-09-03 Thread Randy Dunlap
On 9/3/19 3:17 AM, Stephen Rothwell wrote: > Hi all, > > News: I will only be doing 1 more release before I leave for Kernel > Summit (there may be some reports on Thursday, but I doubt I will have > time to finish the full release) and then no more until Sept 30. > > Changes since 20190902: >

RE: [PATCH] drm/amd/amdgpu: add sw_fini interface for df_funcs

2019-09-03 Thread Kim, Jonathan
Reviewed-by: Jonathan Kim -Original Message- From: amd-gfx On Behalf Of Jack Zhang Sent: Tuesday, September 3, 2019 5:54 AM To: amd-gfx@lists.freedesktop.org Cc: Zhang, Jack (Jian) Subject: [PATCH] drm/amd/amdgpu: add sw_fini interface for df_funcs [CAUTION: External Email] add sw_fin

RE: [PATCH 1/2] drm/amdgpu: cleanup mtype mapping

2019-09-03 Thread Zeng, Oak
This is nice clean up. Acked-by: Oak Zeng Regards, Oak -Original Message- From: amd-gfx On Behalf Of Christian König Sent: Monday, September 2, 2019 10:58 AM To: amd-gfx@lists.freedesktop.org Subject: [PATCH 1/2] drm/amdgpu: cleanup mtype mapping Unify how we map the UAPI flags to the

Re: ?????? ?????? Bug: amdgpu drm driver cause process into Disk sleep state

2019-09-03 Thread Koenig, Christian
This is just a GPU lock, please open up a bug report on freedesktop.org and attach the full dmesg and which version of Mesa you are using. Regards, Christian. Am 03.09.19 um 15:16 schrieb 7879: Yes, with dmesg|grep drm , I get following. 348571.880718] [drm:amdgpu_job_timedout [amdgpu]] *E

Re: ?????? Bug: amdgpu drm driver cause process into Disk sleep state

2019-09-03 Thread Koenig, Christian
Well that looks like the hardware got stuck. Do you get something in the locks about a timeout on the SDMA ring? Regards, Christian. Am 03.09.19 um 14:50 schrieb 7879: Hi Christian, Sometimes the thread blocked disk sleeping in call to amdgpu_sa_bo_new. following is the stack trace.

Re: [PATCH 3/3] drm/amdgpu: remove amdgpu_cs_try_evict

2019-09-03 Thread Chunming Zhou
Reviewed-by: Chunming Zhou for series. -David 在 2019/9/3 17:09, Christian König 写道: > Trying to evict things from the current working set doesn't work that > well anymore because of per VM BOs. > > Rely on reserving VRAM for page tables to avoid contention. > > Signed-off-by: Christian König >

[PATCH] drm/amd/amdgpu: add sw_fini interface for df_funcs

2019-09-03 Thread Jack Zhang
add sw_fini interface of df_funcs. This interface will remove sysfs file of df_cntr_avail function. The old behavior only create sysfs of df_cntr_avail in sw_init, but never remove it for lack of sw_fini interface. With this,driver will report create sysfs fail when it's loaded for the second time

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Daniel Vetter
On Tue, Sep 3, 2019 at 10:24 AM Koenig, Christian wrote: > > Am 03.09.19 um 10:02 schrieb Daniel Vetter: > > On Thu, Aug 29, 2019 at 02:05:17AM -0400, Kenny Ho wrote: > >> This is a follow up to the RFC I made previously to introduce a cgroup > >> controller for the GPU/DRM subsystem [v1,v2,v3].

[PATCH 3/3] drm/amdgpu: remove amdgpu_cs_try_evict

2019-09-03 Thread Christian König
Trying to evict things from the current working set doesn't work that well anymore because of per VM BOs. Rely on reserving VRAM for page tables to avoid contention. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu.h| 1 - drivers/gpu/drm/amd/amdgpu/amdgpu_cs.c | 71 +--

[PATCH 2/3] drm/amdgpu: reserve at least 4MB of VRAM for page tables v2

2019-09-03 Thread Christian König
This hopefully helps reduce the contention for page tables. v2: adjust maximum reported VRAM size as well Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c | 18 -- drivers/gpu/drm/amd/amdgpu/amdgpu_vm.h | 3 +++ drivers/gpu/drm/amd/amdgpu/am

[PATCH 1/3] drm/amdgpu: use moving fence instead of exclusive for VM updates

2019-09-03 Thread Christian König
Make VM updates depend on the moving fence instead of the exclusive one. Makes it less likely to actually have a dependency. Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/a

Re: [PATCH] drm/amdgpu: change r type to int in gmc_v9_0_late_init

2019-09-03 Thread Christian König
Am 03.09.19 um 10:41 schrieb Zhou1, Tao: change r type from bool to int, suitable for both bool and int return value Signed-off-by: Tao Zhou Reviewed-by: Christian König --- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers

Re: [PATCH 2/3] drm/amdgpu: reserve at least 4MB of VRAM for page tables

2019-09-03 Thread Christian König
Oh, good point! That is probably good idea. Christian. Am 03.09.19 um 08:52 schrieb Zhou, David(ChunMing): Do you need update the vram size reported to UMD ? -David -Original Message- From: amd-gfx On Behalf Of Christian König Sent: Monday, September 2, 2019 6:52 PM To: amd-gfx@list

[PATCH] drm/amdgpu: change r type to int in gmc_v9_0_late_init

2019-09-03 Thread Zhou1, Tao
change r type from bool to int, suitable for both bool and int return value Signed-off-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v9_0.c index 0

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Koenig, Christian
Am 03.09.19 um 10:02 schrieb Daniel Vetter: > On Thu, Aug 29, 2019 at 02:05:17AM -0400, Kenny Ho wrote: >> This is a follow up to the RFC I made previously to introduce a cgroup >> controller for the GPU/DRM subsystem [v1,v2,v3]. The goal is to be able to >> provide resource management to GPU reso

Re: Bug: amdgpu drm driver cause process into Disk sleep state

2019-09-03 Thread Koenig, Christian
Hi Yanhua, please update your kernel first, cause that looks like a known issue which was recently fixed by patch "drm/scheduler: use job count instead of peek". Probably best to try the latest bleeding edge kernel and if that doesn't help please open up a bug report on https://bugs.freedeskto

[PATCH] drm/amd/powerplay: replace smu->table_count with SMU_TABLE_COUNT in smu

2019-09-03 Thread Wang, Kevin(Yang)
fix bellow patch issue: drm/amd/powerplay: introduce smu table id type to handle the smu table for each asic "This patch introduces new smu table type, it's to handle the different smu table defines for each asic with the same smu ip." before: use smu->table_count to represent the actual ta

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Daniel Vetter
On Thu, Aug 29, 2019 at 02:05:17AM -0400, Kenny Ho wrote: > This is a follow up to the RFC I made previously to introduce a cgroup > controller for the GPU/DRM subsystem [v1,v2,v3]. The goal is to be able to > provide resource management to GPU resources using things like container. > > With th

Re: [PATCH RFC v4 01/16] drm: Add drm_minor_for_each

2019-09-03 Thread Daniel Vetter
On Thu, Aug 29, 2019 at 02:05:18AM -0400, Kenny Ho wrote: > To allow other subsystems to iterate through all stored DRM minors and > act upon them. > > Also exposes drm_minor_acquire and drm_minor_release for other subsystem > to handle drm_minor. DRM cgroup controller is the initial consumer of

Re: [PATCH RFC v4 00/16] new cgroup controller for gpu/drm subsystem

2019-09-03 Thread Daniel Vetter
On Fri, Aug 30, 2019 at 09:28:57PM -0700, Tejun Heo wrote: > Hello, > > I just glanced through the interface and don't have enough context to > give any kind of detailed review yet. I'll try to read up and > understand more and would greatly appreciate if you can give me some > pointers to read u

Re: [PATCH 3/3] drm/edid: no CEA extension is not an error

2019-09-03 Thread Jean Delvare
On Mon, 2 Sep 2019 14:46:51 +0300, Ville Syrjälä wrote: > On Fri, Aug 30, 2019 at 06:16:52PM +0200, Jean Delvare wrote: > > It is fine for displays without audio functionality to not implement > > CEA extension in their EDID. Do not return an error in that case, > > instead return 0 as if there was

Bug: amdgpu drm driver cause process into Disk sleep state

2019-09-03 Thread 78666679
Hi, Sirs: I have a wx5100 amdgpu card, It randomly come into failure. sometimes, it will cause processes into uninterruptible wait state. cps-new-ondemand-0587:~ # ps aux|grep -w D root 11268 0.0 0.0 260628 3516 ?Ssl 8??26 0:00 /usr/sbin/gssproxy -D root 136482 0

RE: [PATCH 01/10] drm/amdgpu: set ip specific ras interface pointer to NULL after free it

2019-09-03 Thread Zhou1, Tao
I think ecc_irq and process_ras_data_cb functions could be also moved to generic module files, but we can do it in new patches. Another issue is amdgpu init will fail if vbios enables ras but psp ras ta is not ready on Arcturus. If vbios and psp ta for ras will be released at the same time, the