I reproduced the issue reported by customer. When running a HSA test,
repeating to read power via AGT and rocm-smi (driver). We set power
limit of 175w to a Fiji. The results from AGT are all below 175w and the
results from driver have a lot of value over 175, some are almost double
of 175. So your test cases are not enough, you should run some OCL and
HSA tests.
I have tested 100ms and 150ms, the results still have some wrong. 200ms
is good. It seems more sampling more accurate.
The theoretical period is quoted from smu team and tools team. AGT is
using more than 1sec of period. I don't know how long one cycle of dpm
task is, but is sampling based on dpm task cycle? we should ask smu team
to confirm.
Regards,
Eric
On 03/30/2018 03:52 AM, Zhu, Rex wrote:
>> Power value is wrong reported by customer.
Hi Eric,
What is the wrong value customer reported?
In my end, there is no big difference between 20ms and 200ms or 2s. I
tested on Fiji/Tonga when gpu idle or running fullscreen glxgears.
why need 50 ms?
How long does the SMU core take to complete one cycle of dpm tasks? I
tested, it is less than 1 ms.
So when we delay 20 ms, The output is the average value of more than
20 sampling.
Best Regards
Rex
*From:*amd-gfx [mailto:amd-gfx-boun...@lists.freedesktop.org] *On
Behalf Of *Deucher, Alexander
*Sent:* Friday, March 30, 2018 4:00 AM
*To:* Huang, JinHuiEric; amd-gfx@lists.freedesktop.org
*Subject:* Re: [PATCH] drm/amd/powerply: fix power reading on Fiji
Fiji and tonga I presume. The current code seems to work fine on
tonga at least.
Alex
------------------------------------------------------------------------
*From:*Huang, JinHuiEric
*Sent:* Thursday, March 29, 2018 3:58:42 PM
*To:* Deucher, Alexander; amd-gfx@lists.freedesktop.org
<mailto:amd-gfx@lists.freedesktop.org>
*Subject:* Re: [PATCH] drm/amd/powerply: fix power reading on Fiji
Right. This is only for Fiji. We should use PPSMC_MSG_GetCurrPkgPwr on
poaris.
Thanks,
Eric
On 2018-03-29 03:54 PM, Deucher, Alexander wrote:
Thanks. Patch is:
Acked-by: Alex Deucher <alexander.deuc...@amd.com>
<mailto:alexander.deuc...@amd.com>
Care to make a patch to use PPSMC_MSG_GetCurrPkgPwr on polaris
boards so we don't have to worry about the delay on them?
Alex
------------------------------------------------------------------------
*From:*Huang, JinHuiEric
*Sent:* Thursday, March 29, 2018 3:40:22 PM
*To:* Deucher, Alexander; amd-gfx@lists.freedesktop.org
<mailto:amd-gfx@lists.freedesktop.org>
*Subject:* Re: [PATCH] drm/amd/powerply: fix power reading on Fiji
This reading method is shared with AGT tool only on Fiji, because
SMU FW doesn't support PPSMC_MSG_GetCurrPkgPwr message on Fiji.
But since polaris10, PPSMC_MSG_GetCurrPkgPwr has been supported.
We also use PPSMC_MSG_GetCurrPkgPwr on vega which SMU FW control
sampling period. Driver will not care about it.
Eric
On 2018-03-29 03:31 PM, Deucher, Alexander wrote:
Do you know what the sampling period is on vega? We should
try and be consistent. How about making this selectable via
hwmon:
power[1-*]_average_interval Power use averaging
interval. A poll
notification is sent to this file if the
hardware changes the averaging interval.
Unit: milliseconds
RW
power[1-*]_average_interval_max Maximum power use averaging
interval
Unit: milliseconds
RO
power[1-*]_average_interval_min Minimum power use averaging
interval
Unit: milliseconds
RO
Then the user can select the interval they want.
Alex
------------------------------------------------------------------------
*From:*amd-gfx <amd-gfx-boun...@lists.freedesktop.org>
<mailto:amd-gfx-boun...@lists.freedesktop.org> on behalf of
Eric Huang <jinhuieric.hu...@amd.com>
<mailto:jinhuieric.hu...@amd.com>
*Sent:* Thursday, March 29, 2018 3:21:52 PM
*To:* amd-gfx@lists.freedesktop.org
<mailto:amd-gfx@lists.freedesktop.org>
*Cc:* Huang, JinHuiEric
*Subject:* [PATCH] drm/amd/powerply: fix power reading on Fiji
Power value is wrong reported by customer. It is a regression by
commit a7c7bc4c0c47eaac77b8fa92f0672032df7f4254
Author: Rex Zhu <rex....@amd.com> <mailto:rex....@amd.com>
Date: Mon Mar 27 15:32:59 2017 +0800
drm/amd/powerplay: reduce sample period time
for power readings.
Signed-off-by: Rex Zhu <rex....@amd.com>
<mailto:rex....@amd.com>
Reviewed-by: Alex Deucher <alexander.deuc...@amd.com>
<mailto:alexander.deuc...@amd.com>
Signed-off-by: Alex Deucher <alexander.deuc...@amd.com>
<mailto:alexander.deuc...@amd.com>
The theoretical sampling period is from 50ms to 4sec, original
2sec
is long but correct, and 20ms is too short. change it to more
reasonable 200ms.
Signed-off-by: Eric Huang <jinhuieric.hu...@amd.com>
<mailto:jinhuieric.hu...@amd.com>
---
drivers/gpu/drm/amd/powerplay/hwmgr/smu7_hwmgr.c | 3 ++-
1 file changed, 2 insertions(+), 1 deletion(-)
diff --git a/drivers/gpu/drm/amd/powerplay/hwmgr/smu7_hwmgr.c
b/drivers/gpu/drm/amd/powerplay/hwmgr/smu7_hwmgr.c
index a03b7fe..7631d80 100644
--- a/drivers/gpu/drm/amd/powerplay/hwmgr/smu7_hwmgr.c
+++ b/drivers/gpu/drm/amd/powerplay/hwmgr/smu7_hwmgr.c
@@ -3377,7 +3377,8 @@ static int smu7_get_gpu_power(struct
pp_hwmgr *hwmgr,
"Failed to start pm status log!",
return -1);
- msleep_interruptible(20);
+ /* Sampling period from 50ms to 4sec */
+ msleep_interruptible(200);
PP_ASSERT_WITH_CODE(!smum_send_msg_to_smc(hwmgr,
PPSMC_MSG_PmStatusLogSample),
--
2.7.4
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
<mailto:amd-gfx@lists.freedesktop.org>
https://lists.freedesktop.org/mailman/listinfo/amd-gfx
_______________________________________________
amd-gfx mailing list
amd-gfx@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx