slpc: Add a new SLPC selftest

Belgaumkar, Vinay Mon, 27 Jun 2022 15:52:41 -0700


On 6/24/2022 8:59 PM, Dixit, Ashutosh wrote:

On Thu, 23 Jun 2022 16:21:46 -0700, Belgaumkar, Vinay wrote:

On 6/22/2022 1:32 PM, Dixit, Ashutosh wrote:

On Fri, 10 Jun 2022 16:47:12 -0700, Vinay Belgaumkar wrote:

This test will validate we can achieve actual frequency of RP0. Pcode
grants frequencies based on what GuC is requesting. However, thermal
throttling can limit what is being granted. Add a test to request for
max, but don't fail the test if RP0 is not granted due to throttle
reasons.

Also optimize the selftest by using a common run_test function to avoid
code duplication.

The refactoring does change the order of operations (changing the freq vs
spawning the spinner) but should be fine I think.

Yes, we now start the spinner outside the for loop, so that freq changes
occur quickly. This ensures we don't mess with SLPC algorithm's history by
frequently restarting the WL in the for loop.

Rename the "clamp" tests to vary_max_freq and vary_min_freq.

Either is ok, but maybe "clamp" names were ok I think since they verify req
freq is clamped at min/max.

True, though clamp usually is associated with limiting, whereas we actually
increase the min.

v2: Fix compile warning

Fixes 8ee2c227822e ("drm/i915/guc/slpc: Add SLPC selftest")
Signed-off-by: Vinay Belgaumkar <vinay.belgaum...@intel.com>
---
   drivers/gpu/drm/i915/gt/selftest_slpc.c | 323 ++++++++++++------------
   1 file changed, 158 insertions(+), 165 deletions(-)

diff --git a/drivers/gpu/drm/i915/gt/selftest_slpc.c 
b/drivers/gpu/drm/i915/gt/selftest_slpc.c
index b768cea5943d..099129aae9a5 100644
--- a/drivers/gpu/drm/i915/gt/selftest_slpc.c
+++ b/drivers/gpu/drm/i915/gt/selftest_slpc.c
@@ -8,6 +8,11 @@
   #define delay_for_h2g() usleep_range(H2G_DELAY, H2G_DELAY + 10000)
   #define FREQUENCY_REQ_UNIT   DIV_ROUND_CLOSEST(GT_FREQUENCY_MULTIPLIER, \
                                                  GEN9_FREQ_SCALER)
+enum test_type {
+       VARY_MIN,
+       VARY_MAX,
+       MAX_GRANTED
+};

   static int slpc_set_min_freq(struct intel_guc_slpc *slpc, u32 freq)
   {
@@ -36,147 +41,120 @@ static int slpc_set_max_freq(struct intel_guc_slpc *slpc, 
u32 freq)
        return ret;
   }

-static int live_slpc_clamp_min(void *arg)
+static int vary_max_freq(struct intel_guc_slpc *slpc, struct intel_rps *rps,
+                 u32 *max_act_freq)

Please run checkpatch, indentation seems off.

I had run it. Not sure why this wasn't caught.

Need to use 'checkpatch --strict'.

ok.

   {
-       struct drm_i915_private *i915 = arg;
-       struct intel_gt *gt = to_gt(i915);
-       struct intel_guc_slpc *slpc = &gt->uc.guc.slpc;
-       struct intel_rps *rps = &gt->rps;
-       struct intel_engine_cs *engine;
-       enum intel_engine_id id;
-       struct igt_spinner spin;
+       u32 step, max_freq, req_freq;
+       u32 act_freq;
        u32 slpc_min_freq, slpc_max_freq;
        int err = 0;

-       if (!intel_uc_uses_guc_slpc(&gt->uc))
-               return 0;
-
-       if (igt_spinner_init(&spin, gt))
-               return -ENOMEM;
+       slpc_min_freq = slpc->min_freq;
+       slpc_max_freq = slpc->rp0_freq;

nit but we don't really need such variables since we don't change their
values, we should just use slpc->min_freq, slpc->rp0_freq directly. I'd
change this in all places in this patch.

I will remove it from the sub-functions, but will need to keep the one in
the main run_test(). We should query SLPC's min and max and then restore
that at the end of the test. It is possible that SLPC's min is different
from platform min for certain skus.

Sorry, I am not following. The tests are varying freq between platform min
to platform max correct? And platform min can be different from slpc min?
So why don't the tests start at slpc min rather than platform min? Can't
this return error?

Will start the tests from platform min -> platform max, that way weremain consistent.


And shouldn't slpc->min set to the real slpc min rather than to the
platform min when slpc initializes (in intel_guc_slpc_enable() or
slpc_get_rp_values())? (I am assuming the issue is only for the min and not
the max but not sure).

Certain conditions may result in SLPC setting the min to a differentvalue. We can worry about that in a different patch.


So I'd expect everywhere a consistent set of freq's be used, in run_test()
and the actual vary_min/max_freq tests and also in the main driver.

Agree.

-       if (intel_guc_slpc_get_max_freq(slpc, &slpc_max_freq)) {
-               pr_err("Could not get SLPC max freq\n");
-               return -EIO;
-       }
-
-       if (intel_guc_slpc_get_min_freq(slpc, &slpc_min_freq)) {
-               pr_err("Could not get SLPC min freq\n");
-               return -EIO;

Why do we need these two function calls? Can't we just use slpc->rp0_freq
and slpc->min_freq as we are doing in the vary_min/max_freq() functions
above?

Same as above.

Also, as mentioned below I think here we should just do:

          slpc_set_max_freq(slpc, slpc->rp0_freq);
          slpc_set_min_freq(slpc, slpc->min_freq);

to restore freq to a known state before starting the test (just in case a
previous test changed the values).

Any test that changes the frequencies should restore them as well.

I was saying restore the freq's *before* starting the tests as well to
start from a known state.


Ok, so 2 things here-

- A test should not alter the state of the system - we need to save theslpc_min and slpc_max at the beginning and restore them at the end.

- A test should start from a known state -We can set min/max to RPn/RP0.Also, what if RPn = RP0? This could be a bug in the fuse registers, sowe need to flag an error.

-       }
-
-       if (slpc_min_freq == slpc_max_freq) {
-               pr_err("Min/Max are fused to the same value\n");
-               return -EINVAL;

What if they are actually equal? I think basically the max/min freq test
loops will just not be entered (so effectively the tests will just
skip). The granted freq test will be fine. So I think we can just delete
this if statement?

(It is showing deleted above in the patch but is in the new code somewhere
too).

Actually, we should set it to min/rp0 if this is the case. That change will
be in a separate patch. This is needed for certain cases.

I don't see why it's needed as I said, can you explain? Set what to min/rp0?

Above discussion should resolve this as well.

-       }
-
-       intel_gt_pm_wait_for_idle(gt);
-       intel_gt_pm_get(gt);
-       for_each_engine(engine, gt, id) {
-               struct i915_request *rq;
-               u32 step, min_freq, req_freq;
-               u32 act_freq, max_act_freq;
-
-               if (!intel_engine_can_store_dword(engine))
-                       continue;
+       /* Go from max to min in 5 steps */
+       step = (slpc_max_freq - slpc_min_freq) / NUM_STEPS;
+       *max_act_freq = slpc_min_freq;
+       for (max_freq = slpc_max_freq; max_freq > slpc_min_freq;
+                               max_freq -= step) {
+               err = slpc_set_max_freq(slpc, max_freq);
+               if (err)
+                       break;

-               /* Go from min to max in 5 steps */
-               step = (slpc_max_freq - slpc_min_freq) / NUM_STEPS;
-               max_act_freq = slpc_min_freq;
-               for (min_freq = slpc_min_freq; min_freq < slpc_max_freq;
-                                       min_freq += step) {
-                       err = slpc_set_min_freq(slpc, min_freq);
-                       if (err)
-                               break;
-
-                       st_engine_heartbeat_disable(engine);
-
-                       rq = igt_spinner_create_request(&spin,
-                                                       engine->kernel_context,
-                                                       MI_NOOP);
-                       if (IS_ERR(rq)) {
-                               err = PTR_ERR(rq);
-                               st_engine_heartbeat_enable(engine);
-                               break;
-                       }
+               req_freq = intel_rps_read_punit_req_frequency(rps);

-                       i915_request_add(rq);
+               /* GuC requests freq in multiples of 50/3 MHz */
+               if (req_freq > (max_freq + FREQUENCY_REQ_UNIT)) {
+                       pr_err("SWReq is %d, should be at most %d\n", req_freq,
+                               max_freq + FREQUENCY_REQ_UNIT);
+                       err = -EINVAL;

Probably a nit but check can be (so should we be checking both high and low
limits?):
                if (req_freq > (max_freq + FREQUENCY_REQ_UNIT) ||
                    req_freq < (slpc_min_freq - FREQUENCY_REQ_UNIT))

Though if we do this we'd need to change the pr_err() or have two separate
if statements. Not sure if it's worth it but thought I'll mention it.

We are looking to validate it does not cross the upper limit.

OK.

+static int vary_min_freq(struct intel_guc_slpc *slpc, struct intel_rps *rps,
+                 u32 *max_act_freq)
+{
+       u32 step, min_freq, req_freq;
+       u32 act_freq;
+       u32 slpc_min_freq, slpc_max_freq;
+       int err = 0;

-                       act_freq =  intel_rps_read_actual_frequency(rps);
-                       if (act_freq > max_act_freq)
-                               max_act_freq = act_freq;
+       slpc_min_freq = slpc->min_freq;
+       slpc_max_freq = slpc->rp0_freq;

-                       igt_spinner_end(&spin);
-                       st_engine_heartbeat_enable(engine);
-               }
+       /* Go from min to max in 5 steps */
+       step = (slpc_max_freq - slpc_min_freq) / NUM_STEPS;
+       *max_act_freq = slpc_min_freq;
+       for (min_freq = slpc_min_freq; min_freq < slpc_max_freq;
+                               min_freq += step) {
+               err = slpc_set_min_freq(slpc, min_freq);
+               if (err)
+                       break;

-               pr_info("Max actual frequency for %s was %d\n",
-                       engine->name, max_act_freq);
+               req_freq = intel_rps_read_punit_req_frequency(rps);

-               /* Actual frequency should rise above min */
-               if (max_act_freq == slpc_min_freq) {

Nit again. This check is somewhere in the new code but I think a better
check is

                if (max_act_freq <= slpc_min_freq)

just in case the act freq for whatever reason falls below
slpc_min_freq. Even if we know this is impossible (bugs make the impossible
possible).

sure.

-                       pr_err("Actual freq did not rise above min\n");
+               /* GuC requests freq in multiples of 50/3 MHz */
+               if (req_freq < (min_freq - FREQUENCY_REQ_UNIT)) {
+                       pr_err("SWReq is %d, should be at least %d\n", req_freq,
+                               min_freq - FREQUENCY_REQ_UNIT);
                        err = -EINVAL;

Again nit as above, but check can be:
                if (req_freq < (min_freq - FREQUENCY_REQ_UNIT) ||
                    req_freq > (slpc_max_freq + FREQUENCY_REQ_UNIT)) {

It can be higher, we want to validate lower range.

OK.

                }

+               act_freq =  intel_rps_read_actual_frequency(rps);
+               if (act_freq > *max_act_freq)
+                       *max_act_freq = act_freq;
+
                if (err)
                        break;
        }

-       /* Restore min/max frequencies */
-       slpc_set_max_freq(slpc, slpc_max_freq);
-       slpc_set_min_freq(slpc, slpc_min_freq);
+       return err;
+}

-       if (igt_flush_test(gt->i915))
-               err = -EIO;
+static int max_granted_freq(struct intel_guc_slpc *slpc, struct intel_rps 
*rps, u32 *max_act_freq)
+{
+       struct intel_gt *gt = rps_to_gt(rps);
+       u32 perf_limit_reasons;
+       int err = 0;

-       intel_gt_pm_put(gt);
-       igt_spinner_fini(&spin);
-       intel_gt_pm_wait_for_idle(gt);
+       err = slpc_set_min_freq(slpc, slpc->rp0_freq);
+       if (err)
+               return err;
+
+       *max_act_freq =  intel_rps_read_actual_frequency(rps);
+       if (!(*max_act_freq == slpc->rp0_freq)) {
+               /* Check if there was some throttling by pcode */
+               perf_limit_reasons = intel_uncore_read(gt->uncore, 
GT0_PERF_LIMIT_REASONS);
+
+               /* If not, this is an error */
+               if (perf_limit_reasons && GT0_PERF_LIMIT_REASONS_MASK) {
+                       pr_err("Pcode did not grant max freq\n");
+                       err = -EINVAL;

Looks incorrect, probably something like:
                if (!(perf_limit_reasons & GT0_PERF_LIMIT_REASONS_MASK))

Hmm, good catch. We should flag error iff there is no throttling and act
freq does not go to max.

+               }
+       }

        return err;
   }

-static int live_slpc_clamp_max(void *arg)
+static int run_test(struct intel_gt *gt, int test_type)
   {
-       struct drm_i915_private *i915 = arg;
-       struct intel_gt *gt = to_gt(i915);
-       struct intel_guc_slpc *slpc;
-       struct intel_rps *rps;
+       struct intel_guc_slpc *slpc = &gt->uc.guc.slpc;
+       struct intel_rps *rps = &gt->rps;
        struct intel_engine_cs *engine;
        enum intel_engine_id id;
        struct igt_spinner spin;
-       int err = 0;
        u32 slpc_min_freq, slpc_max_freq;
-
-       slpc = &gt->uc.guc.slpc;
-       rps = &gt->rps;
+       int err = 0;

        if (!intel_uc_uses_guc_slpc(&gt->uc))
                return 0;
@@ -203,69 +181,56 @@ static int live_slpc_clamp_max(void *arg)
        intel_gt_pm_get(gt);
        for_each_engine(engine, gt, id) {
                struct i915_request *rq;
-               u32 max_freq, req_freq;
-               u32 act_freq, max_act_freq;
-               u32 step;
+               u32 max_act_freq;

                if (!intel_engine_can_store_dword(engine))
                        continue;

-               /* Go from max to min in 5 steps */
-               step = (slpc_max_freq - slpc_min_freq) / NUM_STEPS;
-               max_act_freq = slpc_min_freq;
-               for (max_freq = slpc_max_freq; max_freq > slpc_min_freq;
-                                       max_freq -= step) {
-                       err = slpc_set_max_freq(slpc, max_freq);
-                       if (err)
-                               break;
-
-                       st_engine_heartbeat_disable(engine);
-
-                       rq = igt_spinner_create_request(&spin,
-                                                       engine->kernel_context,
-                                                       MI_NOOP);
-                       if (IS_ERR(rq)) {
-                               st_engine_heartbeat_enable(engine);
-                               err = PTR_ERR(rq);
-                               break;
-                       }
+               st_engine_heartbeat_disable(engine);

-                       i915_request_add(rq);
+               rq = igt_spinner_create_request(&spin,
+                                               engine->kernel_context,
+                                               MI_NOOP);
+               if (IS_ERR(rq)) {
+                       err = PTR_ERR(rq);
+                       st_engine_heartbeat_enable(engine);
+                       break;
+               }

-                       if (!igt_wait_for_spinner(&spin, rq)) {
-                               pr_err("%s: SLPC spinner did not start\n",
-                                      engine->name);
-                               igt_spinner_end(&spin);
-                               st_engine_heartbeat_enable(engine);
-                               intel_gt_set_wedged(engine->gt);
-                               err = -EIO;
-                               break;
-                       }
+               i915_request_add(rq);
+
+               if (!igt_wait_for_spinner(&spin, rq)) {
+                       pr_err("%s: Spinner did not start\n",
+                              engine->name);
+                       igt_spinner_end(&spin);
+                       st_engine_heartbeat_enable(engine);
+                       intel_gt_set_wedged(engine->gt);
+                       err = -EIO;
+                       break;
+               }

-                       delay_for_h2g();
+               switch (test_type) {

-                       /* Verify that SWREQ indeed was set to specific value */
-                       req_freq = intel_rps_read_punit_req_frequency(rps);
+               case VARY_MIN:
+                       err = vary_min_freq(slpc, rps, &max_act_freq);
+                       break;
+
+               case VARY_MAX:
+                       err = vary_max_freq(slpc, rps, &max_act_freq);
+                       break;

-                       /* GuC requests freq in multiples of 50/3 MHz */
-                       if (req_freq > (max_freq + FREQUENCY_REQ_UNIT)) {
-                               pr_err("SWReq is %d, should be at most %d\n", 
req_freq,
-                                      max_freq + FREQUENCY_REQ_UNIT);
+               case MAX_GRANTED:
+                       /* Media engines have a different RP0 */
+                       if ((engine->class == VIDEO_DECODE_CLASS) ||
+                           (engine->class == VIDEO_ENHANCEMENT_CLASS)) {
                                igt_spinner_end(&spin);
                                st_engine_heartbeat_enable(engine);
-                               err = -EINVAL;
-                               break;
+                               err = 0;
+                               continue;

I think it's preferable to move this media engine code out of the main loop
into max_granted_freq() function if possible (maybe by faking max_act_freq
if needed)?

All the engine related info is here. I will need to pass it to the
max_granted_freq() function.  Also, faking the act_freq probably
overkill. I can add a fixme here instead to update when we have a
reliable way to obtain media RP0 instead.

OK let's leave as is, no need for FIXME, just leave the comment as before.

ok.

                        }

-                       act_freq =  intel_rps_read_actual_frequency(rps);
-                       if (act_freq > max_act_freq)
-                               max_act_freq = act_freq;
-
-                       st_engine_heartbeat_enable(engine);
-                       igt_spinner_end(&spin);
-
-                       if (err)
-                               break;
+                       err = max_granted_freq(slpc, rps, &max_act_freq);
+                       break;
                }

                pr_info("Max actual frequency for %s was %d\n",
@@ -277,31 +242,59 @@ static int live_slpc_clamp_max(void *arg)
                        err = -EINVAL;
                }

-               if (igt_flush_test(gt->i915)) {
-                       err = -EIO;
-                       break;
-               }
+               igt_spinner_end(&spin);
+               st_engine_heartbeat_enable(engine);

                if (err)
                        break;
        }

-       /* Restore min/max freq */
+       /* Restore min/max frequencies */
        slpc_set_max_freq(slpc, slpc_max_freq);
        slpc_set_min_freq(slpc, slpc_min_freq);

As mentioned above maybe we should restore at the beginning of the test too
(before the for_each_engine() loop) to start from a known state?

Anyway here maybe get rid of the variables and:

This is restoring whatever frequencies SLPC was running with
initially. Regarding resetting the frequencies to min for every engine loop
iteration, we are already iterating from min->max inside the for loop, so
will be duplication.

I didn't say reset frequencies to min for every engine loop iteration, I
said "before the for_each_engine() loop". Same as above: "I was saying
restore the freq's *before* starting the tests as well to start from a
known state".


Every test will work off a known state - from RPn->RP0

Thanks,

Vinay.


Thanks.
--
Ashutosh

Re: [Intel-gfx] [PATCH] drm/i915/guc/slpc: Add a new SLPC selftest

Reply via email to