On Sun, Nov 04, 2018 at 05:31:20PM +0100, Rafael J. Wysocki wrote:
> + * - If there is a pattern of 5 or more recent non-timer wakeups earlier than
> + *   the closest timer event, expect one more of them to occur and use the
> + *   average of the idle duration values corresponding to them to select an
> + *   idle state for the CPU.


> +/**
> + * teo_idle_duration - Estimate the duration of the upcoming CPU idle time.
> + * @drv: cpuidle driver containing state data.
> + * @cpu_data: Governor data for the target CPU.
> + * @sleep_length_us: Time till the closest timer event in microseconds.
> + */
> +unsigned int teo_idle_duration(struct cpuidle_driver *drv,
> +                            struct teo_cpu *cpu_data,
> +                            unsigned int sleep_length_us)
> +{
> +     u64 range, max_spread, max, sum;
> +     unsigned int count;
> +
> +     /*
> +      * If the sleep length is below the target residency of idle state 1,
> +      * the only viable choice is to select the first available (enabled)
> +      * idle state, so return immediately in that case.
> +      */
> +     if (sleep_length_us < drv->states[1].target_residency)
> +             return sleep_length_us;
> +
> +     /*
> +      * The purpose of this function is to check if there is a pattern of
> +      * wakeups indicating that it would be better to select a state
> +      * shallower than the deepest one matching the sleep length or the
> +      * deepest one at all if the sleep lenght is long.  Larger idle duration
> +      * values are beyond the interesting range.
> +      *
> +      * Narrowing the range of interesting values down upfront also helps to
> +      * avoid overflows during the computation below.
> +      */
> +     range = drv->states[drv->state_count-1].target_residency;
> +     range = min_t(u64, sleep_length_us, range + (range >> 2));
> +
> +     /*
> +      * This is the value to compare with the distance between the average
> +      * and the greatest sample to decide whether or not it is small enough.
> +      * Take 10 us as the total cap of it.
> +      */
> +     max_spread = max_t(u64, range >> MAX_SPREAD_SHIFT, 10);
> +
> +     max = range;
> +
> +     do {
> +             u64 cap = max;
> +             int i;
> +
> +             /*
> +              * Compute the sum of the saved intervals below the cap and the
> +              * sum of of their squares.  Count them and find the maximum
> +              * interval below the cap.
> +              */
> +             count = 0;
> +             sum = 0;
> +             max = 0;
> +
> +             for (i = 0; i < INTERVALS; i++) {
> +                     u64 val = cpu_data->intervals[i];
> +
> +                     if (val >= cap)
> +                             continue;
> +
> +                     count++;
> +                     sum += val;
> +                     if (max < val)
> +                             max = val;
> +             }
> +
> +             /*
> +              * Give up if the total number of interesting samples is too
> +              * small.
> +              */
> +             if (cap == range && count <= INTERVALS / 2)
> +                     return sleep_length_us;
> +
> +             /*
> +              * If the distance between the max and the average is too large,
> +              * discard the max an repeat.
> +              */
> +     } while (count > 3 && max > max_spread && (max - max_spread) * count > 
> sum);
> +
> +     return div64_u64(sum, count);
> +}

Instead of this detector; why haven't you used the code from
kernel/irq/timings.c ?

Reply via email to