Re: [PATCH] sched/deadline: Reduce rq lock contention in dl_add_task_root_domain()

2021-02-08 Thread Peter Zijlstra
On Tue, Jan 19, 2021 at 09:35:42AM +0100, Dietmar Eggemann wrote:
> dl_add_task_root_domain() is called during sched domain rebuild:
> 
>   rebuild_sched_domains_locked()
> partition_and_rebuild_sched_domains()
>   rebuild_root_domains()
>  for all top_cpuset descendants:
>update_tasks_root_domain()
>  for all tasks of cpuset:
>dl_add_task_root_domain()
> 
> Change it so that only the task pi lock is taken to check if the task
> has a SCHED_DEADLINE (DL) policy. In case that p is a DL task take the
> rq lock as well to be able to safely de-reference root domain's DL
> bandwidth structure.
> 
> Most of the tasks will have another policy (namely SCHED_NORMAL) and
> can now bail without taking the rq lock.
> 
> One thing to note here: Even in case that there aren't any DL user
> tasks, a slow frequency switching system with cpufreq gov schedutil has
> a DL task (sugov) per frequency domain running which participates in DL
> bandwidth management.
> 
> Reviewed-by: Quentin Perret 
> Signed-off-by: Dietmar Eggemann 

Thanks!


Re: [PATCH] sched/deadline: Reduce rq lock contention in dl_add_task_root_domain()

2021-01-20 Thread Juri Lelli
Hi,

On 19/01/21 09:35, Dietmar Eggemann wrote:
> dl_add_task_root_domain() is called during sched domain rebuild:
> 
>   rebuild_sched_domains_locked()
> partition_and_rebuild_sched_domains()
>   rebuild_root_domains()
>  for all top_cpuset descendants:
>update_tasks_root_domain()
>  for all tasks of cpuset:
>dl_add_task_root_domain()
> 
> Change it so that only the task pi lock is taken to check if the task
> has a SCHED_DEADLINE (DL) policy. In case that p is a DL task take the
> rq lock as well to be able to safely de-reference root domain's DL
> bandwidth structure.
> 
> Most of the tasks will have another policy (namely SCHED_NORMAL) and
> can now bail without taking the rq lock.
> 
> One thing to note here: Even in case that there aren't any DL user
> tasks, a slow frequency switching system with cpufreq gov schedutil has
> a DL task (sugov) per frequency domain running which participates in DL
> bandwidth management.
> 
> Reviewed-by: Quentin Perret 
> Signed-off-by: Dietmar Eggemann 

Looks good to me, thanks!

Acked-by: Juri Lelli 

Best,
Juri



Re: [PATCH] sched/deadline: Reduce rq lock contention in dl_add_task_root_domain()

2021-01-19 Thread Daniel Bristot de Oliveira
On 1/19/21 9:35 AM, Dietmar Eggemann wrote:
> dl_add_task_root_domain() is called during sched domain rebuild:
> 
>   rebuild_sched_domains_locked()
> partition_and_rebuild_sched_domains()
>   rebuild_root_domains()
>  for all top_cpuset descendants:
>update_tasks_root_domain()
>  for all tasks of cpuset:
>dl_add_task_root_domain()
> 
> Change it so that only the task pi lock is taken to check if the task
> has a SCHED_DEADLINE (DL) policy. In case that p is a DL task take the
> rq lock as well to be able to safely de-reference root domain's DL
> bandwidth structure.
> 
> Most of the tasks will have another policy (namely SCHED_NORMAL) and
> can now bail without taking the rq lock.
> 
> One thing to note here: Even in case that there aren't any DL user
> tasks, a slow frequency switching system with cpufreq gov schedutil has
> a DL task (sugov) per frequency domain running which participates in DL
> bandwidth management.
> 
> Reviewed-by: Quentin Perret 
> Signed-off-by: Dietmar Eggemann 

Reviewed-by: Daniel Bristot de Oliveira 

Thanks!
-- Daniel



Re: [PATCH] sched/deadline: Reduce rq lock contention in dl_add_task_root_domain()

2021-01-19 Thread Valentin Schneider
On 19/01/21 09:35, Dietmar Eggemann wrote:
> dl_add_task_root_domain() is called during sched domain rebuild:
>
>   rebuild_sched_domains_locked()
> partition_and_rebuild_sched_domains()
>   rebuild_root_domains()
>  for all top_cpuset descendants:
>update_tasks_root_domain()
>  for all tasks of cpuset:
>dl_add_task_root_domain()
>
> Change it so that only the task pi lock is taken to check if the task
> has a SCHED_DEADLINE (DL) policy. In case that p is a DL task take the
> rq lock as well to be able to safely de-reference root domain's DL
> bandwidth structure.
>

A task's policy is stable under ->pi_lock, so LGTM.

Reviewed-by: Valentin Schneider 

> Most of the tasks will have another policy (namely SCHED_NORMAL) and
> can now bail without taking the rq lock.
>
> One thing to note here: Even in case that there aren't any DL user
> tasks, a slow frequency switching system with cpufreq gov schedutil has
> a DL task (sugov) per frequency domain running which participates in DL
> bandwidth management.
>
> Reviewed-by: Quentin Perret 
> Signed-off-by: Dietmar Eggemann 


[PATCH] sched/deadline: Reduce rq lock contention in dl_add_task_root_domain()

2021-01-19 Thread Dietmar Eggemann
dl_add_task_root_domain() is called during sched domain rebuild:

  rebuild_sched_domains_locked()
partition_and_rebuild_sched_domains()
  rebuild_root_domains()
 for all top_cpuset descendants:
   update_tasks_root_domain()
 for all tasks of cpuset:
   dl_add_task_root_domain()

Change it so that only the task pi lock is taken to check if the task
has a SCHED_DEADLINE (DL) policy. In case that p is a DL task take the
rq lock as well to be able to safely de-reference root domain's DL
bandwidth structure.

Most of the tasks will have another policy (namely SCHED_NORMAL) and
can now bail without taking the rq lock.

One thing to note here: Even in case that there aren't any DL user
tasks, a slow frequency switching system with cpufreq gov schedutil has
a DL task (sugov) per frequency domain running which participates in DL
bandwidth management.

Reviewed-by: Quentin Perret 
Signed-off-by: Dietmar Eggemann 
---

The use case in which this makes a noticeable difference is Android's
'CPU pause' power management feature.

It uses CPU hotplug control to clear a CPU from active state to force
all threads which are not per-cpu kthreads away from this CPU.

Making DL bandwidth management faster during sched domain rebuild helps
to reduce the time to pause/un-pause a CPU.

 kernel/sched/deadline.c | 11 +++
 1 file changed, 7 insertions(+), 4 deletions(-)

diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
index 5421782fe897..c7b1a63a053b 100644
--- a/kernel/sched/deadline.c
+++ b/kernel/sched/deadline.c
@@ -2409,9 +2409,13 @@ void dl_add_task_root_domain(struct task_struct *p)
struct rq *rq;
struct dl_bw *dl_b;
 
-   rq = task_rq_lock(p, );
-   if (!dl_task(p))
-   goto unlock;
+   raw_spin_lock_irqsave(>pi_lock, rf.flags);
+   if (!dl_task(p)) {
+   raw_spin_unlock_irqrestore(>pi_lock, rf.flags);
+   return;
+   }
+
+   rq = __task_rq_lock(p, );
 
dl_b = >rd->dl_bw;
raw_spin_lock(_b->lock);
@@ -2420,7 +2424,6 @@ void dl_add_task_root_domain(struct task_struct *p)
 
raw_spin_unlock(_b->lock);
 
-unlock:
task_rq_unlock(rq, p, );
 }
 
-- 
2.25.1