On 9/5/25 11:01 AM, Marco Crivellari wrote:
> Currently if a user enqueue a work item using schedule_delayed_work() the
> used wq is "system_wq" (per-cpu wq) while queue_delayed_work() use
> WORK_CPU_UNBOUND (used when a cpu is not specified). The same applies to
> schedule_work() that is using system_wq and queue_work(), that makes use
> again of WORK_CPU_UNBOUND.
> 
> This lack of consistentcy cannot be addressed without refactoring the API.
> 
> system_wq is a per-CPU worqueue, yet nothing in its name tells about that
> CPU affinity constraint, which is very often not required by users. Make
> it clear by adding a system_percpu_wq.
> 
> queue_work() / queue_delayed_work() mod_delayed_work() will now use the
> new per-cpu wq: whether the user still stick on the old name a warn will
> be printed along a wq redirect to the new one.
> 
> This patch add the new system_percpu_wq except for mm, fs and net
> subsystem, whom are handled in separated patches.
> 
> The old wq will be kept for a few release cylces.
> 
> Suggested-by: Tejun Heo <t...@kernel.org>
> Signed-off-by: Marco Crivellari <marco.crivell...@suse.com>
> ---
>  kernel/module/dups.c | 4 ++--
>  1 file changed, 2 insertions(+), 2 deletions(-)
> 
> diff --git a/kernel/module/dups.c b/kernel/module/dups.c
> index bd2149fbe117..e72fa393a2ec 100644
> --- a/kernel/module/dups.c
> +++ b/kernel/module/dups.c
> @@ -113,7 +113,7 @@ static void kmod_dup_request_complete(struct work_struct 
> *work)
>        * let this linger forever as this is just a boot optimization for
>        * possible abuses of vmalloc() incurred by finit_module() thrashing.
>        */
> -     queue_delayed_work(system_wq, &kmod_req->delete_work, 60 * HZ);
> +     queue_delayed_work(system_percpu_wq, &kmod_req->delete_work, 60 * HZ);
>  }
>  
>  bool kmod_dup_request_exists_wait(char *module_name, bool wait, int *dup_ret)
> @@ -240,7 +240,7 @@ void kmod_dup_request_announce(char *module_name, int ret)
>        * There is no rush. But we also don't want to hold the
>        * caller up forever or introduce any boot delays.
>        */
> -     queue_work(system_wq, &kmod_req->complete_work);
> +     queue_work(system_percpu_wq, &kmod_req->complete_work);
>  
>  out:
>       mutex_unlock(&kmod_dup_mutex);

The two work items queued by the dups.c code can run anywhere. I don't
see a reason why they need to be bound to a specific CPU.

If I understand the cover letter and its linked discussion correctly,
the aim is to eventually move users to unbound workqueues unless they
really need to use per-CPU workqueues.

If it helps, I believe you can already update this code to use the new
system_dfl_wq.

-- 
Thanks,
Petr

Reply via email to