Hi guys,        

The question
------------------
What are the possible drawbacks of enabling CONFIG_HOTPLUG_CPU for CPU
that does not have hardware support for hot-plug?

The question I'd like to ask is architecture agnostic, but the described 
behavior
is observed on MPCore Cortex-A9 CPU with Linux 3.4.59.

The issue
-------------
When Linux Kernel compiled in SMP mode, and CONFIG_HOTPLUG_CPU is not set,
and booted on single core CPU, then warning messages 
"... task blocked for more than 120 seconds ..." starts popping up in dmesg log.

For example:
INFO: task ksoftirqd/1:9 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.

To make the message disappear, CONFIG_HOTPLUG_CPU should be enabled.

Following the example of "ksoftirqd", the root cause of the issue is that
there are as many "ksoftirqd" threads created as CONFIG_NR_CPU
(see cpu_present_mask in kernel/cpu.c file). See below some details on
how "ksoftirqd" task is created and why it is not killed.

Now, the "ksoftirqd" task is *not* killed and just stays around in task queue
till the scheduler shouts "... task blocked ...".

        Details:
        ----------
        The first "ksoftirqd" task (for CPU[0]) is created as part of executing
        the registered early_initcall function spawn_ksoftirqd() in the below
        flow:

        start_kernel() ---> rest_init() ---> kernel_init() --->
        do_pre_smp_initcalls() ---> spawd_ksoftirqd() --->
        cpu_callback(... CPU_ONLINE ...)

        The "ksoftirqd" tasks for CPU[1 .. N-1] are created in the different 
flow.
        First of all, cpu_callback() from kernel/softirq.c ("ksoftirqd" task is
        created in this callback) is registered through CPU notifier in
        spawn_ksoftirqd(). Then this callback is called in the below flow:

        start_kernel() ---> rest_init() ---> kernel_init() --->
        smp_init() ---> for_each_present_cpu(cpu) { cpu_up(cpu) --->
        _cpu_up() ---> __cpu_nofity(CPU_UP_PREPARE) (here "ksoftirqd" task
        is created).

        Right after that, CPU[x] is attempted to be enabled (using __cpu_up) 
and,
        if __cpu_up(cpu) fails, then "ksoftirqd" task is killed using
        __cpu_notify(CPU_UP_CANCELLED) some lines below.

        Now, the code that actually kills the task (using kthread_stop) is 
wrapped up
        with #ifdef CONFIG_HOTPLUG_CPU (see kernel/softirq.c, function 
cpu_callback).

The solution
-----------------
The easy solution is to enable CONFIG_HOTPLUG_CPU which enables
the compilation of the code that kills "ksoftirqd" task.
What is the possible drawback of enabling CONFIG_HOTPLUG_CPU for CPU
that does not have hardware support hot-plug?

Thanks,
--- KostaZ
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to