On 2019/10/18 15:12, Peter Zijlstra wrote:
On Fri, Oct 18, 2019 at 11:45:54AM +0800, Zhenzhong Duan wrote:
If 'idle=nomwait' is specified or process matching what's in
processor_idle_dmi_table, we should't use MWAIT at bootup stage before
cpuidle driver loaded, even if it's preferred by default on Intel.

Add a check so that HALT instruction is used in those cases.
The comment in idle_setup():

        /*
         * If the boot option of "idle=nomwait" is added,
         * it means that mwait will be disabled for CPU C2/C3
         * states. In such case it won't touch the variable
         * of boot_option_idle_override.
         */
        boot_option_idle_override = IDLE_NOMWAIT;

explicitly states this option is for C2+

Yea, this is confusing. Other place referencing boot_option_idle_override tell me

"idle=nomwait" means not using mwait for all cstates. Maybe 'C2/C3' could be

removed from above comment?

See drivers/acpi/processor_idle.c:

                        if (cx.type == ACPI_STATE_C1 &&
                            (boot_option_idle_override == IDLE_NOMWAIT)) {
                                /*
                                 * In most cases the C1 space_id obtained from                                  * _CST object is FIXED_HARDWARE access mode.                                  * But when the option of idle=halt is added,                                  * the entry_method type should be changed from
                                 * CSTATE_FFH to CSTATE_HALT.
                                 * When the option of idle=nomwait is added,
                                 * the C1 entry_method type should be
                                 * CSTATE_HALT.
                                 */
                                cx.entry_method = ACPI_CSTATE_HALT;
                                snprintf(cx.desc, ACPI_CX_DESC_LEN, "ACPI HLT");
                        }

and drivers/acpi/processor_pdc.c:

        if (boot_option_idle_override == IDLE_NOMWAIT) {
                /*
                 * If mwait is disabled for CPU C-states, the C2C3_FFH access
                 * mode will be disabled in the parameter of _PDC object.
                 * Of course C1_FFH access mode will also be disabled.
                 */
                union acpi_object *obj;
                u32 *buffer = NULL;

                obj = pdc_in->pointer;
                buffer = (u32 *)(obj->buffer.pointer);
                buffer[2] &= ~(ACPI_PDC_C_C2C3_FFH | ACPI_PDC_C_C1_FFH);

        }


Signed-off-by: Zhenzhong Duan <zhenzhong.d...@oracle.com>
Cc: Thomas Gleixner <t...@linutronix.de>
Cc: Borislav Petkov <b...@alien8.de>
Cc: Ingo Molnar <mi...@redhat.com>
Cc: "H. Peter Anvin" <h...@zytor.com>
Cc: Boris Ostrovsky <boris.ostrov...@oracle.com>
---
  arch/x86/kernel/process.c | 4 ++++
  1 file changed, 4 insertions(+)

diff --git a/arch/x86/kernel/process.c b/arch/x86/kernel/process.c
index 5e94c43..37fc577 100644
--- a/arch/x86/kernel/process.c
+++ b/arch/x86/kernel/process.c
@@ -667,6 +667,10 @@ static void amd_e400_idle(void)
   */
  static int prefer_mwait_c1_over_halt(const struct cpuinfo_x86 *c)
  {
+       /* Don't use MWAIT-C1 if explicitly requested */
+       if (boot_option_idle_override == IDLE_NOMWAIT)
+               return 0;
And this is very much about C1...

OTOH, "idle=halt" should be forcing HLT over MWAIT, so did you want to
write:

        if (boot_option_idle_override == IDLE_HALT)
                return 0;

instead?

I think it's not necessory, if 'idle=halt' specified, select_idle_routine() returns early,

prefer_mwait_c1_over_halt() is never called.

Zhenzhong

Reply via email to