I've tried to get my hands a bit dirty on this bug to be more helpful than
just forwarding messages to Intel and back.
So, I've selected two Intel Xeon Sapphire Rapids (SPR) machines[1][2] to
perform tests and with the results I don't feel confident that we can 100%
blame intel-microcode just yet. With one of the machines[2] on jammy and
with latest microcode and hwe-6.8 kernel the issue doesn't reproduce. And
also, given that with kernel 5.15 and latest microcode the issue doesn't
happen could be seen as another clue that maybe we are dealing with a bug
in a different place. One thing that I tried when chasing for machines was
to try different GPU configurations, could it be related somehow? One has
a nvidia GPU[1] while the other doesn't[2].

Kevin, do you have access to these machines, or similar configurations
variations, and could try to help us out with your tests?

Below are my simplistic test routine in details:
1. Initial state
machine[1]:
   ubuntu@hidon:~$ uname -a
   Linux hidon 5.15.0-151-generic #161-Ubuntu SMP Tue Jul 22 14:25:40 UTC 2025 
x86_64 x86_64 x86_64 GNU/Linux
   ubuntu@hidon:~$ apt-cache policy intel-microcode
   intel-microcode:
   Installed: 3.20250512.0ubuntu0.22.04.1
   Candidate: 3.20250512.0ubuntu0.22.04.1
   Version table:
   *** 3.20250512.0ubuntu0.22.04.1 500
         500 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 Packages
         500 http://archive.ubuntu.com/ubuntu jammy-security/main amd64 Packages
         100 /var/lib/dpkg/status
      3.20210608.2ubuntu1 500
         500 http://archive.ubuntu.com/ubuntu jammy/main amd64 Packages

machine[2]:
   ubuntu@smilax:~$ uname -a
   Linux smilax 5.15.0-151-generic #161-Ubuntu SMP Tue Jul 22 14:25:40 UTC 2025 
x86_64 x86_64 x86_64 GNU/Linux
   ubuntu@smilax:~$ apt-cache policy intel-microcode
   intel-microcode:
   Installed: 3.20250512.0ubuntu0.22.04.1
   Candidate: 3.20250512.0ubuntu0.22.04.1
   Version table:
   *** 3.20250512.0ubuntu0.22.04.1 500
         500 http://archive.ubuntu.com/ubuntu jammy-updates/main amd64 Packages
         500 http://archive.ubuntu.com/ubuntu jammy-security/main amd64 Packages
         100 /var/lib/dpkg/status
      3.20210608.2ubuntu1 500
         500 http://archive.ubuntu.com/ubuntu jammy/main amd64 Packages

2. $ sudo rtcwake -m no -s 60;sudo systemctl suspend
   * machine[1] and machine[2]: system came back after ~60 secs.
3. $ sudo apt install linux-generic-hwe-22.04 #-> upgrade the kernel to 
6.8.0-65.68~22.04.1
4. $ sudo reboot
5. $ sudo rtcwake -m no -s 60;sudo systemctl suspend
   * machine[1]: system got stuck
   * machine[2]: system came back after ~60 secs.

Beside the simplistic tests, I have tried noble on machine[1] and
downgraded the microcode version and the issue was still reproducible,
which was quite interesting:
   $ uname -a
   Linux hidon 6.8.0-71-generic #71-Ubuntu SMP PREEMPT_DYNAMIC Tue Jul 22 
16:52:38 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux
   $ sudo apt install intel-microcode=3.20240312.1build1
   ubuntu@hidon:~$ apt-cache policy intel-microcode
   intel-microcode:
     Installed: 3.20240312.1build1
     Candidate: 3.20250512.0ubuntu0.24.04.1
     Version table:
        3.20250512.0ubuntu0.24.04.1 500
           500 http://archive.ubuntu.com/ubuntu noble-updates/main amd64 
Packages
           500 http://archive.ubuntu.com/ubuntu noble-security/main amd64 
Packages
    *** 3.20240312.1build1 500
           500 http://archive.ubuntu.com/ubuntu noble/main amd64 Packages
           100 /var/lib/dpkg/status
   $ sudo reboot
   $ sudo rtcwake -m no -s 60;sudo systemctl suspend
   * system got stuck

[1] https://certification.canonical.com/hardware/202311-32290/
[2] https://certification.canonical.com/hardware/202310-32219/

** Changed in: intel-microcode (Ubuntu)
       Status: New => Incomplete

** Changed in: intel-microcode (Ubuntu)
     Assignee: (unassigned) => Rodrigo Figueiredo Zaiden (rodrigo-zaiden)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux-hwe-6.8 in Ubuntu.
https://bugs.launchpad.net/bugs/2115854

Title:
  The system didn’t go into suspend and got stuck

Status in intel-microcode package in Ubuntu:
  Incomplete
Status in linux-hwe-6.8 package in Ubuntu:
  Invalid

Bug description:
  [Summary]

  During the SRU testing, I found there are many "workstation" machines didn't 
go into suspend.
  After executing systemctl suspend, I can see the screen was off, but the 
power led didn't shows a breathing LED, then the system got stuck, only way to 
recover it was by power cycling.

  After power cycling I used "journalctl -b -1" to see the previous log
  and saw following log.

  Jul 03 06:58:18 ubuntu /usr/libexec/gdm-x-session[1314]: (II) systemd-logind: 
got pause for 13:66
  Jul 03 06:58:18 ubuntu /usr/libexec/gdm-x-session[1314]: (II) systemd-logind: 
got pause for 13:69
  Jul 03 06:58:18 ubuntu systemd[1]: nvidia-suspend.service: Deactivated 
successfully.
  Jul 03 06:58:18 ubuntu systemd[1]: Finished NVIDIA system suspend actions.
  Jul 03 06:58:18 ubuntu systemd[1]: Starting System Suspend...
  Jul 03 06:58:18 ubuntu systemd-sleep[10113]: Entering sleep state 'suspend'...
  Jul 03 06:58:18 ubuntu kernel: PM: suspend entry (deep)


  [Expected result]

  The system should wake up by pressing the power button or by rtc
  wakeup event.

  [Actual result]

  The system got stuck.

  [Additional information]

  Following are the DUTs that are impacted.
  https://certification.canonical.com/hardware/202301-31152/
  https://certification.canonical.com/hardware/202301-31145/
  https://certification.canonical.com/hardware/202301-31147/
  https://certification.canonical.com/hardware/202301-31151/
  https://certification.canonical.com/hardware/202405-34048/
  https://certification.canonical.com/hardware/202405-34049/
  https://certification.canonical.com/hardware/202310-32234/
  https://certification.canonical.com/hardware/202305-31615/
  https://certification.canonical.com/hardware/202303-31328/
  https://certification.canonical.com/hardware/202209-30600/
  https://certification.canonical.com/hardware/202209-30601/

  ProblemType: Bug
  DistroRelease: Ubuntu 22.04
  Package: linux-image-6.8.0-62-generic 6.8.0-62.65~22.04.1
  ProcVersionSignature: Ubuntu 6.8.0-62.65~22.04.1-generic 6.8.12
  Uname: Linux 6.8.0-62-generic x86_64
  NonfreeKernelModules: nvidia_modeset nvidia
  ApportVersion: 2.20.11-0ubuntu82.8
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC0:  ubuntu     1266 F.... pulseaudio
   /dev/snd/controlC1:  ubuntu     1266 F.... pulseaudio
  CasperMD5CheckMismatches: ./preseed/project.cfg
  CasperMD5CheckResult: fail
  Date: Thu Jul  3 13:56:20 2025
  DistributionChannelDescriptor:
   # This is the distribution channel descriptor for the OEM CDs
   # For more information see 
http://wiki.ubuntu.com/DistributionChannelDescriptor
   canonical-oem-stella-jammy-amd64-20230524-405
  InstallationDate: Installed on 2025-07-02 (0 days ago)
  InstallationMedia: Ubuntu 22.04 LTS "Jammy Jellyfish" - 
pc-stella-jammy-amd64-20230524-405
  MachineType: HP HP Z4 G5 Workstation Desktop PC
  ProcEnviron:
   TERM=xterm-256color
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 simpledrmdrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.8.0-62-generic 
root=UUID=3ad449c1-8055-4491-8e0e-084d0cbd51c3 ro automatic-oem-config quiet 
splash vt.handoff=7
  PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No 
PulseAudio daemon running, or not running as session daemon.
  RelatedPackageVersions:
   linux-restricted-modules-6.8.0-62-generic N/A
   linux-backports-modules-6.8.0-62-generic  N/A
   linux-firmware                            20220329.git681281e4-0ubuntu3.37
  RfKill:
   
  SourcePackage: linux-hwe-6.8
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 04/19/2023
  dmi.bios.release: 1.15
  dmi.bios.vendor: HP
  dmi.bios.version: U61 Ver. 01.01.15
  dmi.board.name: 8962
  dmi.board.vendor: HP
  dmi.board.version: KBC Version 11.14.00
  dmi.chassis.type: 3
  dmi.chassis.vendor: HP
  dmi.ec.firmware.release: 17.20
  dmi.modalias: 
dmi:bvnHP:bvrU61Ver.01.01.15:bd04/19/2023:br1.15:efr17.20:svnHP:pnHPZ4G5WorkstationDesktopPC:pvr:rvnHP:rn8962:rvrKBCVersion11.14.00:cvnHP:ct3:cvr:sku57K36AV:
  dmi.product.family: 103C_53335X HP Workstation
  dmi.product.name: HP Z4 G5 Workstation Desktop PC
  dmi.product.sku: 57K36AV
  dmi.sys.vendor: HP

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/intel-microcode/+bug/2115854/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to