Hi Bagus,

Thank you very much for the update and for attaching the log.

Having reviewed it, this confirms that the crash is a silent hard lockup
very early in boot (the log ends abruptly at ~0.76 s during pps_core
init) with no oops, BUG, WARNING, or driver errors visible at all. This
shows the problem is not specific to the 6.17 HWE kernel — it is also
happening on the 6.8 mainline series on the exact same HP ProLiant
DL380p Gen8 hardware and with an almost identical kernel command line.

I also noticed you’re using the same power-management tweaks
(pcie_aspm.policy=powersave, libata.force=noncq,
ahci.mobile_lpm_policy=1, IOMMU settings, etc.).

If you’re able, could you test a few quick kernel-parameter experiments
and report back? Please try one change at a time and stress the system
for a day or two under your normal workload:

1. Add pcie_aspm=off (or change to pcie_aspm.policy=performance)  
2. Add nomodeset (this disables the mgag200 DRM driver for the Matrox G200 
graphics)  
3. Temporarily remove libata.force=noncq ahci.mobile_lpm_policy=1

(You can edit them in /etc/default/grub under
GRUB_CMDLINE_LINUX_DEFAULT=, then run sudo update-grub and reboot.)

If you have any iLO remote console logs, NMI/watchdog events, or a kdump
vmcore from any of the crashes, those would be extremely valuable for
the Kernel Team.

I'm marking this now as Triaged as I believe there is enough information
here for the team to look at.

Thanks again — this extra data is really helpful!

** Changed in: linux-hwe-6.17 (Ubuntu)
       Status: Incomplete => Triaged

** Changed in: linux-hwe-6.17 (Ubuntu)
   Importance: Undecided => High

** Summary changed:

- Random double hard crash
+ Random hard crashes / silent lockups on HP ProLiant DL380p Gen8 (kernels 6.8 
and 6.17 HWE)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2148152

Title:
  Random hard crashes / silent lockups on HP ProLiant DL380p Gen8
  (kernels 6.8 and 6.17 HWE)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux-hwe-6.17/+bug/2148152/+subscriptions


-- 
ubuntu-bugs mailing list
[email protected]
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to