Public bug reported:

We've started running the systemd integration tests on GitHub
Actions, and noticed a soft lockup in one of our VMs:

"""
[  245.130625] INFO: task khugepaged:38 blocked for more than 122 seconds.
[  245.131669]       Not tainted 6.8.7-arch1-1 #1
[  245.132371] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  245.134651] INFO: task test-journal-ve:4563 blocked for more than 122 seconds.
[  245.136035]       Not tainted 6.8.7-arch1-1 #1
[  245.136921] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  253.516332] systemd[1]: Unmounting efi.mount...
[  253.535696] systemd[1]: efi.mount: Deactivated successfully.
[  253.537261] systemd[1]: Unmounted efi.mount.
[  345.502294] systemd[1006]: Created slice background.slice.
[  345.504159] systemd[1006]: Starting systemd-tmpfiles-clean.service...
[  345.537927] systemd[1006]: Finished systemd-tmpfiles-clean.service.
[  368.017373] INFO: task khugepaged:38 blocked for more than 245 seconds.
[  368.019120]       Not tainted 6.8.7-arch1-1 #1
[  368.020219] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  368.021719] INFO: task test-journal-ve:4563 blocked for more than 245 seconds.
[  368.023485]       Not tainted 6.8.7-arch1-1 #1
[  368.024593] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  490.891682] INFO: task khugepaged:38 blocked for more than 368 seconds.
[  490.893456]       Not tainted 6.8.7-arch1-1 #1
[  490.894594] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  490.896750] INFO: task test-journal-ve:4563 blocked for more than 368 seconds.
[  490.897930]       Not tainted 6.8.7-arch1-1 #1
[  490.898880] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
"""
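
For anyone checking their own CI logs for the same symptom, here is a
minimal sketch of a script that pulls the hung-task warnings out of a
dmesg capture and reports the longest blocked time per task. The
function name find_hung_tasks and the regular expression are
illustrative assumptions based only on the message format quoted above;
this is not an existing tool.

```python
import re

# Matches the hung-task warning as it appears in the log above, e.g.
# "[  245.130625] INFO: task khugepaged:38 blocked for more than 122 seconds."
# The \s+ around the number tolerates lines that a mail client wrapped.
HUNG_TASK_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<comm>.+?):(?P<pid>\d+) "
    r"blocked for more than\s+(?P<secs>\d+)\s+seconds"
)


def find_hung_tasks(log_text):
    """Return {(comm, pid): longest reported blocked time in seconds}."""
    longest = {}
    for m in HUNG_TASK_RE.finditer(log_text):
        key = (m.group("comm"), int(m.group("pid")))
        longest[key] = max(longest.get(key, 0), int(m.group("secs")))
    return longest


if __name__ == "__main__":
    import sys

    # Read a dmesg capture from stdin and print one summary line per task.
    for (comm, pid), secs in sorted(find_hung_tasks(sys.stdin.read()).items()):
        print(f"{comm} (pid {pid}): blocked for more than {secs} s")
```

Saved under a hypothetical name such as hung_tasks.py, it could be run
as "dmesg | python3 hung_tasks.py" inside the affected VM.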

This is the same lockup that was already investigated in
https://github.com/systemd/systemd-centos-ci/issues/720, where it was
traced to a commit that had not been backported to the *host* kernel,
specifically d02c357e5bfa7dfd618b7b3015624beb71f58f1f ("KVM: x86/mmu:
Retry fault before acquiring mmu_lock if mapping is changing"). See the
linked issue for all the details.

Can this commit please be backported to linux-azure to fix these soft
lockups in VMs running on GitHub Actions?

** Affects: linux-azure (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/2062977

Title:
  Please backport d02c357e5bfa7dfd618b7b3015624beb71f58f1f
