[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux-azure- fips/6.8.0-1034.39+fips1 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-noble-linux-azure-fips' to 'verification-done-noble-linux-azure-fips'. If the problem still exists, change the tag 'verification-needed-noble-linux-azure-fips' to 'verification-failed-noble-linux-azure-fips'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-noble-linux-azure-fips-v2 verification-needed-noble-linux-azure-fips -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux-fips/6.8.0-72.72+fips1 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-noble-linux-fips' to 'verification-done-noble- linux-fips'. If the problem still exists, change the tag 'verification- needed-noble-linux-fips' to 'verification-failed-noble-linux-fips'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-noble-linux-fips-v2 verification-needed-noble-linux-fips -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Tags added: kernel-daily-bug -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux-gcp-tcpx/6.8.0-1002.3 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-jammy-linux-gcp-tcpx' to 'verification-done- jammy-linux-gcp-tcpx'. If the problem still exists, change the tag 'verification-needed-jammy-linux-gcp-tcpx' to 'verification-failed- jammy-linux-gcp-tcpx'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-jammy-linux-gcp-tcpx-v2 verification-needed-jammy-linux-gcp-tcpx -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux-gkeop/6.8.0-1002.4 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-noble-linux-gkeop' to 'verification-done-noble- linux-gkeop'. If the problem still exists, change the tag 'verification- needed-noble-linux-gkeop' to 'verification-failed-noble-linux-gkeop'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-noble-linux-gkeop-v2 verification-needed-noble-linux-gkeop -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Changed in: ubuntu-power-systems Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug was fixed in the package linux - 6.8.0-48.48 --- linux (6.8.0-48.48) noble; urgency=medium * noble/linux: 6.8.0-48.48 -proposed tracker (LP: #2082437) * [SRU][Noble] Bad EPP defaults cause performance regressions on select Intel CPUs (LP: #2077470) - x86/cpu/vfm: Update arch/x86/include/asm/intel-family.h - cpufreq: intel_pstate: Allow model specific EPPs - cpufreq: intel_pstate: Update default EPPs for Meteor Lake - cpufreq: intel_pstate: Switch to new Intel CPU model defines - cpufreq: intel_pstate: Update Meteor Lake EPPs - cpufreq: intel_pstate: Use Meteor Lake EPPs for Arrow Lake - cpufreq: intel_pstate: Update Balance performance EPP for Emerald Rapids * power: Enable intel_rapl driver (LP: #2078834) - powercap: intel_rapl: Add support for ArrowLake-H platform * x86/vmware: Add TDX hypercall support (LP: #2077729) - x86/vmware: Introduce VMware hypercall API - x86/vmware: Add TDX hypercall support * Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 (LP: #2076866) - mm/mempolicy: use numa_node_id() instead of cpu_to_node() - mm/numa_balancing: allow migrate on protnone reference with MPOL_PREFERRED_MANY policy - mm: convert folio_estimated_sharers() to folio_likely_mapped_shared() - mm: factor out the numa mapping rebuilding into a new helper - mm: support multi-size THP numa balancing - mm/migrate: make migrate_misplaced_folio() return 0 on success - mm/migrate: move NUMA hinting fault folio isolation + checks under PTL - mm: fix possible OOB in numa_rebuild_large_mapping() * Add 'mm: hold PTL from the first PTE while reclaiming a large folio' to fix L2 Guest hang during LTP Test (LP: #2076147) - mm: hold PTL from the first PTE while reclaiming a large folio * KOP L2 guest fails to boot with 1 core - SMT8 topology (LP: #2070329) - KVM: PPC: Book3S HV nestedv2: Add DPDES support in helper library for Guest state buffer - KVM: PPC: Book3S HV nestedv2: Fix doorbell emulation * L2 Guest migration: continuously dumping while running NFS guest migration (LP: #2076406) - KVM: PPC: Book3S HV: Fix the set_one_reg for MMCR3 - KVM: PPC: Book3S HV: Fix the get_one_reg of SDAR - KVM: PPC: Book3S HV: Add one-reg interface for DEXCR register - KVM: PPC: Book3S HV nestedv2: Keep nested guest DEXCR in sync - KVM: PPC: Book3S HV: Add one-reg interface for HASHKEYR register - KVM: PPC: Book3S HV nestedv2: Keep nested guest HASHKEYR in sync - KVM: PPC: Book3S HV: Add one-reg interface for HASHPKEYR register - KVM: PPC: Book3S HV nestedv2: Keep nested guest HASHPKEYR in sync * perf build disables tracepoint support (LP: #2076190) - [Packaging] perf: reenable libtraceevent * Please backport the more restrictive XSAVES deactivation for Zen1/2 arch (LP: #2077321) - x86/CPU/AMD: Improve the erratum 1386 workaround * Fix alsa scarlett2 driver in 6.8 (LP: #2076402) - ALSA: scarlett2: Move initialisation code lower in the source - ALSA: scarlett2: Implement handling of the ACK notification * rtw89: reset IDMEM mode to prevent download firmware failure (LP: #2077396) - wifi: rtw89: 885xb: reset IDMEM mode to prevent download firmware failure * CVE-2024-43858 - jfs: Fix array-index-out-of-bounds in diFree * CVE-2024-42280 - mISDN: Fix a use after free in hfcmulti_tx() * CVE-2024-42271 - net/iucv: fix use after free in iucv_sock_close() * [Ubuntu-24.04] FADump with recommended crash size is making the L1 hang (LP: #2060039) - powerpc/64s/radix/kfence: map __kfence_pool at page granularity * Noble update: upstream stable patchset 2024-09-09 (LP: #2079945) - ocfs2: add bounds checking to ocfs2_check_dir_entry() - jfs: don't walk off the end of ealist - fs/ntfs3: Add a check for attr_names and oatbl - fs/ntfs3: Validate ff offset - usb: gadget: midi2: Fix incorrect default MIDI2 protocol setup - ALSA: hda/realtek: Enable headset mic on Positivo SU C1400 - ALSA: hda/realtek: Fix the speaker output on Samsung Galaxy Book Pro 360 - arm64: dts: qcom: qrb4210-rb2: switch I2C2 to i2c-gpio - arm64: dts: qcom: msm8996: Disable SS instance in Parkmode for USB - arm64: dts: qcom: sm6350: Disable SS instance in Parkmode for USB - arm64: dts: qcom: ipq6018: Disable SS instance in Parkmode for USB - arm64: dts: qcom: sdm630: Disable SS instance in Parkmode for USB - ALSA: pcm_dmaengine: Don't synchronize DMA channel when DMA is paused - ALSA: seq: ump: Skip useless ports for static blocks - filelock: Fix fcntl/close race recovery compat path - tun: add missing verification for short frame - tap: add missing verification for short frame - s390/mm: Fix VM_FAULT_HWPOISON handling in do_exception() - ALSA: hda/tas2781: Add new quirk for Lenovo Hera2 Laptop - arm64: dts: qcom: sc7180: Disable S
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
Great Anushree! Thanks for your verification on this - that gives confidence to roll out the kernel! -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux-xilinx/6.8.0-1009.10 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-noble-linux-xilinx' to 'verification-done- noble-linux-xilinx'. If the problem still exists, change the tag 'verification-needed-noble-linux-xilinx' to 'verification-failed-noble- linux-xilinx'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-noble-linux-xilinx-v2 verification-needed-noble-linux-xilinx -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
Ok, many thx, Anushree! -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
--- Comment From [email protected] 2024-10-16 13:44 EDT--- I updated the logs for the different bugzilla to this bugzilla. Now updating the bugzilla along with proper logs to specify that it is working fine! HOST[SOURCE](MDC): OS : ubuntu Kernel: 6.8.0-48-generic qemu : QEMU emulator version 8.2.2 (Debian 1:8.2.2+ds-0ubuntu1.2) libvirt : libvirtd (libvirt) 10.0.0 HOST[DESTINATION](NON-MDC) OS: ubuntu Kernel: 6.8.0-48-generic qemu : QEMU emulator version 8.2.2 (Debian 1:8.2.2+ds-0ubuntu1.2) libvirt : libvirtd (libvirt) 10.0.0 GUEST: OS : ubuntu kernel: 6.8.0-47-generic I started the ltp memstress test and started the migration of the guest from MDC to non-MDC system and after 2 hours i saw that the system was still up! Setps i tried: 1) Start the LTP memstress test root@ubuntu:/opt/ltp ./runltp -f controllers -s memcg_stress -t 120m --- INFO: runltp script is deprecated, try kirk https://github.com/linux-test-project/kirk --- Checking for required user/group ids 'root' user id and group found. 'nobody' user id and group found. 'bin' user id and group found. 'daemon' user id and group found. Users group found. Sys group found. Required users/groups exist. If some fields are empty or look unusual you may have an old version. Compare to the current minimal requirements in Documentation/Changes. /etc/lsb-release /etc/os-release DISTRIB_ID=Ubuntu DISTRIB_RELEASE=24.04 DISTRIB_CODENAME=noble DISTRIB_DESCRIPTION="Ubuntu 24.04.1 LTS" PRETTY_NAME="Ubuntu 24.04.1 LTS" NAME="Ubuntu" VERSION_ID="24.04" VERSION="24.04.1 LTS (Noble Numbat)" VERSION_CODENAME=noble ID=ubuntu ID_LIKE=debian HOME_URL="https://www.ubuntu.com/"; SUPPORT_URL="https://help.ubuntu.com/"; BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"; PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"; UBUNTU_CODENAME=noble LOGO=ubuntu-logo uname: Linux ubuntu 6.8.0-47-generic #47-Ubuntu SMP Fri Sep 27 21:38:55 UTC 2024 ppc64le ppc64le ppc64le GNU/Linux /proc/cmdline BOOT_IMAGE=/vmlinux-6.8.0-47-generic root=/dev/mapper/ubuntu--vg-ubuntu--lv ro crashkernel=4096M Gnu C gcc (Ubuntu 13.2.0-23ubuntu4) 13.2.0 Clang Gnu make 4.3 util-linux 2.39.3 mount linux 2.39.3 (libmount 2.39.3: selinux, smack, btrfs, verity, namespaces, idmapping, statx, assert, debug) modutils 31 e2fsprogs 1.47.0 Linux C Librarygnu/libc.so.6 Dynamic linker (ldd) 2.39 Procps 4.0.4 iproute2 1.3.0 iputils20240117 ethtool6.7 Sh-utils 9.4 Modules Loaded xt_tcpudp nft_compat nf_tables qrtr cfg80211 binfmt_misc uio_pdrv_genirq vmx_crypto uio sch_fq_codel dm_multipath nfnetlink ip_tables x_tables autofs4 btrfs blake2b_generic raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 ibmvscsi poly1305_p10_crypto crct10dif_vpmsum xhci_pci crc32c_vpmsum xhci_pci_renesas aes_gcm_p10_crypto cpuinfo: Architecture: ppc64le Byte Order: Little Endian CPU(s): 16 On-line CPU(s) list:0-15 Model name: POWER10 (architected), altivec supported Model: 2.0 (pvr 0080 0200) Thread(s) per core: 2 Core(s) per socket: 8 Socket(s): 1 Virtualization features: Hypervisor vendor: KVM Virtualization type:para Caches (sum of all): L1d:256 KiB (8 instances) L1i:384 KiB (8 instances) NUMA: NUMA node(s): 1 NUMA node0 CPU(s): 0-15 Vulnerabilities: Gather data sampling: Not affected Itlb multihit: Not affected L1tf: Mitigation; RFI Flush, L1D private per thread Mds:Not affected Meltdown: Mitigation; RFI Flush, L1D private per thread Mmio stale data:Not affected Reg file data sampling: Not affected Retbleed: Not affected Spec rstack overflow: Not affected Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio) Spectre v1: Mitigation; __user pointer sanitization, ori31 specula tion barrier enabled Spectre v2: Mitigation; Software count cache flush (hardware accel erated), Software link stack flush Srbds: Not affected Tsx async abort:Not affected free reports: totalusedfree shared buff/cache available Mem:26067968 109952024681664 25024 46560024968448 Swap: 0 0 0 memory (/proc/meminfo): MemTotal: 26067968 kB MemFree:24681664 kB MemAvailable: 24968448 kB Buffers: 16448 kB Cached: 415488 kB SwapCached:0 kB Active: 454848 kB Inactive: 108800 kB Active(anon): 118336 kB Inactive(anon):50304 kB Active(
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
Many thx Anushree for the successful verification! (I'm updating the tags accordingly ...) ** Tags removed: verification-needed-noble-linux ** Tags added: verification-done-noble-linux -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
--- Comment From [email protected] 2024-10-14 03:01 EDT--- I have tried the scenario as mentioned here https://bugzilla.linux.ibm.com/show_bug.cgi?id=207985 and it is working fine in my case. The only difference is i have tried with default bridge not with macvtap. Here is my analysis: HOST(SOURCE & DESTINATION)- kernel: 6.8.0-45-generic qemu : QEMU emulator version 8.2.2 (Debian 1:8.2.2+ds-0ubuntu1.2) libvirt : libvirtd (libvirt) 10.0.0 GUEST- kernel: 6.8.0-45-generic Steps i tried: 1) Create an NFS backed guest[default bridge] 2) Start htx mdt.less o/p root@ubuntu:~/HTX# htxcmdline -sut localhost -run -mdt mdt.less Result Starts Here ECG (/usr/lpp/htx/mdt/mdt.less) Activated. # Result Ends Here # 3) virsh migrate --live --domain $guest qemu+ssh://$target/system --verbose --undefinesource --persistent --timeout 120 Migration: [100.00 %] Guest migration worked perfectly fine! Didn't see any trace messages. Tried migration 5-6 times back and forth! Thanks, Anushree Mathur ** Bug watch added: bugzilla.linux.ibm.com/ #207985 https://bugzilla.linux.ibm.com/show_bug.cgi?id=207985 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux- oracle-6.8/6.8.0-1015.15~22.04.1 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-jammy-linux- oracle-6.8' to 'verification-done-jammy-linux-oracle-6.8'. If the problem still exists, change the tag 'verification-needed-jammy-linux- oracle-6.8' to 'verification-failed-jammy-linux-oracle-6.8'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-jammy-linux-oracle-6.8-v2 verification-needed-jammy-linux-oracle-6.8 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux- aws-6.8/6.8.0-1018.19~22.04.1 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-jammy-linux- aws-6.8' to 'verification-done-jammy-linux-aws-6.8'. If the problem still exists, change the tag 'verification-needed-jammy-linux-aws-6.8' to 'verification-failed-jammy-linux-aws-6.8'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-jammy-linux-aws-6.8-v2 verification-needed-jammy-linux-aws-6.8 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
This bug is awaiting verification that the linux/6.8.0-48.48 kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-noble-linux' to 'verification-done-noble-linux'. If the problem still exists, change the tag 'verification-needed-noble- linux' to 'verification-failed-noble-linux'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: kernel-spammed-noble-linux-v2 verification-needed-noble-linux -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Changed in: ubuntu-power-systems Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Changed in: linux (Ubuntu Noble) Importance: Undecided => High ** Changed in: linux (Ubuntu Noble) Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
-- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Changed in: linux (Ubuntu Oracular) Status: Fix Committed => Fix Released -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
Pull request submitted to kernel team's mailing list: https://lists.ubuntu.com/archives/kernel-team/2024-September/thread.html#153390 changing status to 'In Progress'. ** Changed in: linux (Ubuntu Noble) Status: Triaged => In Progress ** Changed in: ubuntu-power-systems Status: Triaged => In Progress ** Changed in: linux (Ubuntu Noble) Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Description changed: SRU Justification: [ Impact ] * A KVM guest (VM) that got live migrated between two Power 10 systems (using nested virtualization, means KVM on top of PowerVM) will highly likely crash after about an hour. * At that point it looked like the live migration itself was already successful, but it wasn't, and the crash is caused due to it. [ Test Plan ] * Setting up two Power 10 systems (with firmware level FW1060 or newer, that supports nested KVM) with Ubuntu Server 24.04 for ppc64el. * Setup a qemu/KVM environment that allows to live migrate a KVM guest from one P10 system to the other. * (The disk type does not seem to matter, hence NFS based disk storage can be used for example). * After about an hour the live migrated guest is likely to crash. Hence wait for 2 hours (which increases the likeliness) and a crash due to: "migrate_misplaced_folio+0x540/0x5d0" occurs. [ Where problems could occur ] * The 'fix' to avoid calling folio_likely_mapped_shared for cases where folio might have already been unmapped and the move of the checks might have an impact on page table locks if done wrong, which may lead to wrong locks, blocked memory and finally crashes. * The direct folio calls in mm/huge_memory.c and mm/memory.c got now 'in-directed', which may lead to a different behaviour and side-effects. However, isolation is still done, just slightly different and instead of using numamigrate_isolate_folio, now in (the renamed) migrate_misplaced_folio_prepare. * Further upstream conversations: https://lkml.kernel.org/r/[email protected] https://lkml.kernel.org/r/[email protected] https://lkml.kernel.org/r/[email protected] * Fixing a confusing return code, now to just return 0, on success is clarifying the return code handling and usage, and was mainly done in preparation of further changes, but can have bad side effects if the return code was used in other code places already as is. * Further upstream conversations: https://lkml.kernel.org/r/[email protected] https://lkml.kernel.org/r/[email protected] * Fixing the fact that NUMA balancing prohibits mTHP (multi-size Transparent Hugepage Support) seems to be unreasonable since its an exclusive mapping. Allowing this seems to bring significant performance improvements see commit message d2136d749d76), but introduced significant changes PTE mapping and modifications and even relies on further commits: 859d4adc3415 ("mm: numa: do not trap faults on shared data section pages") 80d47f5de5e3 ("mm: don't try to NUMA-migrate COW pages that have other uses") This case cause issues on systems configured for THP, may confuse the ordering, which may even lead to memory corruption. And this may especially hit (NUMA) systems with high core numbers, where balancing is more often needed. * Further upstream conversations: https://lore.kernel.org/all/[email protected]/ https://lkml.kernel.org/r/c33a5c0b0a0323b1f8ed53772f50501f4b196e25.1712132950.git.baolin.w...@linux.alibaba.com https://lkml.kernel.org/r/d28d276d599c26df7f38c9de8446f60e22dd1950.1711683069.git.baolin.w...@linux.alibaba.com * The refactoring of the code for NUMA mapping rebuilding and moving it into a new helper, seems to be straight forward, since the active code stays unchanged, however the new function needs to be callable, but this is the case since its all in mm/memory.c. * Further upstream conversations: https://lkml.kernel.org/r/[email protected] https://lkml.kernel.org/r/[email protected] https://lkml.kernel.org/r/8bc2586bdd8dbbe6d83c09b77b360ec8fcac3736.1711683069.git.baolin.w...@linux.alibaba.com * The refactoring of folio_estimated_sharers to folio_likely_mapped_shared is more significant, since the logic changed from (folio_estimated_sharers) 'estimate the number of sharers of a folio' to (folio_likely_mapped_shared) 'estimate if the folio is mapped into the page tables of more than one MM'. * Since this is an estimation, the results may be unpredictable (especially for bigger folios), and not like expected or assumed (there are quite some side-notes in the code comments of bb34f78d72c2, that mention potential fuzzy results), hence this may lead to unforeseen behavior. * The condition statements became clearer since it's now based on (more or less obvious) number counts, but can still be erroneous in case folio_estimated_sharers does incorrect calculations.
[Bug 2076866] Re: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0
** Summary changed: - Guest crahses post migration with migrate_misplaced_folio+0x4cc/0x5d0 + Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2076866 Title: Guest crashes post migration with migrate_misplaced_folio+0x4cc/0x5d0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-power-systems/+bug/2076866/+subscriptions -- ubuntu-bugs mailing list [email protected] https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
