hi, Jeff, On Mon, Aug 19, 2024 at 09:38:19AM +0800, Oliver Sang wrote: > hi, Jeff, > > On Sun, Aug 18, 2024 at 05:28:41PM +0800, Oliver Sang wrote: > > hi, Jeff, > > > > On Thu, Aug 15, 2024 at 07:58:57PM -0700, Jeff Xu wrote: > > > Hi Oliver > > > > [...] > > > > > > could you exlictly point to two commit-id? > > > sure > > > > > > this patch > > > 8be7258a: mseal: add mseal syscall > > > ff388fe5c: mseal: wire up mseal syscall > > > > I failed to apply this patch set to "8be7258a: mseal: add mseal syscall" > > look your patch set again > [PATCH v1 1/2] mseal:selftest mremap across VMA boundaries > just for kselftests > > and I can apply > [PATCH v1 2/2] mseal: refactor mremap to remove can_modify_mm > upon "8be7258a: mseal: add mseal syscall" cleanly > > so I will start test for this [PATCH v1 2/2] > > BTW, I will firstly use our default setting - "60s testtime; reboot between > each > run; run 10 times", since we've already have the data for 8be7258a and > ff388fe5c > then we could give you an update kind of quickly. > > as some private mail discussed, you want some special run method, could you > elaborate them here? thanks
here is a quick update before you give us more details about special run method. by our default run method (60s testtime; reboot between each run; run 10 times), your "[PATCH v1 2/2] mseal: refactor mremap to remove can_modify_mm" could resolve regression partically. ========================================================================================= compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: gcc-12/performance/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/pagemove/stress-ng/60s commit: ff388fe5c4 ("mseal: wire up mseal syscall") 8be7258aad ("mseal: add mseal syscall") 2a78ece39f <-- your "[PATCH v1 2/2] mseal: refactor mremap to remove can_modify_mm" ff388fe5c481d39c 8be7258aad44b5e25977a98db13 2a78ece39f13ea6f3f9679a6c66 ---------------- --------------------------- --------------------------- %stddev %change %stddev %change %stddev \ | \ | \ 4957 +1.3% 5023 +1.0% 5008 time.percent_of_cpu_this_job_got 2915 +1.5% 2959 +1.2% 2949 time.system_time 65.96 -7.3% 61.16 -5.5% 62.30 time.user_time 41535878 -4.0% 39873501 -2.6% 40452264 proc-vmstat.numa_hit 41466104 -4.0% 39806121 -2.6% 40384854 proc-vmstat.numa_local 77297398 -4.1% 74165258 -2.6% 75286134 proc-vmstat.pgalloc_normal 77016866 -4.1% 73886027 -2.6% 75012630 proc-vmstat.pgfree 18386219 -5.0% 17474214 -2.9% 17850959 stress-ng.pagemove.ops 306421 -5.0% 291207 -2.9% 297490 stress-ng.pagemove.ops_per_sec 4957 +1.3% 5023 +1.0% 5008 stress-ng.time.percent_of_cpu_this_job_got 2915 +1.5% 2959 +1.2% 2949 stress-ng.time.system_time 3.349e+10 ± 4% +3.0% 3.447e+10 ± 2% +4.1% 3.484e+10 perf-stat.i.branch-instructions 1.13 -2.1% 1.10 -2.2% 1.10 perf-stat.i.cpi 0.89 +2.2% 0.91 +2.0% 0.91 perf-stat.i.ipc 1.04 -6.9% 0.97 -4.9% 0.99 perf-stat.overall.MPKI 1.13 -2.3% 1.10 -2.0% 1.10 perf-stat.overall.cpi 1081 +5.0% 1136 +3.0% 1114 perf-stat.overall.cycles-between-cache-misses 0.89 +2.3% 0.91 +2.0% 0.91 perf-stat.overall.ipc 3.295e+10 ± 3% +2.9% 3.392e+10 ± 2% +4.0% 3.427e+10 perf-stat.ps.branch-instructions 1.674e+11 ± 3% +1.8% 1.704e+11 ± 2% +3.3% 1.73e+11 perf-stat.ps.instructions 1.046e+13 +2.7% 1.074e+13 +1.7% 1.064e+13 perf-stat.total.instructions 75.05 -2.0 73.02 -0.9 74.18 perf-profile.calltrace.cycles-pp.move_vma.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe.mremap 36.83 -1.6 35.19 -1.2 35.62 perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap.do_syscall_64 25.02 -1.4 23.65 -0.9 24.12 perf-profile.calltrace.cycles-pp.copy_vma.move_vma.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe 19.94 -1.1 18.87 -0.8 19.19 perf-profile.calltrace.cycles-pp.__split_vma.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 14.78 -0.8 14.01 -0.5 14.28 perf-profile.calltrace.cycles-pp.vma_merge.copy_vma.move_vma.__do_sys_mremap.do_syscall_64 1.48 -0.5 0.99 -0.5 1.00 perf-profile.calltrace.cycles-pp.mas_find.do_vmi_munmap.move_vma.__do_sys_mremap.do_syscall_64 7.88 -0.4 7.47 -0.3 7.62 perf-profile.calltrace.cycles-pp.move_page_tables.move_vma.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.73 -0.4 6.37 -0.2 6.51 perf-profile.calltrace.cycles-pp.vm_area_dup.__split_vma.do_vmi_align_munmap.do_vmi_munmap.move_vma 6.16 -0.3 5.82 -0.3 5.90 perf-profile.calltrace.cycles-pp.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap.move_vma 6.12 -0.3 5.79 -0.2 5.93 perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.do_vmi_align_munmap.do_vmi_munmap 5.79 -0.3 5.48 -0.2 5.59 perf-profile.calltrace.cycles-pp.move_ptes.move_page_tables.move_vma.__do_sys_mremap.do_syscall_64 5.54 -0.3 5.25 -0.2 5.32 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap.do_vmi_munmap 5.56 -0.3 5.28 -0.2 5.36 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_merge.copy_vma.move_vma.__do_sys_mremap 5.19 -0.3 4.92 -0.2 4.98 perf-profile.calltrace.cycles-pp.mas_wr_store_entry.mas_store_prealloc.vma_complete.__split_vma.do_vmi_align_munmap 5.21 -0.3 4.95 -0.2 5.02 perf-profile.calltrace.cycles-pp.mas_wr_store_entry.mas_store_prealloc.vma_merge.copy_vma.move_vma 4.09 -0.2 3.85 -0.2 3.93 perf-profile.calltrace.cycles-pp.vm_area_dup.copy_vma.move_vma.__do_sys_mremap.do_syscall_64 4.69 -0.2 4.46 -0.2 4.51 perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_wr_store_entry.mas_store_prealloc.vma_merge.copy_vma 3.56 -0.2 3.36 -0.1 3.43 perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.vm_area_dup.copy_vma.move_vma.__do_sys_mremap 3.40 -0.2 3.22 -0.1 3.29 perf-profile.calltrace.cycles-pp.flush_tlb_mm_range.move_ptes.move_page_tables.move_vma.__do_sys_mremap 1.35 -0.2 1.16 -0.1 1.24 perf-profile.calltrace.cycles-pp.mas_find.do_vmi_munmap.do_munmap.mremap_to.__do_sys_mremap 4.00 -0.2 3.82 -0.1 3.86 perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_wr_store_entry.mas_store_prealloc.vma_complete.__split_vma 2.23 -0.2 2.05 -0.1 2.12 perf-profile.calltrace.cycles-pp.find_vma_prev.copy_vma.move_vma.__do_sys_mremap.do_syscall_64 8.26 -0.2 8.10 -0.2 8.06 perf-profile.calltrace.cycles-pp.unmap_region.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 1.97 ± 3% -0.2 1.81 ± 3% -0.1 1.88 ± 4% perf-profile.calltrace.cycles-pp.mod_objcg_state.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.vm_area_dup.__split_vma 3.11 ± 2% -0.2 2.96 -0.1 3.05 perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.do_vmi_align_munmap 0.97 -0.2 0.81 -0.1 0.87 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.do_vmi_munmap.do_munmap.mremap_to 2.27 -0.2 2.11 -0.1 2.16 perf-profile.calltrace.cycles-pp.mas_preallocate.__split_vma.do_vmi_align_munmap.do_vmi_munmap.move_vma 3.25 -0.1 3.10 -0.1 3.17 perf-profile.calltrace.cycles-pp.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 3.14 -0.1 3.00 -0.1 3.06 perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.do_vmi_align_munmap.do_vmi_munmap.move_vma 2.98 -0.1 2.85 -0.1 2.87 ± 2% perf-profile.calltrace.cycles-pp.anon_vma_clone.__split_vma.do_vmi_align_munmap.do_vmi_munmap.move_vma 1.27 ± 2% -0.1 1.15 ± 4% -0.1 1.19 ± 6% perf-profile.calltrace.cycles-pp.__memcpy.mas_wr_node_store.mas_wr_store_entry.mas_store_prealloc.vma_merge 2.45 -0.1 2.34 -0.1 2.38 perf-profile.calltrace.cycles-pp.flush_tlb_func.flush_tlb_mm_range.move_ptes.move_page_tables.move_vma 2.05 -0.1 1.94 -0.1 1.97 perf-profile.calltrace.cycles-pp.mas_preallocate.vma_merge.copy_vma.move_vma.__do_sys_mremap 2.44 -0.1 2.33 -0.1 2.38 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap.do_vmi_munmap 2.22 -0.1 2.11 -0.1 2.15 perf-profile.calltrace.cycles-pp.native_flush_tlb_one_user.flush_tlb_func.flush_tlb_mm_range.move_ptes.move_page_tables 1.76 ± 2% -0.1 1.65 ± 2% -0.1 1.66 ± 4% perf-profile.calltrace.cycles-pp.vma_prepare.vma_merge.copy_vma.move_vma.__do_sys_mremap 1.86 -0.1 1.75 -0.1 1.78 perf-profile.calltrace.cycles-pp.vma_link.copy_vma.move_vma.__do_sys_mremap.do_syscall_64 1.40 -0.1 1.30 -0.1 1.34 perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_preallocate.__split_vma.do_vmi_align_munmap.do_vmi_munmap 1.39 -0.1 1.30 -0.1 1.33 perf-profile.calltrace.cycles-pp.mas_alloc_nodes.mas_preallocate.vma_merge.copy_vma.move_vma 0.55 -0.1 0.46 ± 30% -0.0 0.52 perf-profile.calltrace.cycles-pp.mas_find.find_vma_prev.copy_vma.move_vma.__do_sys_mremap 1.25 -0.1 1.16 -0.1 1.20 perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_preallocate.__split_vma.do_vmi_align_munmap 0.94 -0.1 0.86 -0.1 0.87 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.do_vmi_munmap.move_vma.__do_sys_mremap 1.23 -0.1 1.15 -0.1 1.17 perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_preallocate.vma_merge.copy_vma 1.54 -0.1 1.47 -0.0 1.49 perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.unmap_region.do_vmi_align_munmap 0.73 -0.1 0.66 -0.0 0.69 perf-profile.calltrace.cycles-pp.mas_walk.find_vma_prev.copy_vma.move_vma.__do_sys_mremap 1.15 -0.1 1.09 -0.1 1.10 perf-profile.calltrace.cycles-pp.___slab_alloc.kmem_cache_alloc_noprof.vm_area_dup.__split_vma.do_vmi_align_munmap 0.60 ± 2% -0.1 0.54 -0.0 0.58 perf-profile.calltrace.cycles-pp.security_mmap_addr.__get_unmapped_area.mremap_to.__do_sys_mremap.do_syscall_64 1.27 -0.1 1.21 -0.0 1.24 perf-profile.calltrace.cycles-pp.mas_wr_store_entry.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.move_vma 0.80 ± 2% -0.1 0.74 ± 2% -0.0 0.76 ± 2% perf-profile.calltrace.cycles-pp.__call_rcu_common.mas_wr_node_store.mas_wr_store_entry.mas_store_prealloc.vma_merge 0.72 -0.1 0.66 -0.0 0.69 perf-profile.calltrace.cycles-pp.mas_prev.vma_merge.copy_vma.move_vma.__do_sys_mremap 0.78 -0.1 0.73 -0.0 0.75 perf-profile.calltrace.cycles-pp.___slab_alloc.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_preallocate.__split_vma 0.69 ± 2% -0.1 0.64 ± 3% -0.0 0.66 ± 4% perf-profile.calltrace.cycles-pp.mod_objcg_state.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.vm_area_dup.copy_vma 1.63 -0.1 1.58 -0.1 1.57 perf-profile.calltrace.cycles-pp.__get_unmapped_area.mremap_to.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe 1.02 -0.1 0.97 -0.0 0.98 perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.unmap_region 0.77 -0.0 0.72 -0.0 0.74 perf-profile.calltrace.cycles-pp.___slab_alloc.kmem_cache_alloc_noprof.mas_alloc_nodes.mas_preallocate.vma_merge 0.62 -0.0 0.57 -0.0 0.60 perf-profile.calltrace.cycles-pp.mas_prev_setup.mas_prev.vma_merge.copy_vma.move_vma 0.67 -0.0 0.62 -0.0 0.64 perf-profile.calltrace.cycles-pp.percpu_counter_add_batch.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 0.86 -0.0 0.81 -0.0 0.83 perf-profile.calltrace.cycles-pp.mtree_load.vma_to_resize.mremap_to.__do_sys_mremap.do_syscall_64 1.12 -0.0 1.08 -0.0 1.09 perf-profile.calltrace.cycles-pp.clear_bhb_loop.mremap 0.56 -0.0 0.51 -0.0 0.53 perf-profile.calltrace.cycles-pp.mas_walk.mas_prev_setup.mas_prev.vma_merge.copy_vma 0.68 ± 2% -0.0 0.63 -0.0 0.65 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.mremap 0.81 -0.0 0.77 -0.0 0.80 perf-profile.calltrace.cycles-pp.mtree_load.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe.mremap 1.02 -0.0 0.97 -0.0 0.98 perf-profile.calltrace.cycles-pp.vma_to_resize.mremap_to.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.95 ± 2% -0.0 0.90 ± 2% -0.0 0.93 perf-profile.calltrace.cycles-pp.__memcg_slab_free_hook.kmem_cache_free.unlink_anon_vmas.free_pgtables.unmap_region 0.98 -0.0 0.94 -0.0 0.95 perf-profile.calltrace.cycles-pp.mas_find.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 0.78 -0.0 0.74 -0.0 0.75 perf-profile.calltrace.cycles-pp.mas_store_prealloc.vma_link.copy_vma.move_vma.__do_sys_mremap 0.70 -0.0 0.66 -0.0 0.67 perf-profile.calltrace.cycles-pp.__call_rcu_common.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 0.69 -0.0 0.65 -0.0 0.66 perf-profile.calltrace.cycles-pp.___slab_alloc.kmem_cache_alloc_noprof.vm_area_dup.copy_vma.move_vma 0.69 -0.0 0.65 -0.0 0.65 perf-profile.calltrace.cycles-pp.mas_preallocate.vma_link.copy_vma.move_vma.__do_sys_mremap 0.62 -0.0 0.59 -0.0 0.60 perf-profile.calltrace.cycles-pp.mas_prev_slot.do_vmi_align_munmap.do_vmi_munmap.move_vma.__do_sys_mremap 1.16 -0.0 1.12 -0.0 1.13 perf-profile.calltrace.cycles-pp.anon_vma_clone.copy_vma.move_vma.__do_sys_mremap.do_syscall_64 0.76 ± 2% -0.0 0.72 -0.0 0.72 ± 2% perf-profile.calltrace.cycles-pp.allocate_slab.___slab_alloc.kmem_cache_alloc_noprof.vm_area_dup.__split_vma 1.01 -0.0 0.97 -0.0 0.99 perf-profile.calltrace.cycles-pp.mt_find.vma_merge.copy_vma.move_vma.__do_sys_mremap 0.60 -0.0 0.57 -0.0 0.58 perf-profile.calltrace.cycles-pp.__pte_offset_map_lock.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas 0.88 -0.0 0.85 -0.0 0.85 perf-profile.calltrace.cycles-pp.userfaultfd_unmap_complete.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe.mremap 0.62 ± 2% -0.0 0.59 ± 2% -0.0 0.60 perf-profile.calltrace.cycles-pp.get_old_pud.move_page_tables.move_vma.__do_sys_mremap.do_syscall_64 0.59 -0.0 0.56 -0.0 0.56 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.mremap 0.65 -0.0 0.62 ± 2% -0.0 0.63 perf-profile.calltrace.cycles-pp.mas_update_gap.mas_store_gfp.do_vmi_align_munmap.do_vmi_munmap.move_vma 0.81 +0.0 0.82 -0.0 0.79 perf-profile.calltrace.cycles-pp.thp_get_unmapped_area_vmflags.__get_unmapped_area.mremap_to.__do_sys_mremap.do_syscall_64 2.76 +0.0 2.78 ± 2% -0.1 2.67 perf-profile.calltrace.cycles-pp.unlink_anon_vmas.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap 3.47 +0.0 3.51 -0.1 3.37 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.do_vmi_align_munmap.do_vmi_munmap.move_vma 0.76 +0.1 0.83 +0.1 0.85 perf-profile.calltrace.cycles-pp.__madvise 0.66 +0.1 0.73 +0.1 0.75 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__madvise 0.67 +0.1 0.74 +0.1 0.76 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__madvise 0.63 +0.1 0.70 +0.1 0.72 perf-profile.calltrace.cycles-pp.__x64_sys_madvise.do_syscall_64.entry_SYSCALL_64_after_hwframe.__madvise 0.62 +0.1 0.70 +0.1 0.71 perf-profile.calltrace.cycles-pp.do_madvise.__x64_sys_madvise.do_syscall_64.entry_SYSCALL_64_after_hwframe.__madvise 0.00 +0.9 0.86 +0.9 0.92 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.can_modify_mm.do_vmi_munmap.do_munmap 0.00 +0.9 0.88 +0.0 0.00 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.can_modify_mm.mremap_to.__do_sys_mremap 83.81 +0.9 84.69 +0.6 84.44 perf-profile.calltrace.cycles-pp.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe.mremap 0.00 +0.9 0.90 ± 2% +0.9 0.91 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.can_modify_mm.do_vmi_munmap.move_vma 0.00 +1.1 1.10 +0.0 0.00 perf-profile.calltrace.cycles-pp.mas_find.can_modify_mm.mremap_to.__do_sys_mremap.do_syscall_64 0.00 +1.2 1.21 +1.3 1.28 perf-profile.calltrace.cycles-pp.mas_find.can_modify_mm.do_vmi_munmap.do_munmap.mremap_to 2.10 +1.5 3.60 +1.7 3.79 perf-profile.calltrace.cycles-pp.do_munmap.mremap_to.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.00 +1.5 1.52 +1.5 1.52 perf-profile.calltrace.cycles-pp.mas_find.can_modify_mm.do_vmi_munmap.move_vma.__do_sys_mremap 1.59 +1.5 3.12 +1.7 3.31 perf-profile.calltrace.cycles-pp.do_vmi_munmap.do_munmap.mremap_to.__do_sys_mremap.do_syscall_64 0.00 +1.6 1.61 +0.0 0.00 perf-profile.calltrace.cycles-pp.can_modify_mm.mremap_to.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe 0.00 +1.7 1.73 +1.8 1.83 perf-profile.calltrace.cycles-pp.can_modify_mm.do_vmi_munmap.do_munmap.mremap_to.__do_sys_mremap 0.00 +2.0 2.01 +2.0 2.04 perf-profile.calltrace.cycles-pp.can_modify_mm.do_vmi_munmap.move_vma.__do_sys_mremap.do_syscall_64 5.34 +3.0 8.38 +1.6 6.92 perf-profile.calltrace.cycles-pp.mremap_to.__do_sys_mremap.do_syscall_64.entry_SYSCALL_64_after_hwframe.mremap 75.22 -2.0 73.18 -0.9 74.34 perf-profile.children.cycles-pp.move_vma 37.04 -1.6 35.40 -1.2 35.83 perf-profile.children.cycles-pp.do_vmi_align_munmap 25.09 -1.4 23.72 -0.9 24.20 perf-profile.children.cycles-pp.copy_vma 20.04 -1.1 18.96 -0.8 19.28 perf-profile.children.cycles-pp.__split_vma 19.87 -1.0 18.84 -0.6 19.24 perf-profile.children.cycles-pp.rcu_core 19.85 -1.0 18.82 -0.6 19.22 perf-profile.children.cycles-pp.rcu_do_batch 19.89 -1.0 18.86 -0.6 19.26 perf-profile.children.cycles-pp.handle_softirqs 17.55 -0.9 16.67 -0.5 17.02 perf-profile.children.cycles-pp.kmem_cache_free 15.32 -0.8 14.49 -0.5 14.78 perf-profile.children.cycles-pp.kmem_cache_alloc_noprof 15.17 -0.8 14.39 -0.5 14.66 perf-profile.children.cycles-pp.vma_merge 12.12 -0.6 11.48 -0.4 11.70 perf-profile.children.cycles-pp.__slab_free 12.19 -0.6 11.56 -0.5 11.73 perf-profile.children.cycles-pp.mas_wr_store_entry 11.99 -0.6 11.36 -0.5 11.53 perf-profile.children.cycles-pp.mas_store_prealloc 10.88 -0.6 10.28 -0.4 10.50 perf-profile.children.cycles-pp.vm_area_dup 9.90 -0.5 9.41 -0.4 9.53 perf-profile.children.cycles-pp.mas_wr_node_store 8.39 -0.5 7.92 -0.3 8.13 perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook 7.99 -0.4 7.58 -0.3 7.73 perf-profile.children.cycles-pp.move_page_tables 6.70 -0.4 6.33 -0.3 6.43 perf-profile.children.cycles-pp.vma_complete 5.87 -0.3 5.55 -0.2 5.66 perf-profile.children.cycles-pp.move_ptes 5.12 -0.3 4.81 -0.2 4.90 perf-profile.children.cycles-pp.mas_preallocate 6.05 -0.3 5.74 -0.2 5.85 perf-profile.children.cycles-pp.vm_area_free_rcu_cb 2.98 -0.3 2.69 ± 4% -0.2 2.80 ± 6% perf-profile.children.cycles-pp.__memcpy 3.46 ± 2% -0.2 3.25 -0.1 3.36 ± 3% perf-profile.children.cycles-pp.mod_objcg_state 3.47 -0.2 3.26 -0.2 3.32 perf-profile.children.cycles-pp.___slab_alloc 2.44 -0.2 2.25 -0.1 2.33 perf-profile.children.cycles-pp.find_vma_prev 2.92 -0.2 2.73 -0.1 2.79 perf-profile.children.cycles-pp.mas_alloc_nodes 3.46 -0.2 3.27 -0.1 3.34 perf-profile.children.cycles-pp.flush_tlb_mm_range 3.47 -0.2 3.29 -0.2 3.32 ± 2% perf-profile.children.cycles-pp.down_write 3.33 -0.2 3.16 -0.1 3.25 perf-profile.children.cycles-pp.__memcg_slab_free_hook 4.23 -0.2 4.07 -0.1 4.08 ± 2% perf-profile.children.cycles-pp.anon_vma_clone 8.33 -0.2 8.17 -0.2 8.13 perf-profile.children.cycles-pp.unmap_region 3.35 -0.1 3.20 -0.1 3.26 perf-profile.children.cycles-pp.mas_store_gfp 2.21 -0.1 2.07 -0.1 2.10 perf-profile.children.cycles-pp.__cond_resched 3.19 -0.1 3.05 -0.1 3.11 perf-profile.children.cycles-pp.unmap_vmas 2.12 -0.1 1.99 -0.1 2.04 perf-profile.children.cycles-pp.__call_rcu_common 2.66 -0.1 2.54 -0.1 2.60 perf-profile.children.cycles-pp.mtree_load 2.24 -0.1 2.12 ± 2% -0.1 2.13 ± 3% perf-profile.children.cycles-pp.vma_prepare 2.50 -0.1 2.38 -0.1 2.42 perf-profile.children.cycles-pp.flush_tlb_func 2.04 ± 2% -0.1 1.93 -0.1 1.96 ± 2% perf-profile.children.cycles-pp.allocate_slab 2.46 -0.1 2.35 -0.1 2.41 perf-profile.children.cycles-pp.rcu_cblist_dequeue 2.48 -0.1 2.38 -0.1 2.42 perf-profile.children.cycles-pp.unmap_page_range 2.23 -0.1 2.12 -0.1 2.16 perf-profile.children.cycles-pp.native_flush_tlb_one_user 1.77 -0.1 1.67 -0.1 1.70 perf-profile.children.cycles-pp.mas_wr_walk 1.88 -0.1 1.78 -0.1 1.80 perf-profile.children.cycles-pp.vma_link 1.84 -0.1 1.75 -0.1 1.77 perf-profile.children.cycles-pp.up_write 0.97 ± 2% -0.1 0.88 -0.1 0.89 perf-profile.children.cycles-pp.rcu_all_qs 1.40 -0.1 1.32 -0.1 1.34 ± 2% perf-profile.children.cycles-pp.shuffle_freelist 1.03 -0.1 0.95 -0.0 0.99 perf-profile.children.cycles-pp.mas_prev 0.92 -0.1 0.85 -0.0 0.88 perf-profile.children.cycles-pp.mas_prev_setup 1.58 -0.1 1.51 -0.1 1.53 perf-profile.children.cycles-pp.zap_pmd_range 1.24 -0.1 1.17 -0.0 1.20 perf-profile.children.cycles-pp.mas_prev_slot 1.57 -0.1 1.49 -0.1 1.49 perf-profile.children.cycles-pp.mas_update_gap 0.62 -0.1 0.56 -0.0 0.60 perf-profile.children.cycles-pp.security_mmap_addr 0.90 -0.1 0.84 -0.0 0.86 perf-profile.children.cycles-pp.percpu_counter_add_batch 0.86 -0.1 0.80 -0.0 0.81 perf-profile.children.cycles-pp._raw_spin_lock_irqsave 0.98 -0.1 0.92 -0.0 0.95 perf-profile.children.cycles-pp.mas_pop_node 1.68 -0.1 1.62 -0.1 1.62 perf-profile.children.cycles-pp.__get_unmapped_area 1.23 -0.1 1.18 -0.0 1.20 perf-profile.children.cycles-pp.__pte_offset_map_lock 0.49 ± 2% -0.1 0.43 -0.1 0.43 ± 2% perf-profile.children.cycles-pp.setup_object 1.09 -0.1 1.03 -0.0 1.05 perf-profile.children.cycles-pp.zap_pte_range 1.07 ± 2% -0.1 1.02 ± 2% -0.1 1.00 perf-profile.children.cycles-pp.mas_leaf_max_gap 0.70 ± 2% -0.0 0.65 -0.0 0.67 perf-profile.children.cycles-pp.syscall_return_via_sysret 1.18 -0.0 1.14 -0.0 1.15 perf-profile.children.cycles-pp.clear_bhb_loop 0.51 ± 3% -0.0 0.47 -0.0 0.49 ± 3% perf-profile.children.cycles-pp.anon_vma_interval_tree_insert 1.04 -0.0 1.00 -0.0 1.01 perf-profile.children.cycles-pp.vma_to_resize 0.57 -0.0 0.53 -0.0 0.54 perf-profile.children.cycles-pp.mas_wr_end_piv 0.44 ± 2% -0.0 0.40 ± 2% -0.0 0.40 perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath 1.14 -0.0 1.10 -0.0 1.12 perf-profile.children.cycles-pp.mt_find 0.90 -0.0 0.87 -0.0 0.87 perf-profile.children.cycles-pp.userfaultfd_unmap_complete 0.62 -0.0 0.59 -0.0 0.60 perf-profile.children.cycles-pp.__put_partials 0.45 ± 6% -0.0 0.42 -0.0 0.43 perf-profile.children.cycles-pp._raw_spin_lock 0.48 -0.0 0.45 ± 2% -0.0 0.46 perf-profile.children.cycles-pp.mas_prev_range 0.61 -0.0 0.58 -0.0 0.59 perf-profile.children.cycles-pp.entry_SYSCALL_64 0.31 ± 3% -0.0 0.28 ± 3% -0.0 0.31 perf-profile.children.cycles-pp.security_vm_enough_memory_mm 0.33 ± 3% -0.0 0.30 ± 2% -0.0 0.31 ± 4% perf-profile.children.cycles-pp.mas_put_in_tree 0.32 ± 2% -0.0 0.29 ± 2% -0.0 0.30 perf-profile.children.cycles-pp.tlb_finish_mmu 0.46 -0.0 0.44 ± 2% -0.0 0.46 perf-profile.children.cycles-pp.rcu_segcblist_enqueue 0.33 -0.0 0.31 -0.0 0.32 perf-profile.children.cycles-pp.mas_destroy 0.36 -0.0 0.34 -0.0 0.34 perf-profile.children.cycles-pp.__rb_insert_augmented 0.39 -0.0 0.37 -0.0 0.38 ± 2% perf-profile.children.cycles-pp.down_write_killable 0.29 -0.0 0.27 ± 2% -0.0 0.28 perf-profile.children.cycles-pp.tlb_gather_mmu 0.26 -0.0 0.24 ± 2% -0.0 0.25 ± 2% perf-profile.children.cycles-pp.syscall_exit_to_user_mode 0.16 ± 2% -0.0 0.14 ± 3% -0.0 0.14 ± 3% perf-profile.children.cycles-pp.mas_wr_append 0.30 ± 2% -0.0 0.28 ± 2% -0.0 0.29 ± 2% perf-profile.children.cycles-pp.__vm_enough_memory 0.32 -0.0 0.30 ± 2% -0.0 0.31 perf-profile.children.cycles-pp.pte_offset_map_nolock 2.83 +0.0 2.85 ± 2% -0.1 2.74 perf-profile.children.cycles-pp.unlink_anon_vmas 0.84 +0.0 0.86 -0.0 0.81 perf-profile.children.cycles-pp.thp_get_unmapped_area_vmflags 0.08 ± 5% +0.0 0.10 ± 3% -0.0 0.08 ± 6% perf-profile.children.cycles-pp.mm_get_unmapped_area_vmflags 3.52 +0.0 3.56 -0.1 3.42 perf-profile.children.cycles-pp.free_pgtables 0.78 +0.1 0.85 +0.1 0.86 perf-profile.children.cycles-pp.__madvise 0.63 +0.1 0.70 +0.1 0.72 perf-profile.children.cycles-pp.__x64_sys_madvise 0.63 +0.1 0.70 +0.1 0.71 perf-profile.children.cycles-pp.do_madvise 0.00 +0.1 0.09 ± 3% +0.1 0.10 ± 5% perf-profile.children.cycles-pp.can_modify_mm_madv 1.31 +0.2 1.46 +0.2 1.50 perf-profile.children.cycles-pp.mas_next_slot 83.90 +0.9 84.79 +0.6 84.53 perf-profile.children.cycles-pp.__do_sys_mremap 40.45 +1.4 41.90 +2.1 42.57 perf-profile.children.cycles-pp.do_vmi_munmap 2.12 +1.5 3.62 +1.7 3.82 perf-profile.children.cycles-pp.do_munmap 3.63 +2.4 5.98 +1.7 5.29 perf-profile.children.cycles-pp.mas_walk 5.40 +3.0 8.44 +1.6 6.97 perf-profile.children.cycles-pp.mremap_to 5.26 +3.2 8.48 +2.3 7.58 perf-profile.children.cycles-pp.mas_find 0.00 +5.5 5.46 +3.9 3.93 perf-profile.children.cycles-pp.can_modify_mm 11.49 -0.6 10.89 -0.4 11.10 perf-profile.self.cycles-pp.__slab_free 4.32 -0.3 4.06 -0.2 4.16 perf-profile.self.cycles-pp.__memcg_slab_post_alloc_hook 1.96 -0.2 1.77 ± 4% -0.1 1.84 ± 6% perf-profile.self.cycles-pp.__memcpy 2.36 -0.1 2.25 ± 2% -0.1 2.25 ± 3% perf-profile.self.cycles-pp.down_write 2.42 -0.1 2.31 -0.0 2.38 perf-profile.self.cycles-pp.rcu_cblist_dequeue 2.33 -0.1 2.23 -0.1 2.28 perf-profile.self.cycles-pp.mtree_load 2.21 -0.1 2.10 -0.1 2.14 perf-profile.self.cycles-pp.native_flush_tlb_one_user 1.62 -0.1 1.54 -0.0 1.57 perf-profile.self.cycles-pp.__memcg_slab_free_hook 1.52 -0.1 1.44 -0.1 1.46 perf-profile.self.cycles-pp.mas_wr_walk 1.44 -0.1 1.36 -0.1 1.38 ± 2% perf-profile.self.cycles-pp.__call_rcu_common 1.53 -0.1 1.45 -0.0 1.48 perf-profile.self.cycles-pp.up_write 1.72 -0.1 1.65 -0.0 1.70 perf-profile.self.cycles-pp.mod_objcg_state 0.69 ± 2% -0.1 0.63 -0.1 0.63 perf-profile.self.cycles-pp.rcu_all_qs 1.14 ± 2% -0.1 1.08 -0.0 1.09 ± 2% perf-profile.self.cycles-pp.shuffle_freelist 1.18 -0.1 1.12 -0.0 1.17 perf-profile.self.cycles-pp.vma_merge 1.38 -0.1 1.33 -0.0 1.35 perf-profile.self.cycles-pp.do_vmi_align_munmap 0.51 ± 2% -0.1 0.45 -0.0 0.49 perf-profile.self.cycles-pp.security_mmap_addr 0.62 -0.1 0.56 ± 2% -0.1 0.56 perf-profile.self.cycles-pp.mremap 0.89 -0.1 0.83 -0.0 0.85 perf-profile.self.cycles-pp.___slab_alloc 0.99 -0.1 0.94 -0.0 0.96 perf-profile.self.cycles-pp.mas_prev_slot 1.00 -0.0 0.95 -0.0 0.96 perf-profile.self.cycles-pp.mas_preallocate 0.98 -0.0 0.93 -0.0 0.95 perf-profile.self.cycles-pp.move_ptes 0.85 -0.0 0.80 -0.0 0.82 perf-profile.self.cycles-pp.mas_pop_node 0.94 -0.0 0.90 -0.0 0.91 ± 2% perf-profile.self.cycles-pp.vm_area_free_rcu_cb 1.09 -0.0 1.04 -0.0 1.06 perf-profile.self.cycles-pp.__cond_resched 0.77 -0.0 0.72 -0.0 0.74 perf-profile.self.cycles-pp.percpu_counter_add_batch 0.94 ± 2% -0.0 0.89 ± 2% -0.1 0.87 perf-profile.self.cycles-pp.mas_leaf_max_gap 1.17 -0.0 1.12 -0.0 1.14 perf-profile.self.cycles-pp.clear_bhb_loop 0.68 -0.0 0.63 -0.0 0.65 perf-profile.self.cycles-pp.__split_vma 0.79 -0.0 0.75 -0.0 0.77 perf-profile.self.cycles-pp.mas_wr_store_entry 1.22 -0.0 1.18 -0.0 1.18 perf-profile.self.cycles-pp.move_vma 0.43 ± 2% -0.0 0.40 ± 2% -0.0 0.40 perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath 1.49 -0.0 1.45 +0.0 1.49 perf-profile.self.cycles-pp.kmem_cache_free 0.44 -0.0 0.40 -0.0 0.40 perf-profile.self.cycles-pp.do_munmap 0.45 -0.0 0.42 -0.0 0.43 perf-profile.self.cycles-pp.mas_wr_end_piv 0.89 -0.0 0.86 -0.0 0.88 perf-profile.self.cycles-pp.mas_store_gfp 0.78 -0.0 0.75 -0.0 0.76 perf-profile.self.cycles-pp.userfaultfd_unmap_complete 0.66 -0.0 0.62 -0.0 0.64 perf-profile.self.cycles-pp.mas_store_prealloc 0.60 -0.0 0.58 -0.0 0.59 perf-profile.self.cycles-pp.unmap_region 0.36 ± 4% -0.0 0.33 ± 3% -0.0 0.34 ± 2% perf-profile.self.cycles-pp.syscall_return_via_sysret 0.55 -0.0 0.52 -0.0 0.53 perf-profile.self.cycles-pp.get_old_pud 0.99 -0.0 0.97 -0.0 0.98 perf-profile.self.cycles-pp.mt_find 0.61 -0.0 0.58 -0.0 0.60 perf-profile.self.cycles-pp.copy_vma 0.43 ± 3% -0.0 0.40 -0.0 0.41 ± 4% perf-profile.self.cycles-pp.anon_vma_interval_tree_insert 0.49 -0.0 0.47 -0.0 0.48 perf-profile.self.cycles-pp.find_vma_prev 0.71 -0.0 0.68 -0.0 0.70 perf-profile.self.cycles-pp.unmap_page_range 0.27 -0.0 0.25 -0.0 0.26 perf-profile.self.cycles-pp.mas_prev_setup 0.47 -0.0 0.45 -0.0 0.46 ± 2% perf-profile.self.cycles-pp.flush_tlb_mm_range 0.37 ± 6% -0.0 0.35 -0.0 0.35 perf-profile.self.cycles-pp._raw_spin_lock 0.41 -0.0 0.39 -0.0 0.40 perf-profile.self.cycles-pp._raw_spin_lock_irqsave 0.40 -0.0 0.37 -0.0 0.38 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack 0.27 -0.0 0.25 ± 2% -0.0 0.25 ± 3% perf-profile.self.cycles-pp.mas_put_in_tree 0.49 -0.0 0.47 -0.0 0.49 perf-profile.self.cycles-pp.refill_obj_stock 0.48 -0.0 0.46 -0.0 0.47 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe 0.27 ± 2% -0.0 0.25 -0.0 0.26 perf-profile.self.cycles-pp.tlb_finish_mmu 0.24 ± 2% -0.0 0.22 -0.0 0.23 perf-profile.self.cycles-pp.mas_prev 0.28 -0.0 0.26 -0.0 0.27 ± 2% perf-profile.self.cycles-pp.mas_alloc_nodes 0.40 -0.0 0.39 -0.0 0.40 perf-profile.self.cycles-pp.__pte_offset_map_lock 0.14 ± 3% -0.0 0.12 ± 2% -0.0 0.13 ± 3% perf-profile.self.cycles-pp.syscall_exit_to_user_mode 0.26 -0.0 0.24 ± 2% -0.0 0.25 perf-profile.self.cycles-pp.__rb_insert_augmented 0.28 -0.0 0.26 -0.0 0.27 perf-profile.self.cycles-pp.alloc_new_pud 0.28 -0.0 0.26 -0.0 0.27 ± 2% perf-profile.self.cycles-pp.flush_tlb_func 0.20 ± 2% -0.0 0.19 -0.0 0.19 ± 2% perf-profile.self.cycles-pp.__get_unmapped_area 0.47 -0.0 0.46 -0.0 0.45 perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown_vmflags 0.06 -0.0 0.05 ± 5% -0.0 0.05 perf-profile.self.cycles-pp.vma_dup_policy 0.06 ± 6% +0.0 0.07 -0.0 0.06 ± 8% perf-profile.self.cycles-pp.mm_get_unmapped_area_vmflags 0.11 ± 4% +0.0 0.12 ± 4% +0.0 0.12 ± 4% perf-profile.self.cycles-pp.free_pgd_range 0.21 +0.0 0.22 ± 2% -0.0 0.20 ± 2% perf-profile.self.cycles-pp.thp_get_unmapped_area_vmflags 0.45 +0.0 0.48 +0.0 0.50 perf-profile.self.cycles-pp.do_vmi_munmap 0.27 +0.0 0.32 -0.0 0.26 perf-profile.self.cycles-pp.free_pgtables 0.36 ± 2% +0.1 0.44 -0.0 0.35 perf-profile.self.cycles-pp.unlink_anon_vmas 1.07 +0.1 1.19 +0.2 1.22 perf-profile.self.cycles-pp.mas_next_slot 1.49 +0.5 2.01 +0.4 1.86 perf-profile.self.cycles-pp.mas_find 0.00 +1.4 1.37 +0.9 0.93 perf-profile.self.cycles-pp.can_modify_mm 3.14 +2.1 5.23 +1.5 4.60 perf-profile.self.cycles-pp.mas_walk > > > > > > to avoid the impact of other changes, better to apply the patch upon > > 8be7258a > > directly. > > > > if you prefer other base for this patch, please let us know. then we will > > supply the results for 4 commits in fact: > > > > this patch > > the base of this patch > > 8be7258a: mseal: add mseal syscall > > ff388fe5c: mseal: wire up mseal syscall > > > > > > > > > > > > > > > Thank you for your time and assistance in helping me on understanding > > > > > this issue. > > > > > > > > due to resource constraint, please expect that we need several days to > > > > finish > > > > this test request. > > > No problem. > > > > > > Thanks for your help! > > > -Jeff > > > > > > > > > > > > > Best regards, > > > > > -Jeff > > > > > > > > > > > -Jeff > > > > > > > > > > > > > [1] > > > > > > > https://lore.kernel.org/lkml/202408041602.caa0372-oliver.s...@intel.com/ > > > > > > > [2] > > > > > > > https://github.com/peaktocreek/mmperf/blob/main/run_stress_ng.c > > > > > > > > > > > > > > > > > > > > > Jeff Xu (2): > > > > > > > mseal:selftest mremap across VMA boundaries. > > > > > > > mseal: refactor mremap to remove can_modify_mm > > > > > > > > > > > > > > mm/internal.h | 24 ++ > > > > > > > mm/mremap.c | 77 +++---- > > > > > > > mm/mseal.c | 17 -- > > > > > > > tools/testing/selftests/mm/mseal_test.c | 293 > > > > > > > +++++++++++++++++++++++- > > > > > > > 4 files changed, 353 insertions(+), 58 deletions(-) > > > > > > > > > > > > > > -- > > > > > > > 2.46.0.76.ge559c4bf1a-goog > > > > > > >