Ping... "Huang, Ying" <ying.hu...@intel.com> writes:
> Hi, Josef, > > Do you have time to take a look at the regression? > > kernel test robot <xiaolong...@intel.com> writes: > >> Greeting, >> >> FYI, we noticed a -12.3% regression of blogbench.write_score and a +9.6% >> improvement >> of blogbench.read_score due to commit: >> >> >> commit: 9092c71bb724dba2ecba849eae69e5c9d39bd3d2 ("mm: use sc->priority for >> slab shrink targets") >> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master >> >> in testcase: blogbench >> on test machine: 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G >> memory >> with following parameters: >> >> disk: 1SSD >> fs: btrfs >> cpufreq_governor: performance >> >> test-description: Blogbench is a portable filesystem benchmark that tries to >> reproduce the load of a real-world busy file server. >> test-url: https://www.pureftpd.org/project/blogbench >> >> >> >> Details are as below: >> --------------------------------------------------------------------------------------------------> >> >> >> To reproduce: >> >> git clone https://github.com/intel/lkp-tests.git >> cd lkp-tests >> bin/lkp install job.yaml # job file is attached in this email >> bin/lkp run job.yaml >> >> ========================================================================================= >> compiler/cpufreq_governor/disk/fs/kconfig/rootfs/tbox_group/testcase: >> >> gcc-7/performance/1SSD/btrfs/x86_64-rhel-7.2/debian-x86_64-2016-08-31.cgz/lkp-bdw-de1/blogbench >> >> commit: >> fcb2b0c577 ("mm: show total hugetlb memory consumption in /proc/meminfo") >> 9092c71bb7 ("mm: use sc->priority for slab shrink targets") >> >> fcb2b0c577f145c7 9092c71bb724dba2ecba849eae >> ---------------- -------------------------- >> %stddev %change %stddev >> \ | \ >> 3256 -12.3% 2854 blogbench.write_score >> 1235237 2% +9.6% 1354163 blogbench.read_score >> 28050912 -10.1% 25212230 >> blogbench.time.file_system_outputs >> 6481995 3% +25.0% 8105320 2% >> blogbench.time.involuntary_context_switches >> 906.00 +13.7% 1030 >> blogbench.time.percent_of_cpu_this_job_got >> 2552 +14.0% 2908 blogbench.time.system_time >> 173.80 +8.4% 188.32 blogbench.time.user_time >> 19353936 +3.6% 20045728 >> blogbench.time.voluntary_context_switches >> 8719514 +13.0% 9850451 softirqs.RCU >> 2.97 5% -0.7 2.30 3% mpstat.cpu.idle% >> 24.92 -6.5 18.46 mpstat.cpu.iowait% >> 0.65 2% +0.1 0.75 mpstat.cpu.soft% >> 67.76 +6.7 74.45 mpstat.cpu.sys% >> 50206 -10.7% 44858 vmstat.io.bo >> 49.25 -9.1% 44.75 2% vmstat.procs.b >> 224125 -1.8% 220135 vmstat.system.cs >> 48903 +10.7% 54134 vmstat.system.in >> 3460654 +10.8% 3834883 meminfo.Active >> 3380666 +11.0% 3752872 meminfo.Active(file) >> 1853849 -17.4% 1530415 meminfo.Inactive >> 1836507 -17.6% 1513054 meminfo.Inactive(file) >> 551311 -10.3% 494265 meminfo.SReclaimable >> 196525 -12.6% 171775 meminfo.SUnreclaim >> 747837 -10.9% 666040 meminfo.Slab >> 8.904e+08 -24.9% 6.683e+08 cpuidle.C1.time >> 22971020 -12.8% 20035820 cpuidle.C1.usage >> 2.518e+08 3% -31.7% 1.72e+08 cpuidle.C1E.time >> 821393 2% -33.3% 548003 cpuidle.C1E.usage >> 75460078 2% -23.3% 57903768 2% cpuidle.C3.time >> 136506 3% -25.3% 101956 3% cpuidle.C3.usage >> 56892498 4% -23.3% 43608427 4% cpuidle.C6.time >> 85034 3% -33.9% 56184 3% cpuidle.C6.usage >> 24373567 -24.5% 18395538 cpuidle.POLL.time >> 449033 2% -10.8% 400493 cpuidle.POLL.usage >> 1832 +9.3% 2002 turbostat.Avg_MHz >> 22967645 -12.8% 20032521 turbostat.C1 >> 18.43 -4.6 13.85 turbostat.C1% >> 821328 2% -33.3% 547948 turbostat.C1E >> 5.21 3% -1.6 3.56 turbostat.C1E% >> 136377 3% -25.3% 101823 3% turbostat.C3 >> 1.56 2% -0.4 1.20 3% turbostat.C3% >> 84404 3% -34.0% 55743 3% turbostat.C6 >> 1.17 4% -0.3 0.90 4% turbostat.C6% >> 25.93 -26.2% 19.14 turbostat.CPU%c1 >> 0.12 3% -19.1% 0.10 9% turbostat.CPU%c3 >> 14813304 +10.7% 16398388 turbostat.IRQ >> 38.19 +3.6% 39.56 turbostat.PkgWatt >> 4.51 +4.5% 4.71 turbostat.RAMWatt >> 8111200 13% -63.2% 2986242 48% >> proc-vmstat.compact_daemon_free_scanned >> 1026719 30% -81.2% 193485 30% >> proc-vmstat.compact_daemon_migrate_scanned >> 2444 21% -63.3% 897.50 20% proc-vmstat.compact_daemon_wake >> 8111200 13% -63.2% 2986242 48% proc-vmstat.compact_free_scanned >> 755491 32% -81.6% 138856 28% proc-vmstat.compact_isolated >> 1026719 30% -81.2% 193485 30% >> proc-vmstat.compact_migrate_scanned >> 137.75 34% +2.8e+06% 3801062 2% proc-vmstat.kswapd_inodesteal >> 6749 20% -53.6% 3131 12% >> proc-vmstat.kswapd_low_wmark_hit_quickly >> 844991 +11.2% 939487 proc-vmstat.nr_active_file >> 3900576 -10.5% 3490567 proc-vmstat.nr_dirtied >> 459789 -17.8% 377930 proc-vmstat.nr_inactive_file >> 137947 -10.3% 123720 proc-vmstat.nr_slab_reclaimable >> 49165 -12.6% 42989 >> proc-vmstat.nr_slab_unreclaimable >> 1382 11% -26.2% 1020 20% proc-vmstat.nr_writeback >> 3809266 -10.7% 3403350 proc-vmstat.nr_written >> 844489 +11.2% 938974 proc-vmstat.nr_zone_active_file >> 459855 -17.8% 378121 >> proc-vmstat.nr_zone_inactive_file >> 7055 18% -52.0% 3389 11% proc-vmstat.pageoutrun >> 33764911 2% +21.3% 40946445 proc-vmstat.pgactivate >> 42044161 2% +12.1% 47139065 proc-vmstat.pgdeactivate >> 92153 20% -69.1% 28514 24% proc-vmstat.pgmigrate_success >> 15212270 -10.7% 13591573 proc-vmstat.pgpgout >> 42053817 2% +12.1% 47151755 proc-vmstat.pgrefill >> 11297 107% +1025.4% 127138 21% proc-vmstat.pgscan_direct >> 19930162 -24.0% 15141439 proc-vmstat.pgscan_kswapd >> 19423629 -24.0% 14758807 proc-vmstat.pgsteal_kswapd >> 10868768 +184.8% 30950752 proc-vmstat.slabs_scanned > > The slab scan number increased a lot. > >> 3361780 3% -22.9% 2593327 3% proc-vmstat.workingset_activate >> 4994722 2% -43.2% 2835020 2% proc-vmstat.workingset_refault >> 316427 -9.3% 286844 >> slabinfo.Acpi-Namespace.active_objs >> 3123 -9.4% 2829 >> slabinfo.Acpi-Namespace.active_slabs >> 318605 -9.4% 288623 >> slabinfo.Acpi-Namespace.num_objs >> 3123 -9.4% 2829 >> slabinfo.Acpi-Namespace.num_slabs >> 220514 -40.7% 130747 >> slabinfo.btrfs_delayed_node.active_objs >> 9751 -25.3% 7283 >> slabinfo.btrfs_delayed_node.active_slabs >> 263293 -25.3% 196669 >> slabinfo.btrfs_delayed_node.num_objs >> 9751 -25.3% 7283 >> slabinfo.btrfs_delayed_node.num_slabs >> 6383 8% -12.0% 5615 2% >> slabinfo.btrfs_delayed_ref_head.num_objs >> 9496 +15.5% 10969 >> slabinfo.btrfs_extent_buffer.active_objs >> 9980 +20.5% 12022 >> slabinfo.btrfs_extent_buffer.num_objs >> 260933 -10.7% 233136 >> slabinfo.btrfs_extent_map.active_objs >> 9392 -10.6% 8396 >> slabinfo.btrfs_extent_map.active_slabs >> 263009 -10.6% 235107 >> slabinfo.btrfs_extent_map.num_objs >> 9392 -10.6% 8396 >> slabinfo.btrfs_extent_map.num_slabs >> 271938 -10.3% 243802 >> slabinfo.btrfs_inode.active_objs >> 9804 -10.6% 8768 >> slabinfo.btrfs_inode.active_slabs >> 273856 -10.4% 245359 slabinfo.btrfs_inode.num_objs >> 9804 -10.6% 8768 slabinfo.btrfs_inode.num_slabs >> 7085 5% -5.5% 6692 2% slabinfo.btrfs_path.num_objs >> 311936 -16.4% 260797 slabinfo.dentry.active_objs >> 7803 -9.6% 7058 slabinfo.dentry.active_slabs >> 327759 -9.6% 296439 slabinfo.dentry.num_objs >> 7803 -9.6% 7058 slabinfo.dentry.num_slabs >> 2289 -23.3% 1755 6% >> slabinfo.proc_inode_cache.active_objs >> 2292 -19.0% 1856 6% >> slabinfo.proc_inode_cache.num_objs >> 261546 -12.3% 229485 >> slabinfo.radix_tree_node.active_objs >> 9404 -11.9% 8288 >> slabinfo.radix_tree_node.active_slabs >> 263347 -11.9% 232089 >> slabinfo.radix_tree_node.num_objs >> 9404 -11.9% 8288 >> slabinfo.radix_tree_node.num_slabs > > The slab size decreased with the new commit. > > From perf-profile result, > > 26.81 ± 2% -6.5 20.35 ± 2% > perf-profile.calltrace.cycles-pp.secondary_startup_64 > 24.48 ± 2% -5.8 18.73 > perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64 > 24.48 ± 2% -5.8 18.73 > perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64 > 24.48 ± 2% -5.8 18.73 > perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64 > 22.80 ± 2% -5.5 17.30 ± 2% > perf-profile.calltrace.cycles-pp.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64 > 20.20 ± 2% -4.3 15.85 > perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary > 23.03 ± 2% -2.6 20.42 > perf-profile.calltrace.cycles-pp.sys_rename.entry_SYSCALL_64_fastpath > 17.02 ± 2% -1.8 15.17 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_rename.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath > 17.03 ± 2% -1.8 15.19 ± 2% > perf-profile.calltrace.cycles-pp.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath > 13.70 ± 2% -1.2 12.47 ± 3% > perf-profile.calltrace.cycles-pp.__btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath > 5.06 ± 3% -1.1 3.96 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_async_run_delayed_root.normal_work_helper.process_one_work.worker_thread.kthread > 7.37 ± 4% -0.9 6.49 ± 2% > perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot > 7.32 ± 4% -0.9 6.45 ± 2% > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.finish_wait.btrfs_tree_lock.btrfs_lock_root_node > 1.54 ± 4% -0.7 0.81 ± 7% > perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary > 2.33 ± 2% -0.7 1.62 ± 4% > perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_kernel.secondary_startup_64 > 2.33 ± 2% -0.7 1.62 ± 4% > perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_kernel.secondary_startup_64 > 2.33 ± 2% -0.7 1.62 ± 4% > perf-profile.calltrace.cycles-pp.start_kernel.secondary_startup_64 > 2.23 ± 3% -0.7 1.53 ± 4% > perf-profile.calltrace.cycles-pp.cpuidle_enter_state.do_idle.cpu_startup_entry.start_kernel.secondary_startup_64 > 2.13 ± 2% -0.7 1.46 ± 4% > perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_kernel > 5.59 -0.7 4.94 ± 3% > perf-profile.calltrace.cycles-pp.__dentry_kill.dput.sys_rename.entry_SYSCALL_64_fastpath > 5.60 -0.7 4.94 ± 3% > perf-profile.calltrace.cycles-pp.dput.sys_rename.entry_SYSCALL_64_fastpath > 6.96 -0.7 6.31 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename > 5.59 -0.7 4.94 ± 3% > perf-profile.calltrace.cycles-pp.evict.__dentry_kill.dput.sys_rename.entry_SYSCALL_64_fastpath > 5.58 -0.6 4.94 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_evict_inode.evict.__dentry_kill.dput.sys_rename > 6.94 ± 2% -0.6 6.30 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_rename.vfs_rename > 6.66 ± 4% -0.6 6.10 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_rename.vfs_rename > 6.66 ± 4% -0.6 6.10 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename > 3.38 ± 3% -0.5 2.84 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_delete_delayed_items.btrfs_async_run_delayed_root.normal_work_helper.process_one_work > 3.40 ± 3% -0.5 2.86 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_delete_delayed_items.btrfs_async_run_delayed_root.normal_work_helper.process_one_work.worker_thread > 7.30 ± 4% -0.4 6.86 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode > 1.18 ± 4% -0.4 0.76 ± 2% > perf-profile.calltrace.cycles-pp.__btrfs_update_delayed_inode.btrfs_async_run_delayed_root.normal_work_helper.process_one_work.worker_thread > 1.16 ± 4% -0.4 0.74 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_async_run_delayed_root.normal_work_helper.process_one_work > 5.96 ± 2% -0.4 5.54 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode > 5.96 ± 2% -0.4 5.54 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_rename > 1.16 ± 4% -0.4 0.74 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_async_run_delayed_root.normal_work_helper > 2.95 ± 3% -0.4 2.54 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items.btrfs_async_run_delayed_root > 2.95 ± 3% -0.4 2.54 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items.btrfs_async_run_delayed_root.normal_work_helper > 0.94 ± 2% -0.3 0.59 ± 4% > perf-profile.calltrace.cycles-pp.shrink_inactive_list.shrink_node_memcg.shrink_node.kswapd.kthread > 1.25 -0.3 0.91 ± 2% > perf-profile.calltrace.cycles-pp.shrink_node_memcg.shrink_node.kswapd.kthread.ret_from_fork > 0.84 ± 2% -0.3 0.52 ± 3% > perf-profile.calltrace.cycles-pp.shrink_page_list.shrink_inactive_list.shrink_node_memcg.shrink_node.kswapd > 1.01 -0.3 0.71 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_create.path_openat.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath > 0.58 ± 4% -0.2 0.34 ± 70% > perf-profile.calltrace.cycles-pp.queued_write_lock_slowpath.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items > 1.73 ± 2% -0.2 1.50 ± 2% > perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref > 1.67 ± 4% -0.2 1.44 ± 3% > perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item > 1.89 -0.2 1.67 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_commit_inode_delayed_inode.btrfs_evict_inode.evict.__dentry_kill.dput > 1.88 -0.2 1.66 ± 2% > perf-profile.calltrace.cycles-pp.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode.btrfs_evict_inode.evict.__dentry_kill > 1.90 -0.2 1.68 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_truncate_inode_items.btrfs_evict_inode.evict.__dentry_kill.dput > 1.78 -0.2 1.57 ± 5% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_del_orphan_item.btrfs_orphan_del.btrfs_evict_inode.evict > 1.88 -0.2 1.67 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_truncate_inode_items.btrfs_evict_inode.evict.__dentry_kill > 1.78 -0.2 1.57 ± 5% > perf-profile.calltrace.cycles-pp.btrfs_orphan_del.btrfs_evict_inode.evict.__dentry_kill.dput > 1.78 -0.2 1.57 ± 5% > perf-profile.calltrace.cycles-pp.btrfs_del_orphan_item.btrfs_orphan_del.btrfs_evict_inode.evict.__dentry_kill > 1.87 -0.2 1.66 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode.btrfs_evict_inode > 1.87 -0.2 1.66 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode.btrfs_evict_inode.evict > 0.75 ± 11% -0.2 0.57 ± 7% > perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.__btrfs_drop_extents > 0.75 ± 11% -0.2 0.57 ± 7% > perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.__btrfs_drop_extents.insert_reserved_file_extent > 1.39 ± 3% -0.2 1.23 ± 4% > perf-profile.calltrace.cycles-pp.prepare_to_wait_event.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items > 1.84 -0.2 1.68 ± 3% > perf-profile.calltrace.cycles-pp.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename > 1.84 -0.2 1.68 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath > 1.62 -0.2 1.46 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_truncate_inode_items.btrfs_evict_inode.evict > 0.87 ± 5% -0.2 0.72 ± 5% > perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items > 1.81 -0.2 1.66 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_rename > 1.81 -0.2 1.66 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_rename.vfs_rename > 1.62 -0.2 1.46 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_truncate_inode_items.btrfs_evict_inode > 1.69 ± 2% -0.1 1.55 ± 2% > perf-profile.calltrace.cycles-pp.end_bio_extent_readpage.normal_work_helper.process_one_work.worker_thread.kthread > 1.58 -0.1 1.44 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode > 1.58 -0.1 1.44 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode > 2.89 ± 3% -0.1 2.77 ± 2% > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath.queued_write_lock_slowpath.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot > 1.51 -0.1 1.39 ± 5% > perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_orphan_item.btrfs_orphan_del > 1.51 -0.1 1.39 ± 5% > perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_orphan_item.btrfs_orphan_del.btrfs_evict_inode > 0.94 -0.1 0.82 > perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64 > 0.93 -0.1 0.81 > perf-profile.calltrace.cycles-pp.__schedule.schedule_idle.do_idle.cpu_startup_entry.start_secondary > 1.52 -0.1 1.44 ± 3% > perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode > 0.71 ± 3% -0.1 0.66 ± 2% > perf-profile.calltrace.cycles-pp.prepare_to_wait_event.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_truncate_inode_items > 0.60 ± 2% -0.0 0.56 ± 2% > perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common.wake_up_page_bit > 1.61 ± 2% +0.1 1.67 ± 2% > perf-profile.calltrace.cycles-pp.kswapd.kthread.ret_from_fork > 1.61 ± 2% +0.1 1.67 ± 2% > perf-profile.calltrace.cycles-pp.shrink_node.kswapd.kthread.ret_from_fork > 0.55 +0.1 0.68 ± 4% > perf-profile.calltrace.cycles-pp.find_get_entry.pagecache_get_page.generic_file_read_iter.__vfs_read.vfs_read > 0.57 +0.1 0.70 ± 4% > perf-profile.calltrace.cycles-pp.pagecache_get_page.generic_file_read_iter.__vfs_read.vfs_read.sys_read > 0.59 ± 3% +0.3 0.87 > perf-profile.calltrace.cycles-pp.__account_scheduler_latency.enqueue_entity.enqueue_task_fair.ttwu_do_activate.try_to_wake_up > 1.29 ± 3% +0.3 1.58 > perf-profile.calltrace.cycles-pp.enqueue_task_fair.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common > 0.54 +0.3 0.83 ± 2% > perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent > 0.84 +0.3 1.18 ± 3% > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.prepare_to_wait_event.btrfs_tree_read_lock.btrfs_read_lock_root_node > 0.71 ± 4% +0.3 1.05 > perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock > 0.86 +0.4 1.21 ± 3% > perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.prepare_to_wait_event.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot > 0.90 +0.4 1.28 ± 3% > perf-profile.calltrace.cycles-pp.prepare_to_wait_event.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item > 0.35 ± 71% +0.4 0.79 ± 3% > perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_inode > 0.98 ± 4% +0.4 1.41 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget > 0.98 ± 5% +0.4 1.43 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry > 0.94 ± 3% +0.4 1.39 > perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.btrfs_clear_path_blocking > 0.96 ± 3% +0.5 1.42 > perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot > 1.05 ± 2% +0.5 1.52 ± 4% > perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent > 1.06 ± 3% +0.5 1.53 ± 4% > perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage > 0.97 ± 3% +0.5 1.45 > perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item > 0.62 ± 4% +0.5 1.13 > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath.queued_read_lock_slowpath.btrfs_clear_lock_blocking_rw.btrfs_clear_path_blocking.btrfs_search_slot > 0.65 ± 4% +0.5 1.19 > perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_clear_lock_blocking_rw.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item > 0.17 ±141% +0.6 0.73 > perf-profile.calltrace.cycles-pp.save_stack_trace_tsk.__account_scheduler_latency.enqueue_entity.enqueue_task_fair.ttwu_do_activate > 0.00 +0.6 0.57 ± 2% > perf-profile.calltrace.cycles-pp.task_work_run.exit_to_usermode_loop.syscall_return_slowpath.entry_SYSCALL_64_fastpath > 0.00 +0.6 0.59 ± 2% > perf-profile.calltrace.cycles-pp.exit_to_usermode_loop.syscall_return_slowpath.entry_SYSCALL_64_fastpath > 0.00 +0.6 0.60 ± 2% > perf-profile.calltrace.cycles-pp.syscall_return_slowpath.entry_SYSCALL_64_fastpath > 0.56 ± 4% +0.6 1.17 > perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_file_extent > 1.17 +0.6 1.78 ± 2% > perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.finish_wait.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot > 1.16 +0.6 1.77 ± 2% > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.finish_wait.btrfs_tree_read_lock.btrfs_read_lock_root_node > 1.18 +0.6 1.80 ± 2% > perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item > 0.57 ± 7% +0.6 1.21 ± 2% > perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_inode > 0.75 ± 4% +0.6 1.38 > perf-profile.calltrace.cycles-pp.btrfs_clear_lock_blocking_rw.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry > 1.95 +0.7 2.61 > perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string.copyout.copy_page_to_iter.generic_file_read_iter.__vfs_read > 1.96 +0.7 2.62 > perf-profile.calltrace.cycles-pp.copyout.copy_page_to_iter.generic_file_read_iter.__vfs_read.vfs_read > 0.00 +0.7 0.70 > perf-profile.calltrace.cycles-pp.__save_stack_trace.save_stack_trace_tsk.__account_scheduler_latency.enqueue_entity.enqueue_task_fair > 2.10 +0.7 2.80 > perf-profile.calltrace.cycles-pp.copy_page_to_iter.generic_file_read_iter.__vfs_read.vfs_read.sys_read > 0.85 ± 5% +0.7 1.58 > perf-profile.calltrace.cycles-pp.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent > 0.84 ± 6% +0.7 1.57 > perf-profile.calltrace.cycles-pp.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget > 0.00 +0.8 0.75 ± 4% > perf-profile.calltrace.cycles-pp.super_cache_scan.shrink_slab.shrink_node.kswapd.kthread > 0.00 +0.8 0.75 ± 5% > perf-profile.calltrace.cycles-pp.shrink_slab.shrink_node.kswapd.kthread.ret_from_fork > 1.07 ± 5% +0.9 1.98 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage > 1.09 ± 7% +0.9 2.01 ± 2% > perf-profile.calltrace.cycles-pp.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry > 4.90 ± 2% +1.3 6.19 > perf-profile.calltrace.cycles-pp.__do_page_cache_readahead.ondemand_readahead.generic_file_read_iter.__vfs_read.vfs_read > 4.90 ± 2% +1.3 6.20 > perf-profile.calltrace.cycles-pp.ondemand_readahead.generic_file_read_iter.__vfs_read.vfs_read.sys_read > 4.44 ± 2% +1.3 5.74 > perf-profile.calltrace.cycles-pp.extent_readpages.__do_page_cache_readahead.ondemand_readahead.generic_file_read_iter.__vfs_read > 2.87 ± 3% +1.4 4.29 > perf-profile.calltrace.cycles-pp.__extent_readpages.extent_readpages.__do_page_cache_readahead.ondemand_readahead.generic_file_read_iter > 2.22 ± 4% +1.4 3.65 > perf-profile.calltrace.cycles-pp.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage.__extent_readpages.extent_readpages > 2.22 ± 4% +1.4 3.65 > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage.__extent_readpages > 2.27 ± 6% +1.4 3.72 > perf-profile.calltrace.cycles-pp.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry.btrfs_lookup.path_openat > 2.27 ± 6% +1.4 3.72 > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry.btrfs_lookup > 2.72 ± 3% +1.4 4.17 > perf-profile.calltrace.cycles-pp.__do_readpage.__extent_readpages.extent_readpages.__do_page_cache_readahead.ondemand_readahead > 2.33 ± 4% +1.5 3.79 > perf-profile.calltrace.cycles-pp.btrfs_get_extent.__do_readpage.__extent_readpages.extent_readpages.__do_page_cache_readahead > 2.38 +1.6 3.95 > perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item > 2.63 ± 6% +1.7 4.33 > perf-profile.calltrace.cycles-pp.btrfs_iget.btrfs_lookup_dentry.btrfs_lookup.path_openat.do_filp_open > 1.68 ± 4% +2.1 3.79 > perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item > 8.66 +2.2 10.90 > perf-profile.calltrace.cycles-pp.generic_file_read_iter.__vfs_read.vfs_read.sys_read.entry_SYSCALL_64_fastpath > 8.71 +2.3 10.97 > perf-profile.calltrace.cycles-pp.__vfs_read.vfs_read.sys_read.entry_SYSCALL_64_fastpath > 8.85 +2.3 11.15 > perf-profile.calltrace.cycles-pp.vfs_read.sys_read.entry_SYSCALL_64_fastpath > 9.02 +2.3 11.33 > perf-profile.calltrace.cycles-pp.sys_read.entry_SYSCALL_64_fastpath > 2.71 ± 3% +2.6 5.35 > perf-profile.calltrace.cycles-pp.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry > 4.63 +2.7 7.29 > perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry > 4.65 +2.7 7.32 > perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup > 1.94 ± 2% +2.8 4.77 > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot > 2.81 ± 4% +3.3 6.15 > perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot > 3.58 ± 2% +3.4 6.98 > perf-profile.calltrace.cycles-pp.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup > 8.69 +6.3 15.03 > perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup.path_openat > 8.75 +6.4 15.12 > perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup.path_openat.do_filp_open > 47.34 +8.1 55.46 > perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_fastpath > 11.47 +8.1 19.60 > perf-profile.calltrace.cycles-pp.btrfs_lookup.path_openat.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath > 11.46 +8.1 19.59 > perf-profile.calltrace.cycles-pp.btrfs_lookup_dentry.btrfs_lookup.path_openat.do_filp_open.do_sys_open > 13.90 +8.3 22.20 > perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath > 13.91 +8.3 22.22 > perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath > 14.13 +8.4 22.52 > perf-profile.calltrace.cycles-pp.do_sys_open.entry_SYSCALL_64_fastpath > > The cycles for dentry lookup increased much too. Is this the reason why > write score decreased? > > If you need more information, please let me know. > > Best Regards, > Huang, Ying > >> 1140424 12% +40.2% 1598980 14% >> sched_debug.cfs_rq:/.MIN_vruntime.max >> 790.55 +13.0% 893.20 3% >> sched_debug.cfs_rq:/.exec_clock.stddev >> 1140425 12% +40.2% 1598982 14% >> sched_debug.cfs_rq:/.max_vruntime.max >> 0.83 10% +21.5% 1.00 8% >> sched_debug.cfs_rq:/.nr_running.avg >> 3.30 99% +266.3% 12.09 13% >> sched_debug.cfs_rq:/.removed.load_avg.avg >> 153.02 97% +266.6% 560.96 13% >> sched_debug.cfs_rq:/.removed.runnable_sum.avg >> 569.93 102% +173.2% 1556 14% >> sched_debug.cfs_rq:/.removed.runnable_sum.stddev >> 1.42 60% +501.5% 8.52 34% >> sched_debug.cfs_rq:/.removed.util_avg.avg >> 19.88 59% +288.9% 77.29 16% >> sched_debug.cfs_rq:/.removed.util_avg.max >> 5.05 58% +342.3% 22.32 22% >> sched_debug.cfs_rq:/.removed.util_avg.stddev >> 791.44 3% +47.7% 1168 8% sched_debug.cfs_rq:/.util_avg.avg >> 1305 6% +33.2% 1738 5% sched_debug.cfs_rq:/.util_avg.max >> 450.25 11% +66.2% 748.17 14% sched_debug.cfs_rq:/.util_avg.min >> 220.82 8% +21.1% 267.46 5% >> sched_debug.cfs_rq:/.util_avg.stddev >> 363118 11% -23.8% 276520 11% sched_debug.cpu.avg_idle.avg >> 726003 8% -30.8% 502313 4% sched_debug.cpu.avg_idle.max >> 202629 3% -32.2% 137429 18% sched_debug.cpu.avg_idle.stddev >> 31.96 28% +54.6% 49.42 14% sched_debug.cpu.cpu_load[3].min >> 36.21 25% +64.0% 59.38 6% sched_debug.cpu.cpu_load[4].min >> 1007 5% +20.7% 1216 7% sched_debug.cpu.curr->pid.avg >> 4.50 5% +14.8% 5.17 5% sched_debug.cpu.nr_running.max >> 2476195 -11.8% 2185022 sched_debug.cpu.nr_switches.max >> 212888 -26.6% 156172 3% >> sched_debug.cpu.nr_switches.stddev >> 3570 2% -58.7% 1474 2% >> sched_debug.cpu.nr_uninterruptible.max >> -803.67 -28.7% -573.38 >> sched_debug.cpu.nr_uninterruptible.min >> 1004 2% -50.4% 498.55 3% >> sched_debug.cpu.nr_uninterruptible.stddev >> 2478809 -11.7% 2189310 sched_debug.cpu.sched_count.max >> 214130 -26.5% 157298 3% >> sched_debug.cpu.sched_count.stddev >> 489430 2% -16.6% 408309 2% sched_debug.cpu.sched_goidle.avg >> 724333 2% -28.2% 520263 2% sched_debug.cpu.sched_goidle.max >> 457611 -18.1% 374746 3% sched_debug.cpu.sched_goidle.min >> 62957 2% -47.4% 33138 3% >> sched_debug.cpu.sched_goidle.stddev >> 676053 2% -15.4% 571816 2% sched_debug.cpu.ttwu_local.max >> 42669 3% +22.3% 52198 sched_debug.cpu.ttwu_local.min >> 151873 2% -18.3% 124118 2% sched_debug.cpu.ttwu_local.stddev >> >> >> >> >> blogbench.write_score >> >> >> >> 3300 >> +-+------------------------------------------------------------------+ >> 3250 +-+ +. .+ +. .+ : : : +. .+ .+.+.+. >> .| >> |: +. .+ +.+.+.+ + + + : +. : : +. + +.+ + + >> | >> 3200 +-+ + +.+ + : + + : + + >> | >> 3150 +-+.+ ++ +.+ >> | >> 3100 +-+ >> | >> 3050 +-+ >> | >> | >> | >> 3000 +-+ >> | >> 2950 +-+ O O >> | >> 2900 +-O O O O >> | >> 2850 +-+ O O O O O O O OO O O O >> | >> | O O O O >> | >> 2800 O-+ O O >> | >> 2750 >> +-+------------------------------------------------------------------+ >> >> >> >> >> >> [*] bisect-good sample >> [O] bisect-bad sample >> >> >> >> Disclaimer: >> Results have been estimated based on internal Intel analysis and are provided >> for informational purposes only. Any difference in system hardware or >> software >> design or configuration may affect actual performance. >> >> >> Thanks, >> Xiaolong > _______________________________________________ > LKP mailing list > l...@lists.01.org > https://lists.01.org/mailman/listinfo/lkp -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html