Ping...

"Huang, Ying" <ying.hu...@intel.com> writes:

> Hi, Josef,
>
> Do you have time to take a look at the regression?
>
> kernel test robot <xiaolong...@intel.com> writes:
>
>> Greeting,
>>
>> FYI, we noticed a -12.3% regression of blogbench.write_score and a +9.6% 
>> improvement
>> of blogbench.read_score due to commit:
>>
>>
>> commit: 9092c71bb724dba2ecba849eae69e5c9d39bd3d2 ("mm: use sc->priority for 
>> slab shrink targets")
>> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
>>
>> in testcase: blogbench
>> on test machine: 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G 
>> memory
>> with following parameters:
>>
>>      disk: 1SSD
>>      fs: btrfs
>>      cpufreq_governor: performance
>>
>> test-description: Blogbench is a portable filesystem benchmark that tries to 
>> reproduce the load of a real-world busy file server.
>> test-url: https://www.pureftpd.org/project/blogbench
>>
>>
>>
>> Details are as below:
>> -------------------------------------------------------------------------------------------------->
>>
>>
>> To reproduce:
>>
>>         git clone https://github.com/intel/lkp-tests.git
>>         cd lkp-tests
>>         bin/lkp install job.yaml  # job file is attached in this email
>>         bin/lkp run     job.yaml
>>
>> =========================================================================================
>> compiler/cpufreq_governor/disk/fs/kconfig/rootfs/tbox_group/testcase:
>>   
>> gcc-7/performance/1SSD/btrfs/x86_64-rhel-7.2/debian-x86_64-2016-08-31.cgz/lkp-bdw-de1/blogbench
>>
>> commit: 
>>   fcb2b0c577 ("mm: show total hugetlb memory consumption in /proc/meminfo")
>>   9092c71bb7 ("mm: use sc->priority for slab shrink targets")
>>
>> fcb2b0c577f145c7 9092c71bb724dba2ecba849eae 
>> ---------------- -------------------------- 
>>          %stddev     %change         %stddev
>>              \          |                \  
>>       3256           -12.3%       2854        blogbench.write_score
>>    1235237   2%      +9.6%    1354163        blogbench.read_score
>>   28050912           -10.1%   25212230        
>> blogbench.time.file_system_outputs
>>    6481995   3%     +25.0%    8105320   2%  
>> blogbench.time.involuntary_context_switches
>>     906.00           +13.7%       1030        
>> blogbench.time.percent_of_cpu_this_job_got
>>       2552           +14.0%       2908        blogbench.time.system_time
>>     173.80            +8.4%     188.32        blogbench.time.user_time
>>   19353936            +3.6%   20045728        
>> blogbench.time.voluntary_context_switches
>>    8719514           +13.0%    9850451        softirqs.RCU
>>       2.97   5%      -0.7        2.30   3%  mpstat.cpu.idle%
>>      24.92            -6.5       18.46        mpstat.cpu.iowait%
>>       0.65   2%      +0.1        0.75        mpstat.cpu.soft%
>>      67.76            +6.7       74.45        mpstat.cpu.sys%
>>      50206           -10.7%      44858        vmstat.io.bo
>>      49.25            -9.1%      44.75   2%  vmstat.procs.b
>>     224125            -1.8%     220135        vmstat.system.cs
>>      48903           +10.7%      54134        vmstat.system.in
>>    3460654           +10.8%    3834883        meminfo.Active
>>    3380666           +11.0%    3752872        meminfo.Active(file)
>>    1853849           -17.4%    1530415        meminfo.Inactive
>>    1836507           -17.6%    1513054        meminfo.Inactive(file)
>>     551311           -10.3%     494265        meminfo.SReclaimable
>>     196525           -12.6%     171775        meminfo.SUnreclaim
>>     747837           -10.9%     666040        meminfo.Slab
>>  8.904e+08           -24.9%  6.683e+08        cpuidle.C1.time
>>   22971020           -12.8%   20035820        cpuidle.C1.usage
>>  2.518e+08   3%     -31.7%   1.72e+08        cpuidle.C1E.time
>>     821393   2%     -33.3%     548003        cpuidle.C1E.usage
>>   75460078   2%     -23.3%   57903768   2%  cpuidle.C3.time
>>     136506   3%     -25.3%     101956   3%  cpuidle.C3.usage
>>   56892498   4%     -23.3%   43608427   4%  cpuidle.C6.time
>>      85034   3%     -33.9%      56184   3%  cpuidle.C6.usage
>>   24373567           -24.5%   18395538        cpuidle.POLL.time
>>     449033   2%     -10.8%     400493        cpuidle.POLL.usage
>>       1832            +9.3%       2002        turbostat.Avg_MHz
>>   22967645           -12.8%   20032521        turbostat.C1
>>      18.43            -4.6       13.85        turbostat.C1%
>>     821328   2%     -33.3%     547948        turbostat.C1E
>>       5.21   3%      -1.6        3.56        turbostat.C1E%
>>     136377   3%     -25.3%     101823   3%  turbostat.C3
>>       1.56   2%      -0.4        1.20   3%  turbostat.C3%
>>      84404   3%     -34.0%      55743   3%  turbostat.C6
>>       1.17   4%      -0.3        0.90   4%  turbostat.C6%
>>      25.93           -26.2%      19.14        turbostat.CPU%c1
>>       0.12   3%     -19.1%       0.10   9%  turbostat.CPU%c3
>>   14813304           +10.7%   16398388        turbostat.IRQ
>>      38.19            +3.6%      39.56        turbostat.PkgWatt
>>       4.51            +4.5%       4.71        turbostat.RAMWatt
>>    8111200  13%     -63.2%    2986242  48%  
>> proc-vmstat.compact_daemon_free_scanned
>>    1026719  30%     -81.2%     193485  30%  
>> proc-vmstat.compact_daemon_migrate_scanned
>>       2444  21%     -63.3%     897.50  20%  proc-vmstat.compact_daemon_wake
>>    8111200  13%     -63.2%    2986242  48%  proc-vmstat.compact_free_scanned
>>     755491  32%     -81.6%     138856  28%  proc-vmstat.compact_isolated
>>    1026719  30%     -81.2%     193485  30%  
>> proc-vmstat.compact_migrate_scanned
>>     137.75  34%  +2.8e+06%    3801062   2%  proc-vmstat.kswapd_inodesteal
>>       6749  20%     -53.6%       3131  12%  
>> proc-vmstat.kswapd_low_wmark_hit_quickly
>>     844991           +11.2%     939487        proc-vmstat.nr_active_file
>>    3900576           -10.5%    3490567        proc-vmstat.nr_dirtied
>>     459789           -17.8%     377930        proc-vmstat.nr_inactive_file
>>     137947           -10.3%     123720        proc-vmstat.nr_slab_reclaimable
>>      49165           -12.6%      42989        
>> proc-vmstat.nr_slab_unreclaimable
>>       1382  11%     -26.2%       1020  20%  proc-vmstat.nr_writeback
>>    3809266           -10.7%    3403350        proc-vmstat.nr_written
>>     844489           +11.2%     938974        proc-vmstat.nr_zone_active_file
>>     459855           -17.8%     378121        
>> proc-vmstat.nr_zone_inactive_file
>>       7055  18%     -52.0%       3389  11%  proc-vmstat.pageoutrun
>>   33764911   2%     +21.3%   40946445        proc-vmstat.pgactivate
>>   42044161   2%     +12.1%   47139065        proc-vmstat.pgdeactivate
>>      92153  20%     -69.1%      28514  24%  proc-vmstat.pgmigrate_success
>>   15212270           -10.7%   13591573        proc-vmstat.pgpgout
>>   42053817   2%     +12.1%   47151755        proc-vmstat.pgrefill
>>      11297 107%   +1025.4%     127138  21%  proc-vmstat.pgscan_direct
>>   19930162           -24.0%   15141439        proc-vmstat.pgscan_kswapd
>>   19423629           -24.0%   14758807        proc-vmstat.pgsteal_kswapd
>>   10868768          +184.8%   30950752        proc-vmstat.slabs_scanned
>
> The slab scan number increased a lot.
>
>>    3361780   3%     -22.9%    2593327   3%  proc-vmstat.workingset_activate
>>    4994722   2%     -43.2%    2835020   2%  proc-vmstat.workingset_refault
>>     316427            -9.3%     286844        
>> slabinfo.Acpi-Namespace.active_objs
>>       3123            -9.4%       2829        
>> slabinfo.Acpi-Namespace.active_slabs
>>     318605            -9.4%     288623        
>> slabinfo.Acpi-Namespace.num_objs
>>       3123            -9.4%       2829        
>> slabinfo.Acpi-Namespace.num_slabs
>>     220514           -40.7%     130747        
>> slabinfo.btrfs_delayed_node.active_objs
>>       9751           -25.3%       7283        
>> slabinfo.btrfs_delayed_node.active_slabs
>>     263293           -25.3%     196669        
>> slabinfo.btrfs_delayed_node.num_objs
>>       9751           -25.3%       7283        
>> slabinfo.btrfs_delayed_node.num_slabs
>>       6383   8%     -12.0%       5615   2%  
>> slabinfo.btrfs_delayed_ref_head.num_objs
>>       9496           +15.5%      10969        
>> slabinfo.btrfs_extent_buffer.active_objs
>>       9980           +20.5%      12022        
>> slabinfo.btrfs_extent_buffer.num_objs
>>     260933           -10.7%     233136        
>> slabinfo.btrfs_extent_map.active_objs
>>       9392           -10.6%       8396        
>> slabinfo.btrfs_extent_map.active_slabs
>>     263009           -10.6%     235107        
>> slabinfo.btrfs_extent_map.num_objs
>>       9392           -10.6%       8396        
>> slabinfo.btrfs_extent_map.num_slabs
>>     271938           -10.3%     243802        
>> slabinfo.btrfs_inode.active_objs
>>       9804           -10.6%       8768        
>> slabinfo.btrfs_inode.active_slabs
>>     273856           -10.4%     245359        slabinfo.btrfs_inode.num_objs
>>       9804           -10.6%       8768        slabinfo.btrfs_inode.num_slabs
>>       7085   5%      -5.5%       6692   2%  slabinfo.btrfs_path.num_objs
>>     311936           -16.4%     260797        slabinfo.dentry.active_objs
>>       7803            -9.6%       7058        slabinfo.dentry.active_slabs
>>     327759            -9.6%     296439        slabinfo.dentry.num_objs
>>       7803            -9.6%       7058        slabinfo.dentry.num_slabs
>>       2289           -23.3%       1755   6%  
>> slabinfo.proc_inode_cache.active_objs
>>       2292           -19.0%       1856   6%  
>> slabinfo.proc_inode_cache.num_objs
>>     261546           -12.3%     229485        
>> slabinfo.radix_tree_node.active_objs
>>       9404           -11.9%       8288        
>> slabinfo.radix_tree_node.active_slabs
>>     263347           -11.9%     232089        
>> slabinfo.radix_tree_node.num_objs
>>       9404           -11.9%       8288        
>> slabinfo.radix_tree_node.num_slabs
>
> The slab size decreased with the new commit.
>
> From perf-profile result,
>
>      26.81 ±  2%      -6.5       20.35 ±  2%  
> perf-profile.calltrace.cycles-pp.secondary_startup_64
>      24.48 ±  2%      -5.8       18.73        
> perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
>      24.48 ±  2%      -5.8       18.73        
> perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64
>      24.48 ±  2%      -5.8       18.73        
> perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64
>      22.80 ±  2%      -5.5       17.30 ±  2%  
> perf-profile.calltrace.cycles-pp.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
>      20.20 ±  2%      -4.3       15.85        
> perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary
>      23.03 ±  2%      -2.6       20.42        
> perf-profile.calltrace.cycles-pp.sys_rename.entry_SYSCALL_64_fastpath
>      17.02 ±  2%      -1.8       15.17 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_rename.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath
>      17.03 ±  2%      -1.8       15.19 ±  2%  
> perf-profile.calltrace.cycles-pp.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath
>      13.70 ±  2%      -1.2       12.47 ±  3%  
> perf-profile.calltrace.cycles-pp.__btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath
>       5.06 ±  3%      -1.1        3.96 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_async_run_delayed_root.normal_work_helper.process_one_work.worker_thread.kthread
>       7.37 ±  4%      -0.9        6.49 ±  2%  
> perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot
>       7.32 ±  4%      -0.9        6.45 ±  2%  
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.finish_wait.btrfs_tree_lock.btrfs_lock_root_node
>       1.54 ±  4%      -0.7        0.81 ±  7%  
> perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_secondary
>       2.33 ±  2%      -0.7        1.62 ±  4%  
> perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_kernel.secondary_startup_64
>       2.33 ±  2%      -0.7        1.62 ±  4%  
> perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_kernel.secondary_startup_64
>       2.33 ±  2%      -0.7        1.62 ±  4%  
> perf-profile.calltrace.cycles-pp.start_kernel.secondary_startup_64
>       2.23 ±  3%      -0.7        1.53 ±  4%  
> perf-profile.calltrace.cycles-pp.cpuidle_enter_state.do_idle.cpu_startup_entry.start_kernel.secondary_startup_64
>       2.13 ±  2%      -0.7        1.46 ±  4%  
> perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.do_idle.cpu_startup_entry.start_kernel
>       5.59            -0.7        4.94 ±  3%  
> perf-profile.calltrace.cycles-pp.__dentry_kill.dput.sys_rename.entry_SYSCALL_64_fastpath
>       5.60            -0.7        4.94 ±  3%  
> perf-profile.calltrace.cycles-pp.dput.sys_rename.entry_SYSCALL_64_fastpath
>       6.96            -0.7        6.31 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename
>       5.59            -0.7        4.94 ±  3%  
> perf-profile.calltrace.cycles-pp.evict.__dentry_kill.dput.sys_rename.entry_SYSCALL_64_fastpath
>       5.58            -0.6        4.94 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_evict_inode.evict.__dentry_kill.dput.sys_rename
>       6.94 ±  2%      -0.6        6.30 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_rename.vfs_rename
>       6.66 ±  4%      -0.6        6.10 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_rename.vfs_rename
>       6.66 ±  4%      -0.6        6.10 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename
>       3.38 ±  3%      -0.5        2.84 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_delete_delayed_items.btrfs_async_run_delayed_root.normal_work_helper.process_one_work
>       3.40 ±  3%      -0.5        2.86 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_delete_delayed_items.btrfs_async_run_delayed_root.normal_work_helper.process_one_work.worker_thread
>       7.30 ±  4%      -0.4        6.86 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode
>       1.18 ±  4%      -0.4        0.76 ±  2%  
> perf-profile.calltrace.cycles-pp.__btrfs_update_delayed_inode.btrfs_async_run_delayed_root.normal_work_helper.process_one_work.worker_thread
>       1.16 ±  4%      -0.4        0.74 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_async_run_delayed_root.normal_work_helper.process_one_work
>       5.96 ±  2%      -0.4        5.54 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode
>       5.96 ±  2%      -0.4        5.54 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref.__btrfs_unlink_inode.btrfs_rename
>       1.16 ±  4%      -0.4        0.74 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_async_run_delayed_root.normal_work_helper
>       2.95 ±  3%      -0.4        2.54 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items.btrfs_async_run_delayed_root
>       2.95 ±  3%      -0.4        2.54 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items.btrfs_async_run_delayed_root.normal_work_helper
>       0.94 ±  2%      -0.3        0.59 ±  4%  
> perf-profile.calltrace.cycles-pp.shrink_inactive_list.shrink_node_memcg.shrink_node.kswapd.kthread
>       1.25            -0.3        0.91 ±  2%  
> perf-profile.calltrace.cycles-pp.shrink_node_memcg.shrink_node.kswapd.kthread.ret_from_fork
>       0.84 ±  2%      -0.3        0.52 ±  3%  
> perf-profile.calltrace.cycles-pp.shrink_page_list.shrink_inactive_list.shrink_node_memcg.shrink_node.kswapd
>       1.01            -0.3        0.71 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_create.path_openat.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath
>       0.58 ±  4%      -0.2        0.34 ± 70%  
> perf-profile.calltrace.cycles-pp.queued_write_lock_slowpath.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items
>       1.73 ±  2%      -0.2        1.50 ±  2%  
> perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_inode_ref
>       1.67 ±  4%      -0.2        1.44 ±  3%  
> perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item
>       1.89            -0.2        1.67 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_commit_inode_delayed_inode.btrfs_evict_inode.evict.__dentry_kill.dput
>       1.88            -0.2        1.66 ±  2%  
> perf-profile.calltrace.cycles-pp.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode.btrfs_evict_inode.evict.__dentry_kill
>       1.90            -0.2        1.68 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_truncate_inode_items.btrfs_evict_inode.evict.__dentry_kill.dput
>       1.78            -0.2        1.57 ±  5%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_del_orphan_item.btrfs_orphan_del.btrfs_evict_inode.evict
>       1.88            -0.2        1.67 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_truncate_inode_items.btrfs_evict_inode.evict.__dentry_kill
>       1.78            -0.2        1.57 ±  5%  
> perf-profile.calltrace.cycles-pp.btrfs_orphan_del.btrfs_evict_inode.evict.__dentry_kill.dput
>       1.78            -0.2        1.57 ±  5%  
> perf-profile.calltrace.cycles-pp.btrfs_del_orphan_item.btrfs_orphan_del.btrfs_evict_inode.evict.__dentry_kill
>       1.87            -0.2        1.66 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode.btrfs_evict_inode
>       1.87            -0.2        1.66 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode.btrfs_evict_inode.evict
>       0.75 ± 11%      -0.2        0.57 ±  7%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.__btrfs_drop_extents
>       0.75 ± 11%      -0.2        0.57 ±  7%  
> perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.__btrfs_drop_extents.insert_reserved_file_extent
>       1.39 ±  3%      -0.2        1.23 ±  4%  
> perf-profile.calltrace.cycles-pp.prepare_to_wait_event.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items
>       1.84            -0.2        1.68 ±  3%  
> perf-profile.calltrace.cycles-pp.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename
>       1.84            -0.2        1.68 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_unlink_inode.btrfs_rename.vfs_rename.sys_rename.entry_SYSCALL_64_fastpath
>       1.62            -0.2        1.46 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_truncate_inode_items.btrfs_evict_inode.evict
>       0.87 ±  5%      -0.2        0.72 ±  5%  
> perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_delete_delayed_items
>       1.81            -0.2        1.66 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_rename
>       1.81            -0.2        1.66 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode.btrfs_rename.vfs_rename
>       1.62            -0.2        1.46 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_truncate_inode_items.btrfs_evict_inode
>       1.69 ±  2%      -0.1        1.55 ±  2%  
> perf-profile.calltrace.cycles-pp.end_bio_extent_readpage.normal_work_helper.process_one_work.worker_thread.kthread
>       1.58            -0.1        1.44 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode
>       1.58            -0.1        1.44 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.__btrfs_update_delayed_inode.btrfs_commit_inode_delayed_inode
>       2.89 ±  3%      -0.1        2.77 ±  2%  
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath.queued_write_lock_slowpath.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot
>       1.51            -0.1        1.39 ±  5%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_orphan_item.btrfs_orphan_del
>       1.51            -0.1        1.39 ±  5%  
> perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_del_orphan_item.btrfs_orphan_del.btrfs_evict_inode
>       0.94            -0.1        0.82        
> perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64
>       0.93            -0.1        0.81        
> perf-profile.calltrace.cycles-pp.__schedule.schedule_idle.do_idle.cpu_startup_entry.start_secondary
>       1.52            -0.1        1.44 ±  3%  
> perf-profile.calltrace.cycles-pp.btrfs_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.__btrfs_unlink_inode.btrfs_unlink_inode
>       0.71 ±  3%      -0.1        0.66 ±  2%  
> perf-profile.calltrace.cycles-pp.prepare_to_wait_event.btrfs_tree_lock.btrfs_lock_root_node.btrfs_search_slot.btrfs_truncate_inode_items
>       0.60 ±  2%      -0.0        0.56 ±  2%  
> perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common.wake_up_page_bit
>       1.61 ±  2%      +0.1        1.67 ±  2%  
> perf-profile.calltrace.cycles-pp.kswapd.kthread.ret_from_fork
>       1.61 ±  2%      +0.1        1.67 ±  2%  
> perf-profile.calltrace.cycles-pp.shrink_node.kswapd.kthread.ret_from_fork
>       0.55            +0.1        0.68 ±  4%  
> perf-profile.calltrace.cycles-pp.find_get_entry.pagecache_get_page.generic_file_read_iter.__vfs_read.vfs_read
>       0.57            +0.1        0.70 ±  4%  
> perf-profile.calltrace.cycles-pp.pagecache_get_page.generic_file_read_iter.__vfs_read.vfs_read.sys_read
>       0.59 ±  3%      +0.3        0.87        
> perf-profile.calltrace.cycles-pp.__account_scheduler_latency.enqueue_entity.enqueue_task_fair.ttwu_do_activate.try_to_wake_up
>       1.29 ±  3%      +0.3        1.58        
> perf-profile.calltrace.cycles-pp.enqueue_task_fair.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common
>       0.54            +0.3        0.83 ±  2%  
> perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent
>       0.84            +0.3        1.18 ±  3%  
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.prepare_to_wait_event.btrfs_tree_read_lock.btrfs_read_lock_root_node
>       0.71 ±  4%      +0.3        1.05        
> perf-profile.calltrace.cycles-pp.ttwu_do_activate.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock
>       0.86            +0.4        1.21 ±  3%  
> perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.prepare_to_wait_event.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot
>       0.90            +0.4        1.28 ±  3%  
> perf-profile.calltrace.cycles-pp.prepare_to_wait_event.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item
>       0.35 ± 71%      +0.4        0.79 ±  3%  
> perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_inode
>       0.98 ±  4%      +0.4        1.41 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget
>       0.98 ±  5%      +0.4        1.43 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry
>       0.94 ±  3%      +0.4        1.39        
> perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.btrfs_clear_path_blocking
>       0.96 ±  3%      +0.5        1.42        
> perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot
>       1.05 ±  2%      +0.5        1.52 ±  4%  
> perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent
>       1.06 ±  3%      +0.5        1.53 ±  4%  
> perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage
>       0.97 ±  3%      +0.5        1.45        
> perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item
>       0.62 ±  4%      +0.5        1.13        
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath.queued_read_lock_slowpath.btrfs_clear_lock_blocking_rw.btrfs_clear_path_blocking.btrfs_search_slot
>       0.65 ±  4%      +0.5        1.19        
> perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_clear_lock_blocking_rw.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item
>       0.17 ±141%      +0.6        0.73        
> perf-profile.calltrace.cycles-pp.save_stack_trace_tsk.__account_scheduler_latency.enqueue_entity.enqueue_task_fair.ttwu_do_activate
>       0.00            +0.6        0.57 ±  2%  
> perf-profile.calltrace.cycles-pp.task_work_run.exit_to_usermode_loop.syscall_return_slowpath.entry_SYSCALL_64_fastpath
>       0.00            +0.6        0.59 ±  2%  
> perf-profile.calltrace.cycles-pp.exit_to_usermode_loop.syscall_return_slowpath.entry_SYSCALL_64_fastpath
>       0.00            +0.6        0.60 ±  2%  
> perf-profile.calltrace.cycles-pp.syscall_return_slowpath.entry_SYSCALL_64_fastpath
>       0.56 ±  4%      +0.6        1.17        
> perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_file_extent
>       1.17            +0.6        1.78 ±  2%  
> perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.finish_wait.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot
>       1.16            +0.6        1.77 ±  2%  
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.finish_wait.btrfs_tree_read_lock.btrfs_read_lock_root_node
>       1.18            +0.6        1.80 ±  2%  
> perf-profile.calltrace.cycles-pp.finish_wait.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item
>       0.57 ±  7%      +0.6        1.21 ±  2%  
> perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_inode
>       0.75 ±  4%      +0.6        1.38        
> perf-profile.calltrace.cycles-pp.btrfs_clear_lock_blocking_rw.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry
>       1.95            +0.7        2.61        
> perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string.copyout.copy_page_to_iter.generic_file_read_iter.__vfs_read
>       1.96            +0.7        2.62        
> perf-profile.calltrace.cycles-pp.copyout.copy_page_to_iter.generic_file_read_iter.__vfs_read.vfs_read
>       0.00            +0.7        0.70        
> perf-profile.calltrace.cycles-pp.__save_stack_trace.save_stack_trace_tsk.__account_scheduler_latency.enqueue_entity.enqueue_task_fair
>       2.10            +0.7        2.80        
> perf-profile.calltrace.cycles-pp.copy_page_to_iter.generic_file_read_iter.__vfs_read.vfs_read.sys_read
>       0.85 ±  5%      +0.7        1.58        
> perf-profile.calltrace.cycles-pp.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent
>       0.84 ±  6%      +0.7        1.57        
> perf-profile.calltrace.cycles-pp.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget
>       0.00            +0.8        0.75 ±  4%  
> perf-profile.calltrace.cycles-pp.super_cache_scan.shrink_slab.shrink_node.kswapd.kthread
>       0.00            +0.8        0.75 ±  5%  
> perf-profile.calltrace.cycles-pp.shrink_slab.shrink_node.kswapd.kthread.ret_from_fork
>       1.07 ±  5%      +0.9        1.98 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage
>       1.09 ±  7%      +0.9        2.01 ±  2%  
> perf-profile.calltrace.cycles-pp.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry
>       4.90 ±  2%      +1.3        6.19        
> perf-profile.calltrace.cycles-pp.__do_page_cache_readahead.ondemand_readahead.generic_file_read_iter.__vfs_read.vfs_read
>       4.90 ±  2%      +1.3        6.20        
> perf-profile.calltrace.cycles-pp.ondemand_readahead.generic_file_read_iter.__vfs_read.vfs_read.sys_read
>       4.44 ±  2%      +1.3        5.74        
> perf-profile.calltrace.cycles-pp.extent_readpages.__do_page_cache_readahead.ondemand_readahead.generic_file_read_iter.__vfs_read
>       2.87 ±  3%      +1.4        4.29        
> perf-profile.calltrace.cycles-pp.__extent_readpages.extent_readpages.__do_page_cache_readahead.ondemand_readahead.generic_file_read_iter
>       2.22 ±  4%      +1.4        3.65        
> perf-profile.calltrace.cycles-pp.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage.__extent_readpages.extent_readpages
>       2.22 ±  4%      +1.4        3.65        
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_file_extent.btrfs_get_extent.__do_readpage.__extent_readpages
>       2.27 ±  6%      +1.4        3.72        
> perf-profile.calltrace.cycles-pp.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry.btrfs_lookup.path_openat
>       2.27 ±  6%      +1.4        3.72        
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_inode.btrfs_iget.btrfs_lookup_dentry.btrfs_lookup
>       2.72 ±  3%      +1.4        4.17        
> perf-profile.calltrace.cycles-pp.__do_readpage.__extent_readpages.extent_readpages.__do_page_cache_readahead.ondemand_readahead
>       2.33 ±  4%      +1.5        3.79        
> perf-profile.calltrace.cycles-pp.btrfs_get_extent.__do_readpage.__extent_readpages.extent_readpages.__do_page_cache_readahead
>       2.38            +1.6        3.95        
> perf-profile.calltrace.cycles-pp.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item
>       2.63 ±  6%      +1.7        4.33        
> perf-profile.calltrace.cycles-pp.btrfs_iget.btrfs_lookup_dentry.btrfs_lookup.path_openat.do_filp_open
>       1.68 ±  4%      +2.1        3.79        
> perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item
>       8.66            +2.2       10.90        
> perf-profile.calltrace.cycles-pp.generic_file_read_iter.__vfs_read.vfs_read.sys_read.entry_SYSCALL_64_fastpath
>       8.71            +2.3       10.97        
> perf-profile.calltrace.cycles-pp.__vfs_read.vfs_read.sys_read.entry_SYSCALL_64_fastpath
>       8.85            +2.3       11.15        
> perf-profile.calltrace.cycles-pp.vfs_read.sys_read.entry_SYSCALL_64_fastpath
>       9.02            +2.3       11.33        
> perf-profile.calltrace.cycles-pp.sys_read.entry_SYSCALL_64_fastpath
>       2.71 ±  3%      +2.6        5.35        
> perf-profile.calltrace.cycles-pp.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry
>       4.63            +2.7        7.29        
> perf-profile.calltrace.cycles-pp.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry
>       4.65            +2.7        7.32        
> perf-profile.calltrace.cycles-pp.btrfs_read_lock_root_node.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup
>       1.94 ±  2%      +2.8        4.77        
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath.queued_read_lock_slowpath.btrfs_tree_read_lock.btrfs_read_lock_root_node.btrfs_search_slot
>       2.81 ±  4%      +3.3        6.15        
> perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__wake_up_common_lock.btrfs_clear_path_blocking.btrfs_search_slot
>       3.58 ±  2%      +3.4        6.98        
> perf-profile.calltrace.cycles-pp.btrfs_clear_path_blocking.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup
>       8.69            +6.3       15.03        
> perf-profile.calltrace.cycles-pp.btrfs_search_slot.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup.path_openat
>       8.75            +6.4       15.12        
> perf-profile.calltrace.cycles-pp.btrfs_lookup_dir_item.btrfs_lookup_dentry.btrfs_lookup.path_openat.do_filp_open
>      47.34            +8.1       55.46        
> perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_fastpath
>      11.47            +8.1       19.60        
> perf-profile.calltrace.cycles-pp.btrfs_lookup.path_openat.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath
>      11.46            +8.1       19.59        
> perf-profile.calltrace.cycles-pp.btrfs_lookup_dentry.btrfs_lookup.path_openat.do_filp_open.do_sys_open
>      13.90            +8.3       22.20        
> perf-profile.calltrace.cycles-pp.path_openat.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath
>      13.91            +8.3       22.22        
> perf-profile.calltrace.cycles-pp.do_filp_open.do_sys_open.entry_SYSCALL_64_fastpath
>      14.13            +8.4       22.52        
> perf-profile.calltrace.cycles-pp.do_sys_open.entry_SYSCALL_64_fastpath
>
> The cycles for dentry lookup increased much too.  Is this the reason why
> write score decreased?
>
> If you need more information, please let me know.
>
> Best Regards,
> Huang, Ying
>
>>    1140424  12%     +40.2%    1598980  14%  
>> sched_debug.cfs_rq:/.MIN_vruntime.max
>>     790.55           +13.0%     893.20   3%  
>> sched_debug.cfs_rq:/.exec_clock.stddev
>>    1140425  12%     +40.2%    1598982  14%  
>> sched_debug.cfs_rq:/.max_vruntime.max
>>       0.83  10%     +21.5%       1.00   8%  
>> sched_debug.cfs_rq:/.nr_running.avg
>>       3.30  99%    +266.3%      12.09  13%  
>> sched_debug.cfs_rq:/.removed.load_avg.avg
>>     153.02  97%    +266.6%     560.96  13%  
>> sched_debug.cfs_rq:/.removed.runnable_sum.avg
>>     569.93 102%    +173.2%       1556  14%  
>> sched_debug.cfs_rq:/.removed.runnable_sum.stddev
>>       1.42  60%    +501.5%       8.52  34%  
>> sched_debug.cfs_rq:/.removed.util_avg.avg
>>      19.88  59%    +288.9%      77.29  16%  
>> sched_debug.cfs_rq:/.removed.util_avg.max
>>       5.05  58%    +342.3%      22.32  22%  
>> sched_debug.cfs_rq:/.removed.util_avg.stddev
>>     791.44   3%     +47.7%       1168   8%  sched_debug.cfs_rq:/.util_avg.avg
>>       1305   6%     +33.2%       1738   5%  sched_debug.cfs_rq:/.util_avg.max
>>     450.25  11%     +66.2%     748.17  14%  sched_debug.cfs_rq:/.util_avg.min
>>     220.82   8%     +21.1%     267.46   5%  
>> sched_debug.cfs_rq:/.util_avg.stddev
>>     363118  11%     -23.8%     276520  11%  sched_debug.cpu.avg_idle.avg
>>     726003   8%     -30.8%     502313   4%  sched_debug.cpu.avg_idle.max
>>     202629   3%     -32.2%     137429  18%  sched_debug.cpu.avg_idle.stddev
>>      31.96  28%     +54.6%      49.42  14%  sched_debug.cpu.cpu_load[3].min
>>      36.21  25%     +64.0%      59.38   6%  sched_debug.cpu.cpu_load[4].min
>>       1007   5%     +20.7%       1216   7%  sched_debug.cpu.curr->pid.avg
>>       4.50   5%     +14.8%       5.17   5%  sched_debug.cpu.nr_running.max
>>    2476195           -11.8%    2185022        sched_debug.cpu.nr_switches.max
>>     212888           -26.6%     156172   3%  
>> sched_debug.cpu.nr_switches.stddev
>>       3570   2%     -58.7%       1474   2%  
>> sched_debug.cpu.nr_uninterruptible.max
>>    -803.67           -28.7%    -573.38        
>> sched_debug.cpu.nr_uninterruptible.min
>>       1004   2%     -50.4%     498.55   3%  
>> sched_debug.cpu.nr_uninterruptible.stddev
>>    2478809           -11.7%    2189310        sched_debug.cpu.sched_count.max
>>     214130           -26.5%     157298   3%  
>> sched_debug.cpu.sched_count.stddev
>>     489430   2%     -16.6%     408309   2%  sched_debug.cpu.sched_goidle.avg
>>     724333   2%     -28.2%     520263   2%  sched_debug.cpu.sched_goidle.max
>>     457611           -18.1%     374746   3%  sched_debug.cpu.sched_goidle.min
>>      62957   2%     -47.4%      33138   3%  
>> sched_debug.cpu.sched_goidle.stddev
>>     676053   2%     -15.4%     571816   2%  sched_debug.cpu.ttwu_local.max
>>      42669   3%     +22.3%      52198        sched_debug.cpu.ttwu_local.min
>>     151873   2%     -18.3%     124118   2%  sched_debug.cpu.ttwu_local.stddev
>>
>>
>>                                                                              
>>    
>>                                blogbench.write_score                         
>>    
>>                                                                              
>>    
>>   3300 
>> +-+------------------------------------------------------------------+   
>>   3250 +-+          +.       .+   +. .+  :      : :        +.   .+ .+.+.+. 
>> .|   
>>        |:    +.   .+  +.+.+.+  + +  +    :   +. : :    +. +  +.+  +       + 
>> |   
>>   3200 +-+  +  +.+              +         : +  +   :  +  +                  
>> |   
>>   3150 +-+.+                              ++       +.+                      
>> |   
>>   3100 +-+                                                                  
>> |   
>>   3050 +-+                                                                  
>> |   
>>        |                                                                    
>> |   
>>   3000 +-+                                                                  
>> |   
>>   2950 +-+                    O   O                                         
>> |   
>>   2900 +-O     O   O                  O                                     
>> |   
>>   2850 +-+       O  O   O O O   O       O OO O   O   O                      
>> |   
>>        |   O                        O          O   O                        
>> |   
>>   2800 O-+   O        O                                                     
>> |   
>>   2750 
>> +-+------------------------------------------------------------------+   
>>                                                                              
>>    
>>                                                                              
>>                                                                              
>>       
>> [*] bisect-good sample
>> [O] bisect-bad  sample
>>
>>
>>
>> Disclaimer:
>> Results have been estimated based on internal Intel analysis and are provided
>> for informational purposes only. Any difference in system hardware or 
>> software
>> design or configuration may affect actual performance.
>>
>>
>> Thanks,
>> Xiaolong
> _______________________________________________
> LKP mailing list
> l...@lists.01.org
> https://lists.01.org/mailman/listinfo/lkp
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to