Greeting,

FYI, we noticed a -21.2% regression of blogbench.write_score due to commit:


commit: 5c0338c68706be53b3dc472e4308961c36e4ece1 ("workqueue: restore 
WQ_UNBOUND/max_active==1 to be ordered")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master

in testcase: blogbench
on test machine: 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G memory
with following parameters:

        disk: 1SSD
        fs: btrfs
        cpufreq_governor: performance

test-description: Blogbench is a portable filesystem benchmark that tries to 
reproduce the load of a real-world busy file server.
test-url: https://www.pureftpd.org/project/blogbench


Details are as below:
-------------------------------------------------------------------------------------------------->


To reproduce:

        git clone https://github.com/intel/lkp-tests.git
        cd lkp-tests
        bin/lkp install job.yaml  # job file is attached in this email
        bin/lkp run     job.yaml

=========================================================================================
compiler/cpufreq_governor/disk/fs/kconfig/rootfs/tbox_group/testcase:
  
gcc-6/performance/1SSD/btrfs/x86_64-rhel-7.2/debian-x86_64-2016-08-31.cgz/lkp-bdw-de1/blogbench

commit: 
  74cbd96bc2 ("Merge tag 'md/4.13-rc2' of 
git://git.kernel.org/pub/scm/linux/kernel/git/shli/md")
  5c0338c687 ("workqueue: restore WQ_UNBOUND/max_active==1 to be ordered")

74cbd96bc2e00f5d 5c0338c68706be53b3dc472e43 
---------------- -------------------------- 
         %stddev     %change         %stddev
             \          |                \  
      3974           -21.2%       3131 ±  4%  blogbench.write_score
   1608428 ±  2%      +9.7%    1764811 ±  6%  blogbench.read_score
 1.043e+08           -58.0%   43770754 ± 19%  blogbench.time.file_system_inputs
  33045282           -23.0%   25456502 ±  4%  blogbench.time.file_system_outputs
   1889747           -51.2%     922952        
blogbench.time.involuntary_context_switches
      1066            -9.1%     969.60 ±  4%  
blogbench.time.percent_of_cpu_this_job_got
      3058            -9.6%       2764 ±  4%  blogbench.time.system_time
  20367518 ±  2%     -58.2%    8523363 ± 17%  
blogbench.time.voluntary_context_switches
     56387 ±  6%      -9.4%      51101 ±  2%  
interrupts.CAL:Function_call_interrupts
    599681 ±  2%     -42.0%     347640 ± 14%  softirqs.SCHED
     21.79 ±  3%      +6.4       28.17 ± 10%  mpstat.cpu.iowait%
      0.02 ± 22%      +0.2        0.23 ± 15%  mpstat.cpu.soft%
      3.08 ±  5%      +0.6        3.72 ±  6%  mpstat.cpu.usr%
     63227 ±  3%     +13.0%      71434 ±  6%  meminfo.CmaFree
   1915913 ±  4%     -25.3%    1431623 ±  4%  meminfo.Inactive
   1897786 ±  4%     -25.5%    1413549 ±  4%  meminfo.Inactive(file)
   1269568 ±  4%     +27.1%    1613730 ±  9%  meminfo.MemFree
      8434 ±  5%     -55.9%       3719 ± 18%  iostat.sda.r/s
    168471 ±  5%     -56.9%      72537 ± 19%  iostat.sda.rkB/s
     10.68 ±  8%     -37.2%       6.70 ± 11%  iostat.sda.rrqm/s
    732.25 ±  5%     -30.4%     509.76 ±  2%  iostat.sda.w/s
      1748 ±  4%     -18.2%       1429 ±  5%  iostat.sda.wrqm/s
 8.165e+08 ±  2%     -20.7%  6.477e+08 ± 14%  cpuidle.C1.time
  26339770 ±  2%     -59.2%   10742045 ± 18%  cpuidle.C1.usage
 1.775e+08 ±  6%    +236.0%  5.962e+08 ±  6%  cpuidle.C1E.time
    607354 ±  4%     +51.8%     921820 ±  9%  cpuidle.C1E.usage
    134897 ± 25%     -38.6%      82860 ± 15%  cpuidle.C3.usage
    518720 ±  2%     -60.1%     207069 ± 19%  cpuidle.POLL.usage
    168530 ±  5%     -56.9%      72617 ± 19%  vmstat.io.bi
     57048 ±  5%     -22.2%      44397 ±  4%  vmstat.io.bo
    429.10 ±  5%     +21.2%     520.20 ±  8%  vmstat.memory.buff
   1289089 ±  4%     +26.6%    1631377 ±  8%  vmstat.memory.free
     40.40 ±  4%     -17.6%      33.30 ± 11%  vmstat.procs.b
    207214 ±  5%     -34.2%     136272 ±  9%  vmstat.system.cs
     27043 ±  2%     -26.6%      19840 ±  3%  vmstat.system.in
    216066 ±  3%     +12.8%     243680 ±  2%  
slabinfo.btrfs_delayed_node.active_objs
     11273 ±  4%     -27.3%       8194 ±  3%  
slabinfo.btrfs_extent_buffer.active_objs
     11710 ±  4%     -26.2%       8636 ±  3%  
slabinfo.btrfs_extent_buffer.num_objs
      5056 ±  3%     -12.2%       4441 ±  6%  
slabinfo.btrfs_ordered_extent.active_objs
      5187 ±  3%     -11.9%       4568 ±  6%  
slabinfo.btrfs_ordered_extent.num_objs
     19452 ± 10%     +66.7%      32428 ±  8%  slabinfo.kmalloc-256.active_objs
    624.40 ±  9%     +65.7%       1034 ±  8%  slabinfo.kmalloc-256.active_slabs
     19996 ±  9%     +65.7%      33129 ±  8%  slabinfo.kmalloc-256.num_objs
    624.40 ±  9%     +65.7%       1034 ±  8%  slabinfo.kmalloc-256.num_slabs
 6.933e+11            -7.8%  6.395e+11 ±  2%  perf-stat.branch-instructions
      0.62            -0.1        0.47 ±  4%  perf-stat.branch-miss-rate%
 4.329e+09           -30.0%  3.032e+09 ±  2%  perf-stat.branch-misses
  64545297 ±  2%     -35.9%   41378261 ±  9%  perf-stat.context-switches
 9.103e+12            -8.0%  8.376e+12 ±  3%  perf-stat.cpu-cycles
   6729558 ±  3%     -72.4%    1858286 ± 17%  perf-stat.cpu-migrations
     26.24 ±  5%     +23.6       49.83 ±  9%  perf-stat.iTLB-load-miss-rate%
 4.122e+08 ±  7%     +26.5%  5.214e+08 ±  5%  perf-stat.iTLB-load-misses
 1.157e+09 ±  2%     -54.2%  5.302e+08 ± 14%  perf-stat.iTLB-loads
      8459 ±  7%     -24.3%       6399 ±  5%  
perf-stat.instructions-per-iTLB-miss
      0.38            +4.4%       0.40        perf-stat.ipc
    319949 ±  5%      -3.9%     307528        perf-stat.minor-faults
    319952 ±  5%      -3.9%     307494        perf-stat.page-faults
  84255382 ±  9%     -95.6%    3708415 ± 83%  
proc-vmstat.compact_daemon_free_scanned
   1829109 ± 18%     -97.6%      44241 ± 93%  
proc-vmstat.compact_daemon_migrate_scanned
      1059 ± 11%     -92.8%      75.90 ±100%  proc-vmstat.compact_daemon_wake
  84255382 ±  9%     -95.6%    3708415 ± 83%  proc-vmstat.compact_free_scanned
    781008 ± 21%     -98.8%       9090 ± 89%  proc-vmstat.compact_isolated
   1829109 ± 18%     -97.6%      44241 ± 93%  
proc-vmstat.compact_migrate_scanned
    179.80 ±  9%     -40.8%     106.50 ± 16%  
proc-vmstat.kswapd_high_wmark_hit_quickly
     22499 ± 62%    -100.0%       5.50 ±125%  proc-vmstat.kswapd_inodesteal
      4237 ±  8%     -83.2%     710.60 ± 28%  
proc-vmstat.kswapd_low_wmark_hit_quickly
   4539912           -24.2%    3441760 ±  4%  proc-vmstat.nr_dirtied
    316909 ±  4%     +27.1%     402889 ±  9%  proc-vmstat.nr_free_pages
    474932 ±  4%     -25.5%     353600 ±  4%  proc-vmstat.nr_inactive_file
    207449 ± 14%     -54.1%      95277 ± 24%  
proc-vmstat.nr_vmscan_immediate_reclaim
   4442638           -24.2%    3366510 ±  4%  proc-vmstat.nr_written
    474994 ±  4%     -25.5%     353713 ±  4%  proc-vmstat.nr_zone_inactive_file
  20945924           -35.0%   13606065 ±  7%  proc-vmstat.numa_hit
  20945924           -35.0%   13606065 ±  7%  proc-vmstat.numa_local
      4632 ±  7%     -75.8%       1119 ± 18%  proc-vmstat.pageoutrun
  26805123 ±  2%     -35.6%   17261359 ±  6%  proc-vmstat.pgactivate
   6702031           -15.2%    5684286 ±  4%  proc-vmstat.pgalloc_dma32
  17277542           -27.9%   12464886 ±  4%  proc-vmstat.pgalloc_normal
  31715903 ±  2%     -39.0%   19354510 ±  8%  proc-vmstat.pgdeactivate
  22331249           -26.4%   16425277 ±  5%  proc-vmstat.pgfree
     62247 ±  8%     -94.5%       3437 ± 76%  proc-vmstat.pgmigrate_success
  52404647           -58.0%   22020215 ± 19%  proc-vmstat.pgpgin
  17745601           -24.2%   13447782 ±  4%  proc-vmstat.pgpgout
  31724465 ±  2%     -39.0%   19363354 ±  8%  proc-vmstat.pgrefill
    229386 ± 12%     -48.0%     119203 ± 28%  proc-vmstat.pgrotated
  16121910           -54.1%    7392371 ± 17%  proc-vmstat.pgscan_kswapd
  15714718           -54.8%    7109527 ± 17%  proc-vmstat.pgsteal_kswapd
   8744243           -55.3%    3910348 ± 18%  proc-vmstat.slabs_scanned
   1941347 ± 11%     -75.0%     485169 ± 26%  proc-vmstat.workingset_activate
   3087977 ± 13%     -76.7%     718263 ± 28%  proc-vmstat.workingset_refault


Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.


Thanks,
Xiaolong
---

#! jobs/blogbench-1ssd.yaml
suite: blogbench
testcase: blogbench
category: benchmark
disk: 1SSD
fs: btrfs
blogbench: 
job_origin: 
"/lkp/lkp/.src-20170721-133542/allot/cyclic:linux-devel:devel-hourly/lkp-bdw-de1/blogbench-1ssd.yaml"

#! queue options
queue: bisect
testbox: lkp-bdw-de1
tbox_group: lkp-bdw-de1
submit_id: 5973470b0b9a93631584a7d0
job_file: 
"/lkp/scheduled/lkp-bdw-de1/blogbench-1SSD-btrfs-performance-debian-x86_64-2016-08-31.cgz-5c0338c68706be53b3dc472e4308961c36e4ece1-20170722-25365-6b8zgb-0.yaml"
id: d3e7da0613559b94d7a99eb30e80352859bdd7bc

#! hosts/lkp-bdw-de1
model: Broadwell-DE
nr_cpu: 16
memory: 8G
ssd_partitions: "/dev/disk/by-id/ata-INTEL_SSDSC2BB480G6_BTWA6075012T480FGN"
swap_partitions: 
rootfs_partition: 
kernel_cmdline: console=ttyS1,115200 console=tty0
brand: Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz

#! include/category/benchmark
kmsg: 
iostat: 
heartbeat: 
vmstat: 
numa-numastat: 
numa-vmstat: 
numa-meminfo: 
proc-vmstat: 
proc-stat: 
meminfo: 
slabinfo: 
interrupts: 
lock_stat: 
latency_stats: 
softirqs: 
bdi_dev_mapping: 
diskstats: 
nfsstat: 
cpuidle: 
cpufreq-stats: 
turbostat: 
sched_debug: 
perf-stat: 
mpstat: 
perf-profile: 

#! include/category/ALL
cpufreq_governor: performance

#! include/disk/nr_ssd
need_kconfig:
- CONFIG_BLK_DEV_SD
- CONFIG_SCSI
- CONFIG_BLOCK=y
- CONFIG_SATA_AHCI
- CONFIG_SATA_AHCI_PLATFORM
- CONFIG_ATA
- CONFIG_PCI=y
- CONFIG_BTRFS_FS

#! include/queue/cyclic
commit: 5c0338c68706be53b3dc472e4308961c36e4ece1

#! include/fs/btrfs

#! default params
kconfig: x86_64-rhel-7.2
compiler: gcc-6
rootfs: debian-x86_64-2016-08-31.cgz
enqueue_time: 2017-07-22 20:37:33.211024853 +08:00
_id: 5973470b0b9a93631584a7d0
_rt: 
"/result/blogbench/1SSD-btrfs-performance/lkp-bdw-de1/debian-x86_64-2016-08-31.cgz/x86_64-rhel-7.2/gcc-6/5c0338c68706be53b3dc472e4308961c36e4ece1"

#! schedule options
user: lkp
head_commit: dd5e317b37c687cf696b54b1148559ec09682d18
base_commit: 5771a8c08880cdca3bfb4a3fc6d309d6bba20877
branch: linux-devel/devel-hourly-2017072207
result_root: 
"/result/blogbench/1SSD-btrfs-performance/lkp-bdw-de1/debian-x86_64-2016-08-31.cgz/x86_64-rhel-7.2/gcc-6/5c0338c68706be53b3dc472e4308961c36e4ece1/0"
LKP_SERVER: inn
max_uptime: 3232.98
initrd: "/osimage/debian/debian-x86_64-2016-08-31.cgz"
bootloader_append:
- root=/dev/ram0
- user=lkp
- 
job=/lkp/scheduled/lkp-bdw-de1/blogbench-1SSD-btrfs-performance-debian-x86_64-2016-08-31.cgz-5c0338c68706be53b3dc472e4308961c36e4ece1-20170722-25365-6b8zgb-0.yaml
- ARCH=x86_64
- kconfig=x86_64-rhel-7.2
- branch=linux-devel/devel-hourly-2017072207
- commit=5c0338c68706be53b3dc472e4308961c36e4ece1
- 
BOOT_IMAGE=/pkg/linux/x86_64-rhel-7.2/gcc-6/5c0338c68706be53b3dc472e4308961c36e4ece1/vmlinuz-4.13.0-rc1-00060-g5c0338c
- console=ttyS1,115200 console=tty0
- max_uptime=3232
- 
RESULT_ROOT=/result/blogbench/1SSD-btrfs-performance/lkp-bdw-de1/debian-x86_64-2016-08-31.cgz/x86_64-rhel-7.2/gcc-6/5c0338c68706be53b3dc472e4308961c36e4ece1/0
- LKP_SERVER=inn
- debug
- apic=debug
- sysrq_always_enabled
- rcupdate.rcu_cpu_stall_timeout=100
- net.ifnames=0
- printk.devkmsg=on
- panic=-1
- softlockup_panic=1
- nmi_watchdog=panic
- oops=panic
- load_ramdisk=2
- prompt_ramdisk=0
- drbd.minor_count=8
- earlyprintk=ttyS1,115200
- systemd.log_level=err
- console=ttyS1,115200
- console=tty0
- rw
lkp_initrd: "/lkp/lkp/lkp-x86_64.cgz"
modules_initrd: 
"/pkg/linux/x86_64-rhel-7.2/gcc-6/5c0338c68706be53b3dc472e4308961c36e4ece1/modules.cgz"
bm_initrd: 
"/osimage/deps/debian-x86_64-2016-08-31.cgz/lkp_2017-05-01.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/rsync-rootfs_2016-11-15.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/run-ipconfig_2016-11-15.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/fs_2016-11-15.cgz,/osimage/pkg/debian-x86_64-2016-08-31.cgz/blogbench-x86_64_2016-09-01.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/iostat_2016-11-15.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/turbostat_2016-11-15.cgz,/osimage/pkg/debian-x86_64-2016-08-31.cgz/turbostat-x86_64-d5256b2_2017-06-20.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/perf_2017-07-07.cgz,/osimage/pkg/debian-x86_64-2016-08-31.cgz/perf-x86_64-af3c8d98508d_2017-07-10.cgz,/osimage/deps/debian-x86_64-2016-08-31.cgz/hw_2016-11-15.cgz"
site: inn

#! /lkp/lkp/.src-20170721-133542/include/site/inn
LKP_CGI_PORT: 80
LKP_CIFS_PORT: 139
oom-killer: 
watchdog: 

#! runtime status
repeat_to: 5

#! user overrides
kernel: 
"/pkg/linux/x86_64-rhel-7.2/gcc-6/5c0338c68706be53b3dc472e4308961c36e4ece1/vmlinuz-4.13.0-rc1-00060-g5c0338c"
dequeue_time: 2017-07-22 20:42:28.496576889 +08:00
job_state: finished
loadavg: '130.02'
dmsetup remove_all
wipefs -a --force /dev/sda
mkfs -t btrfs /dev/sda
mkdir -p /fs/sda
mount -t btrfs /dev/sda /fs/sda

for cpu_dir in /sys/devices/system/cpu/cpu[0-9]*
do
        online_file="$cpu_dir"/online
        [ -f "$online_file" ] && [ "$(cat "$online_file")" -eq 0 ] && continue

        file="$cpu_dir"/cpufreq/scaling_governor
        [ -f "$file" ] && echo "$scaling_governor" > "$file"
done

./blogbench -d /fs/sda

Reply via email to