* Arnaldo Carvalho de Melo <a...@kernel.org> wrote: > From: Arnaldo Carvalho de Melo <a...@redhat.com> > > Hi Ingo, > > Please consider pulling, > > - Arnaldo > > Test results at the end. > > The following changes since commit 6a6b12e2125591e24891e6860410795ea53aed11: > > Merge tag 'perf-core-for-mingo-20161114' of > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/core > (2016-11-15 09:45:04 +0100) > > are available in the git repository at: > > git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git > tags/perf-core-for-mingo-20161123 > > for you to fetch changes up to a407b0678bc1c39d70af5fdbe6421c164b69a8c0: > > perf sched timehist: Add -V/--cpu-visual option (2016-11-23 10:44:09 -0300) > > ---------------------------------------------------------------- > perf/core improvements and fixes: > > New tool: > > - 'perf sched timehist' provides an analysis of scheduling events. > > Example usage: > perf sched record -- sleep 1 > perf sched timehist > > By default it shows the individual schedule events, including the wait > time (time between sched-out and next sched-in events for the task), the > task scheduling delay (time between wakeup and actually running) and run > time for the task: > > time cpu task name wait time sch delay run time > [tid/pid] (msec) (msec) (msec) > -------- ------ ---------------- --------- --------- -------- > 1.874569 [0011] gcc[31949] 0.014 0.000 1.148 > 1.874591 [0010] gcc[31951] 0.000 0.000 0.024 > 1.874603 [0010] migration/10[59] 3.350 0.004 0.011 > 1.874604 [0011] <idle> 1.148 0.000 0.035 > 1.874723 [0005] <idle> 0.016 0.000 1.383 > 1.874746 [0005] gcc[31949] 0.153 0.078 0.022 > ... > > Times are in msec.usec. (David Ahern, Namhyung Kim) > > Improvements: > > - Make 'perf c2c report' support -f/--force, to allow skipping the > ownership check for root users, for instance, just like the other > tools (Jiri Olsa) > > - Allow sorting cachelines by total number of HITMs, in addition to > local and remote numbers (Jiri Olsa) > > Fixes: > > - Make sure errors aren't suppressed by the TUI reset at the end of > a 'perf c2c report' session (Jiri Olsa) > > Infrastructure: > > - Initial work on having the annotate code better support multiple > architectures, including the ability to cross-annotate, i.e. to > annotate perf.data files collected on an ARM system on a x86_64 > workstation (Arnaldo Carvalho de Melo, Ravi Bangoria, Kim Phillips) > > - Use USECS_PER_SEC instead of hard coded number in libtraceevent (Steven > Rostedt) > > - Add retrieval of preempt count and latency flags in libtraceevent (Steven > Rostedt) > > Signed-off-by: Arnaldo Carvalho de Melo <a...@redhat.com> > > ---------------------------------------------------------------- > Arnaldo Carvalho de Melo (3): > perf annotate: Start supporting cross arch annotation > perf annotate: Allow arches to specify functions to skip > perf annotate: Add per arch instructions annotate handlers > > David Ahern (5): > perf sched timehist: Introduce timehist command > perf sched timehist: Add summary options > perf sched timehist: Add -w/--wakeups option > perf sched timehist: Add call graph options > perf sched timehist: Add -V/--cpu-visual option > > Jiri Olsa (6): > perf tools: Show event fd in debug output > perf c2c report: Setup browser after opening perf.data > perf c2c report: Add -f/--force option > perf c2c report: Add struct c2c_stats::tot_hitm field > perf c2c report: Display total HITMs on default > perf c2c: Support cascading options > > Namhyung Kim (2): > perf symbols: Print symbol offsets conditionally > perf evsel: Support printing callchains with arrows > > Steven Rostedt (2): > tools lib traceevent: Use USECS_PER_SEC instead of hardcoded number > tools lib traceevent: Add retrieval of preempt count and latency flags > > tools/lib/traceevent/event-parse.c | 41 +- > tools/lib/traceevent/event-parse.h | 5 +- > tools/perf/Documentation/perf-c2c.txt | 8 + > tools/perf/Documentation/perf-sched.txt | 66 +- > tools/perf/arch/arm/annotate/instructions.c | 90 +++ > tools/perf/arch/x86/annotate/instructions.c | 78 +++ > tools/perf/builtin-c2c.c | 80 ++- > tools/perf/builtin-sched.c | 914 > +++++++++++++++++++++++++++- > tools/perf/builtin-top.c | 2 +- > tools/perf/ui/browsers/annotate.c | 2 +- > tools/perf/ui/gtk/annotate.c | 2 +- > tools/perf/util/annotate.c | 251 ++++---- > tools/perf/util/annotate.h | 6 +- > tools/perf/util/evsel.c | 6 +- > tools/perf/util/evsel.h | 1 + > tools/perf/util/evsel_fprintf.c | 12 +- > tools/perf/util/mem-events.c | 12 +- > tools/perf/util/mem-events.h | 1 + > tools/perf/util/symbol.h | 3 +- > tools/perf/util/symbol_fprintf.c | 11 +- > 20 files changed, 1406 insertions(+), 185 deletions(-) > create mode 100644 tools/perf/arch/arm/annotate/instructions.c > create mode 100644 tools/perf/arch/x86/annotate/instructions.c > > # uname -a > Linux jouet 4.8.6-201.fc24.x86_64 #1 SMP Thu Nov 3 14:38:57 UTC 2016 x86_64 > x86_64 x86_64 GNU/Linux > # perf test > 1: vmlinux symtab matches kallsyms : Ok > 2: detect openat syscall event : Ok > 3: detect openat syscall event on all cpus : Ok > 4: read samples using the mmap interface : Ok > 5: parse events tests : Ok > 6: Validate PERF_RECORD_* events & perf_sample fields : Ok > 7: Test perf pmu format parsing : Ok > 8: Test dso data read : Ok > 9: Test dso data cache : Ok > 10: Test dso data reopen : Ok > 11: roundtrip evsel->name check : Ok > 12: Check parsing of sched tracepoints fields : Ok > 13: Generate and check syscalls:sys_enter_openat event fields: Ok > 14: struct perf_event_attr setup : Ok > 15: Test matching and linking multiple hists : Ok > 16: Try 'import perf' in python, checking link problems : Ok > 17: Test breakpoint overflow signal handler : Ok > 18: Test breakpoint overflow sampling : Ok > 19: Test number of exit event of a simple workload : Ok > 20: Test software clock events have valid period values : Ok > 21: Test object code reading : Ok > 22: Test sample parsing : Ok > 23: Test using a dummy software event to keep tracking : Ok > 24: Test parsing with no sample_id_all bit set : Ok > 25: Test filtering hist entries : Ok > 26: Test mmap thread lookup : Ok > 27: Test thread mg sharing : Ok > 28: Test output sorting of hist entries : Ok > 29: Test cumulation of child hist entries : Ok > 30: Test tracking with sched_switch : Ok > 31: Filter fds with revents mask in a fdarray : Ok > 32: Add fd to a fdarray, making it autogrow : Ok > 33: Test kmod_path__parse function : Ok > 34: Test thread map : Ok > 35: Test LLVM searching and compiling : > 35.1: Basic BPF llvm compiling test : Ok > 35.2: Test kbuild searching : Ok > 35.3: Compile source for BPF prologue generation test : Ok > 35.4: Compile source for BPF relocation test : Ok > 36: Test topology in session : Ok > 37: Test BPF filter : > 37.1: Test basic BPF filtering : Ok > 37.2: Test BPF prologue generation : Ok > 37.3: Test BPF relocation checker : Ok > 38: Test thread map synthesize : Ok > 39: Test cpu map synthesize : Ok > 40: Test stat config synthesize : Ok > 41: Test stat synthesize : Ok > 42: Test stat round synthesize : Ok > 43: Test attr update synthesize : Ok > 44: Test events times : Ok > 45: Test backward reading from ring buffer : Ok > 46: Test cpu map print : Ok > 47: Test SDT event probing : Ok > 48: Test is_printable_array function : Ok > 49: Test bitmap print : Ok > 50: x86 rdpmc test : Ok > 51: Test converting perf time to TSC : Ok > 52: Test dwarf unwind : Ok > 53: Test x86 instruction decoder - new instructions : Ok > 54: Test intel cqm nmi context read : Skip > # > > # dm > 1 alpine:3.4: Ok > 2 android-ndk:r12b-arm: Ok > 3 archlinux:latest: Ok > 4 centos:5: Ok > 5 centos:6: Ok > 6 centos:7: Ok > 7 debian:7: Ok > 8 debian:8: Ok > 9 debian:experimental: Ok > 10 fedora:20: Ok > 11 fedora:21: Ok > 12 fedora:22: Ok > 13 fedora:23: Ok > 14 fedora:24: Ok > 15 fedora:24-x-ARC-uClibc: Ok > 16 fedora:rawhide: Ok > 17 mageia:5: Ok > 18 opensuse:13.2: Ok > 19 opensuse:42.1: Ok > 20 opensuse:tumbleweed: Ok > 21 ubuntu:12.04.5: Ok > 22 ubuntu:14.04: Ok > 23 ubuntu:14.04.4: Ok > 24 ubuntu:15.10: Ok > 25 ubuntu:16.04: Ok > 26 ubuntu:16.04-x-arm: Ok > 27 ubuntu:16.04-x-arm64: Ok > 28 ubuntu:16.04-x-powerpc: Ok > 29 ubuntu:16.04-x-powerpc64: Ok > 30 ubuntu:16.04-x-powerpc64el: Ok > 31 ubuntu:16.04-x-s390: Ok > 32 ubuntu:16.10: Ok > # > > $ make -C tools/perf build-test > make: Entering directory '/home/acme/git/linux/tools/perf' > - tarpkg: ./tests/perf-targz-src-pkg . > make_no_libperl_O: make NO_LIBPERL=1 > make_no_demangle_O: make NO_DEMANGLE=1 > make_no_libnuma_O: make NO_LIBNUMA=1 > make_install_bin_O: make install-bin > make_install_prefix_O: make install prefix=/tmp/krava > make_util_map_o_O: make util/map.o > make_no_libdw_dwarf_unwind_O: make NO_LIBDW_DWARF_UNWIND=1 > make_no_libbpf_O: make NO_LIBBPF=1 > make_doc_O: make doc > make_util_pmu_bison_o_O: make util/pmu-bison.o > make_with_babeltrace_O: make LIBBABELTRACE=1 > make_no_ui_O: make NO_NEWT=1 NO_SLANG=1 NO_GTK2=1 > make_no_scripts_O: make NO_LIBPYTHON=1 NO_LIBPERL=1 > make_debug_O: make DEBUG=1 > make_perf_o_O: make perf.o > make_no_slang_O: make NO_SLANG=1 > make_no_newt_O: make NO_NEWT=1 > make_minimal_O: make NO_LIBPERL=1 NO_LIBPYTHON=1 NO_NEWT=1 > NO_GTK2=1 NO_DEMANGLE=1 NO_LIBELF=1 NO_LIBUNWIND=1 NO_BACKTRACE=1 > NO_LIBNUMA=1 NO_LIBAUDIT=1 NO_LIBBIONIC=1 NO_LIBDW_DWARF_UNWIND=1 > NO_AUXTRACE=1 NO_LIBBPF=1 NO_LIBCRYPTO=1 NO_SDT=1 NO_JVMTI=1 > make_install_O: make install > make_no_auxtrace_O: make NO_AUXTRACE=1 > make_no_backtrace_O: make NO_BACKTRACE=1 > make_install_prefix_slash_O: make install prefix=/tmp/krava/ > make_clean_all_O: make clean all > make_no_libpython_O: make NO_LIBPYTHON=1 > make_no_libaudit_O: make NO_LIBAUDIT=1 > make_no_libbionic_O: make NO_LIBBIONIC=1 > make_help_O: make help > make_no_libelf_O: make NO_LIBELF=1 > make_tags_O: make tags > make_no_libunwind_O: make NO_LIBUNWIND=1 > make_no_gtk2_O: make NO_GTK2=1 > make_pure_O: make > make_static_O: make LDFLAGS=-static > OK > make: Leaving directory '/home/acme/git/linux/tools/perf'
Pulled, thanks a lot Arnaldo! Ingo