On 6/11/19 8:40 AM, Christoph Hellwig wrote:
> This will allow sparc64 to override its ADI tags for
> get_user_pages and get_user_pages_fast.
>
> Signed-off-by: Christoph Hellwig
> ---
Commit message is sparc64 specific but the goal here is to allow any
architecture with memory tagging to use
Add MSM8998 GPU Clock Controller DT node.
Signed-off-by: Jeffrey Hugo
---
arch/arm64/boot/dts/qcom/msm8998.dtsi | 15 +++
1 file changed, 15 insertions(+)
diff --git a/arch/arm64/boot/dts/qcom/msm8998.dtsi
b/arch/arm64/boot/dts/qcom/msm8998.dtsi
index 574be78a936e..cf00bfeec6b3
Add of Synopsys MIPI D-PHY in RX mode support.
Separated in the implementation are platform dependent probing functions.
Signed-off-by: Luis Oliveira
---
Changelog
v3-v4
- This patch suffered from some changes, but still the D-PHY is configured
on boot-up either from dt or by pdata (check
This allows the driver loading via platform data which makes the driver
not device tree dependent.
Signed-off-by: Luis Oliveira
---
Changelog
v3-v4
- not present on v3, allows configuration using pdata
MAINTAINERS | 1 +
drivers/media/platform/dwc/dw-csi-plat.c |
Add device-tree bindings documentation for SNPS DesignWare MIPI D-PHY in
RX mode.
Signed-off-by: Luis Oliveira
---
Changelog
v3-v4
- @Laurent I know I told you I could remove the snps,dphy-frequency on V3 but
it is really useful for me here. I removed all other the proprietary
properties
Add the Synopsys MIPI CSI-2 controller driver. This
controller driver is divided in platform functions and core functions.
This way it serves as platform for future DesignWare drivers.
Signed-off-by: Luis Oliveira
---
Changelog
v3-v4
- fix v4l2_fwnode_endpoint bad initialization @eugen
- removed
This patch allows the configuration via platform data. This allows
the driver loading not devicetree dependent.
Signed-off-by: Luis Oliveira
---
Changelog
v3-v4
- This patch was not in the v3. This enables pdata to configure the driver
drivers/media/platform/dwc/dw-dphy-plat.c | 87
From: Luis Oliveira
Add bindings for Synopsys DesignWare MIPI CSI-2 host.
Signed-off-by: Luis Oliveira
---
Changelog
v3-v4
- remove "plat" from the block name @rob @laurent
- remove "phy-names" when single-entry @rob
- remove "snps,output-type" -> went to the driver config @laurent
This adds support for Synopsys MIPI CSI-2 Host and MIPI D-PHY.
The patch series include support for initialization/configuration of the
DW MIPI CSI-2 controller and the DW MIPI D-PHY and both include a reference
platform driver and the option to use platform data. The optional sysfs
interface is
The GPUCC manages the clocks for the Adreno GPU found on MSM8998.
Signed-off-by: Jeffrey Hugo
---
drivers/clk/qcom/Kconfig | 8 +
drivers/clk/qcom/Makefile| 1 +
drivers/clk/qcom/gpucc-msm8998.c | 359 +++
3 files changed, 368 insertions(+)
The Adreno GPU on MSM8998 has its own clock controller, which is a
dependency for bringing up the GPU. This series gets the gpucc all in
place as another step on the road to getting the GPU enabled.
v3:
-drop accepted DT patch
-correct "avoid" typo
-expand comment on why XO is required
v2:
On Wed, Jun 5, 2019 at 9:15 PM Hans de Goede wrote:
>
> Hi,
>
> On 28-05-19 15:37, Hans de Goede wrote:
> > Hi,
> >
> > On 24-05-19 18:43, Andy Shevchenko wrote:
> >> On Tue, May 21, 2019 at 9:28 AM Hans de Goede wrote:
> >>>
> >>> Commit 78f3ac76d9e5 ("platform/x86: asus-wmi: Tell the EC the OS
From: Mao Wenan
Date: Tue, 11 Jun 2019 09:32:13 +0800
> The variable 'status' in __packet_lookup_frame_in_block() is never used since
> introduction in commit f6fb8f100b80 ("af-packet: TPACKET_V3 flexible buffer
> implementation."), we can remove it.
>
> Signed-off-by: Mao Wenan
> ---
> v2:
On 11/06/2019 10:57:17+, Aisheng Dong wrote:
> > From: anson.hu...@nxp.com [mailto:anson.hu...@nxp.com]
> > Sent: Tuesday, June 11, 2019 2:34 PM
> >
> > i.MX system controller RTC driver can support all i.MX SoCs with system
> > controller inside, this patch makes the compatible string more
Den 11-06-2019 kl. 20:40, skrev Greg Kroah-Hartman:
On Tue, Jun 11, 2019 at 07:33:16PM +0200, Daniel Vetter wrote:
On Tue, Jun 11, 2019 at 5:37 PM Greg Kroah-Hartman
wrote:
On Tue, Jun 11, 2019 at 03:56:35PM +0200, Sven Joachim wrote:
Commit 1e07d63749 ("drm/nouveau: add kconfig option to
From: Adrian Hunter
Set up time ranges for efficient time interval filtering using the new
"fast forward" facility.
Because decoding is done in time order, intel_pt_time_filter() needs to
look only at the next start or end timestamp - refer intel_pt_next_time().
Signed-off-by: Adrian Hunter
From: Adrian Hunter
Intel PT decoding is done in time order. In order to support efficient time
interval filtering, add a facility to "fast forward" towards a particular
timestamp. That involves finding the right buffer, stepping to that buffer,
and then stepping forward PSBs. Because decoding
From: Adrian Hunter
Factor out intel_pt_8b_tsc() so it can be reused.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-6-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
Simplify perf_time__parse_for_ranges() error paths slightly.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-17-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
tools/perf/util/time-utils.c |
From: Arnaldo Carvalho de Melo
Suzuki noticed that this should be more useful in a generic header, and
after looking I noticed we have it already in our copy of
include/linux/bits.h in tools/include, so just use it, test built on
x86-64 and ubuntu 19.04 with:
perfbuilder@46646c9e848e:/$
From: Kan Liang
The existing "thread_siblings" and "thread_siblings_list" attribute will
be deprecated.
Use the new CPU topology sysfs attributes, "core_cpus" and
"core_cpus_list", which are synonymous with the deprecated attributes.
Check the new name first. If not available, use the
From: Adrian Hunter
Currently only a single explicit time range is accepted. Add support for
multiple ranges separated by spaces, which requires the string to be
quoted. Update the time utils test accordingly.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link:
From: Thomas Richter
Command
# perf test -Fv 6
fails with error
running test 100 'kvm-s390:kvm_s390_create_vm' failed to parse
event 'kvm-s390:kvm_s390_create_vm', err -1, str 'unknown tracepoint'
event syntax error: 'kvm-s390:kvm_s390_create_vm'
\___
From: Mathieu Poirier
When operating in CPU-wide trace mode with a source/sink topology of N:1
packets with multiple traceID will end up in the same cs_etm_queue. In
order to properly decode packets they need to be split in different
queues, i.e one queue per traceID.
As such add support for
From: Adrian Hunter
Test time ranges work as expected.
Committer testing:
$ perf test "time utils"
59: time utils: Ok
$ perf test -v "time utils"
59: time utils:
--- start ---
test child forked,
On Thu, May 9, 2019 at 2:15 AM Rajat Jain wrote:
> OK, NP. Just to be sure I understand,
>
> 1) Please let me know if I should send in a fix (it would be
> #include/linux/module.h and also add MODULE_LICENSE() I believe)?
> 2) Would this be lined up for next version though?
Resend a complete
From: Adrian Hunter
When the decoder gets the next trace buffer, some state is reset if the
buffer is not consecutive to the previous buffer. Add a parameter
'reposition' so that can be done also to support a "fast forward"
facility.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
From: Arnaldo Carvalho de Melo
We have $INSTALL_DIR/share/perf-core/strace/groups/string files with
syscalls that should be selected when 'string' is used, meaning, in this
case, syscalls that receive as one of its arguments a string, like a
pathname.
But those were first selected and tested on
From: Adrian Hunter
Explicit time ranges never contain a percent sign whereas percentage
ranges always do, so it is possible to call the correct parser.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-18-adrian.hun...@intel.com
From: Adrian Hunter
Instruction trace decoders can optimize output based on what time
intervals will be filtered, so pass that information in
itrace_synth_ops.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-4-adrian.hun...@intel.com
From: Adrian Hunter
Factor out set_percent_time() so it can be reused.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-14-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
tools/perf/util/time-utils.c | 39
From: Adrian Hunter
Implement the lookahead callback to let the decoder access subsequent
buffers. intel_pt_lookahead() manages the buffer lifetime and calls the
decoder for each buffer until the decoder returns a non-zero value.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link:
From: Adrian Hunter
Correct some punctuation and spelling and correct the format to show
that the time resolution is nanoseconds not microseconds.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-16-adrian.hun...@intel.com
From: Adrian Hunter
Instruction trace decoders can optimize output based on what time
intervals will be filtered, so pass that information in
itrace_synth_ops.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-3-adrian.hun...@intel.com
From: Thomas Richter
Debugging a OOM error using the TUI interface revealed this issue
on s390:
[tmricht@m83lp54 perf]$ cat /proc/kallsyms |sort
0001119b7158 B radix_tree_node_cachep
0001119b8000 B __bss_stop
0001119b8000 B _end
03ff80002850 t autofs_mount [autofs4]
From: Adrian Hunter
Add a callback function to enable the decoder to lookahead at subsequent
trace buffers. This will be used to implement a "fast forward" facility
which will be needed to support efficient time interval filtering.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link:
From: Thomas Richter
Perf report fails to display s390 specific event numbered bd000
on an x86 platform. For example on s390 this works without error:
[root@m35lp76 perf]# uname -m
s390x
[root@m35lp76 perf]# ./perf record -e rbd000 -- find / >/dev/null
[ perf record: Woken up 3 times to write
From: Adrian Hunter
Factor out intel_pt_get_buffer() so it can be reused.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-10-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
tools/perf/util/intel-pt.c | 60
From: Adrian Hunter
Factor out intel_pt_reposition() so it can be reused.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-7-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
Currently, options allow only 1 explicit (non-percentage) time range.
In preparation for adding support for multiple explicit time ranges,
treat time ranges consistently.
Instead of treating some time ranges as inclusive and some as excluding
the end time, treat all time
From: Kan Liang
The "sibling cores" actually shows the sibling CPUs of a socket. The
name "sibling cores" is very misleading.
Rename "sibling cores" to "sibling sockets"
Signed-off-by: Kan Liang
Reviewed-by: Jiri Olsa
Cc: Andi Kleen
Cc: Peter Zijlstra
Link:
From: Adrian Hunter
Prevent percentage time range overlap. This is only a 1 nanosecond
change but makes the results more logical e.g. a sample cannot be in
both the first 10% and the second 20%.
Note, there is a later patch that adds a test for time-utils.
Signed-off-by: Adrian Hunter
Cc: Jin
From: Adrian Hunter
Enhance the "All branches" and "Selected branches" reports to display IPC
information if it is available.
Committer testing:
So, testing this I noticed that it all starts with the left arrow in every
line, that should mean there is some tree there, i.e. look at all those ▶
From: Mathieu Poirier
Add handling of ITRACE events in order to add the tid/pid of the
executing process to the perf tools machine infrastructure. This
information is later retrieved when a contextID packet is found in the
trace stream.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc:
From: Arnaldo Carvalho de Melo
So that we can use it for multiple args, baby steps not to step into the
verifier toes.
In the process make sure we handle -EFAULT from bpf_prog_read_str(), as
this really is needed now that we'll handle more than one augmented
argument, i.e. if there is failure,
From: Leo Yan
On my Juno board with ARM64 CPUs, perf trace command reports the eBPF
program building failure but the command will not exit and continue to
run. If we define an eBPF event in config file, the event will be
parsed with below flow:
perf_config()
`> trace__config()
From: Adrian Hunter
Enhance the call tree to display IPC information if it is available.
Committer testing:
[acme@quaco adrian.hunter]$ python
~acme/libexec/perf-core/scripts/python/exported-sql-viewer.py
~/c/adrian.hunter/simple-retpoline.db
Reports -> Call Tree, then expand a few trees,
On Sat, Jun 08, 2019 at 11:26:55PM -0300, Mauro Carvalho Chehab wrote:
> Convert the cgroup-v1 files to ReST format, in order to
> allow a later addition to the admin-guide.
>
> The conversion is actually:
> - add blank lines and identation in order to identify paragraphs;
> - fix tables
From: Mathieu Poirier
This patch deals with timestamp packets received from the decoding
library in order to give the front end packet processing loop a handle
on the time instruction conveyed by range packets have been executed at.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc:
From: Adrian Hunter
Export cycle and instruction counts on samples and calls tables.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-18-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
Add a parameter to call graph and call tree, to determine whether IPC
information is available.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-20-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Mathieu Poirier
When working with CPU-wide traces different traceID may be found in the
same stream. As such we need to use the decoder callback that provides
the traceID in order to know the thread context being decoded.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander
From: yuzhoujian
One can just record callchains in the kernel or user space with this new
options.
We can use it together with "--all-kernel" options.
This two options is used just like print_stack(sys) or print_ustack(usr)
for systemtap.
Shown below is the usage of this new option combined
From: Mathieu Poirier
This patch adds the necessary intelligence to properly compute the value
of 'old' and 'head' when operating in snapshot mode. That way we can
get the latest information in the AUX buffer and be compatible with the
generic AUX ring buffer mechanic.
Tester notes:
> Leo,
From: Kan Liang
It is useful to aggregate counts per die. E.g. Uncore becomes die-scope
on Xeon Cascade Lake-AP.
Introduce a new option "--per-die" to support per-die aggregation.
The global id for each core has been changed to socket + die id + core
id. The global id for each die is socket +
From: Mathieu Poirier
Nowadays the synthesize code is using the packet's cpu information,
making cs_etm_queue::cpu useless. As such simply remove it.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc: Namhyung Kim
Cc: Peter Zijlstra
Cc: Suzuki
From: Mathieu Poirier
Fixing wrong indentation of the while() loop - no change of
functionality.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc: Namhyung Kim
Cc: Peter Zijlstra
Cc: Suzuki Poulouse
Cc: coresi...@lists.linaro.org
Cc:
From: Arnaldo Carvalho de Melo
So perf_config() uses:
int ret = 0;
perf_config_set__for_each_entry(config_set, section, item) {
...
ret = fn();
if (ret < 0)
break;
}
return ret;
Expecting that that break will imediatelly go to function
From: Adrian Hunter
The user probably wants to replace the find text, so select the find
text when the find bar is activated.
That is fairly standard behaviour for search text entry.
Entering text will replace the current text, but using edit keys
(arrows, home, end etc) cancels the selection
From: Kan Liang
With the new CPUID.1F, a new level type of CPU topology, 'die', is
introduced. The 'die' information in CPU topology should be added in
perf header.
To be compatible with old perf.data, the patch checks the section size
before reading the die information. The new info is added
From: Mathieu Poirier
Add handling of SWITCH-CPU-WIDE events in order to add the tid/pid of
the incoming process to the perf tools machine infrastructure. This
information is later retrieved when a contextID packet is found in the
trace stream.
Signed-off-by: Mathieu Poirier
Tested-by: Leo
From: Mathieu Poirier
Link contextID packets received from the decoder with the perf tool
thread mechanic so that we know the specifics of the process currently
executing.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc: Namhyung Kim
Cc: Peter
From: Arnaldo Carvalho de Melo
To support the SCA_FILENAME beautifier in more than one syscall arg, as
needed for syscalls such as the rename* family, we need to, after
processing one such arg, bump the augmented pointers so that the next
augmented arg don't reuse data for the previous augmented
From: Mathieu Poirier
Add support for CPU-wide trace scenarios by correlating range packets
with timestamp packets. That way range packets received on different
ETMQ/traceID channels can be processed and synthesized in chronological
order.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
From: Arnaldo Carvalho de Melo
Almost there, next step is to copy more than one filename payload.
Probably to read syscall arg structs, etc we'll need just a variation of
this that will decide what to use, if probe_read_str() or plain
probe_read for structs, i.e. fixed size.
Cc: Adrian Hunter
From: Adrian Hunter
Instruction trace decoders can optimize output based on what time
intervals will be filtered, so pass that information in
itrace_synth_ops.
Signed-off-by: Adrian Hunter
Cc: Jin Yao
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190604130017.31207-2-adrian.hun...@intel.com
From: Mathieu Poirier
There is no point in having two different error goto statement since the
openCSD API to free a decoder handles NULL pointers. As such function
cs_etm_decoder__free() can be called to deal with all aspect of freeing
decoder memory.
Signed-off-by: Mathieu Poirier
From: Leo Yan
The clang bpf cmdline template has defined default value in the file
tools/perf/util/llvm-utils.c, which has been changed for several times.
This patch updates the documentation to reflect the latest default value
for the configuration llvm.clang-bpf-cmd-template.
Signed-off-by:
Oleg Nesterov writes:
> On 06/10, Eric W. Biederman wrote:
>>
>> Personally I don't think anyone sane would intentionally depend on this
>> and I don't think there is a sufficiently reliable way to depend on this
>> by accident that people would actually be depending on it.
>
> Agreed.
>
> As I
From: Mathieu Poirier
The thread field of structure cs_etm_queue is CPU dependent and as such
need to be part of the cs_etm_traceid_queue in order to support CPU-wide
trace scenarios.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc: Namhyung Kim
From: Arnaldo Carvalho de Melo
The 'die' info isn't in the same array as core and socket ids, and we
missed the 'dies' string list, that comes right after the 'core' +
'socket' id variable length array, followed by the VLA for the dies.
Cc: Adrian Hunter
Cc: Andi Kleen
Cc: Jiri Olsa
Cc: Kan
From: Arnaldo Carvalho de Melo
Since we know what args are strings from reading the syscall
descriptions in tracefs and also already mark such args to be beautified
using the syscall_arg__scnprintf_filename() helper, all we need is to
fill in this info in the 'syscalls' BPF map we were using to
From: Jiri Olsa
We are getting false positive gcc warning when we compile with gcc9 (9.1.1):
CC jvmti/libjvmti.o
In file included from /usr/include/string.h:494,
from jvmti/libjvmti.c:5:
In function ‘strncpy’,
inlined from
From: Mathieu Poirier
When operating in CPU-wide mode tracers need to generate timestamps in
order to correlate the code being traced on one CPU with what is executed
on other CPUs.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc: Namhyung Kim
Cc:
From: Mathieu Poirier
The tid/pid fields of structure cs_etm_queue are CPU dependent and as
such need to be part of the cs_etm_traceid_queue in order to support
CPU-wide trace scenarios.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc: Namhyung Kim
From: Adrian Hunter
Enhance the call graph to display IPC information if it is available.
Committer testing:
[acme@quaco adrian.hunter]$ python
~acme/libexec/perf-core/scripts/python/exported-sql-viewer.py
~/c/adrian.hunter/simple-retpoline.db
Reports -> Context Sensitive Callgraph, then
From: Jiri Olsa
It's already setup in the only caller of this method in
perf_evsel__open(), right before calling perf_evsel__alloc_fd(), no need
to do it again.
Also it's better to have it out of the function before we move it to
libperf.
Signed-off-by: Jiri Olsa
Cc: Adrian Hunter
Cc: Jiri
From: Mathieu Poirier
When operating in CPU-wide mode being notified of contextID changes is
required so that the decoding mechanic is aware of the process context
switch.
Signed-off-by: Mathieu Poirier
Reviewed-by: Suzuki Poulouse
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
From: Adrian Hunter
Export cycle and instruction counts on samples and calls tables.
Committer testing:
First runs some workload collecting intel_pt with the 'cyc' ter just for
userspace:
[root@quaco adrian.hunter]# perf record -o simple-retpoline.perf.data -e
intel_pt/cyc/u
From: Kan Liang
There is no function to retrieve die id information of a given CPU.
Add cpu_map__get_die_id() to retrieve die id information.
Signed-off-by: Kan Liang
Reviewed-by: Jiri Olsa
Cc: Andi Kleen
Cc: Peter Zijlstra
Link:
> console=ttySC1,115200 earlycon=scif,mmio16,0xffe8 noiotrap' \
> > > -serial null -serial stdio \
> > > -net nic,model=rtl8139 -net user -nographic -monitor null
> > >
> > > Reverting this patch as well as "USB: drop HCD_LOCAL_MEM flag&qu
From: Mathieu Poirier
The decoder needs to work with more than one traceID queue if we want to
support CPU-wide scenarios with N:1 source/sink topologies. As such
move the packet buffer and related fields out of the decoder structure
and into the cs_etm_queue structure.
Signed-off-by: Mathieu
From: Mathieu Poirier
In an ideal world there is one CPU per cs_etm_queue and as such, one
trace ID per cs_etm_queue. In the real world CoreSight topologies allow
multiple CPUs to use the same sink, which translates to multiple trace
IDs per cs_etm_queue.
To deal with this a new
From: Arnaldo Carvalho de Melo
For instance, the rename* family uses "oldname", "newname", so check if
"name" is at the end and treat it as a filename.
Cc: Adrian Hunter
Cc: Jiri Olsa
Cc: Namhyung Kim
Link: https://lkml.kernel.org/n/tip-wjy7j4bk06g7atzwoz1mi...@git.kernel.org
Signed-off-by:
From: Mathieu Poirier
Ask the perf core to generate an event when processes are swapped in/out
of context. That way proper action can be taken by the decoding code
when faced with such event.
Signed-off-by: Mathieu Poirier
Tested-by: Leo Yan
Cc: Alexander Shishkin
Cc: Jiri Olsa
Cc:
From: Arnaldo Carvalho de Melo
One more step into copying multiple filenames to support syscalls like
rename*.
Cc: Adrian Hunter
Cc: Brendan Gregg
Cc: Jiri Olsa
Cc: Luis Cláudio Gonçalves
Cc: Namhyung Kim
Link: https://lkml.kernel.org/n/tip-xdqtjexdyp81oomm1rkze...@git.kernel.org
From: Adrian Hunter
Add field 'ipc' to display instructions-per-cycle.
Example:
perf record -e intel_pt/cyc/u ls
perf script --insn-trace --xed -F+ipc,-dso,-cpu,-tid
ls 2670177.697113434: 7f0dfdbcd090 _start+0x0 mov %rsp, %rdi IPC:
0.00 (1/877)
ls 2670177.697113434:
From: Adrian Hunter
To make it easier to add new code for different TIP cases, separate each
case.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-11-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
When CYC packets are not available, it is still possible to count cycles
using TSC/TMA/MTC timestamps.
As the timestamp increments in TSC ticks, convert to CPU cycles using
the current core-to-bus ratio.
Do not accumulate cycles when control flow packet generation is not
From: Adrian Hunter
Export cycle and instruction counts on samples and call-returns.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-16-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
Add counts of instructions and cycles, in order to represent
instructions-per-cycle (IPC).
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-7-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
Cycle and instruction counts are added to the stack. The IPC of a
function and all functions it calls, is also recorded.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-14-adrian.hun...@intel.com
Signed-off-by: Arnaldo
From: Adrian Hunter
Add brief documentation to explain how the database export maintains
backward and forward compatibility.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-15-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
Add brief documentation about instructions-per-cycle (IPC) information
derived from Intel PT.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-13-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de Melo
---
From: Adrian Hunter
In preparation for providing instructions-per-cycle (IPC) information,
accumulate cycle count from CYC packets.
Although CYC packets are optional (requires config term 'cyc' to enable
cycle-accurate mode when recording), the simplest way to count cycles is
with CYC packets.
From: Adrian Hunter
Copy the incremental instruction count and cycle count onto 'instructions'
and 'branches' samples.
Because Intel PT does not update the cycle count on every branch or
instruction, the incremental values will often be zero.
When there are values, they will be the number of
From: Adrian Hunter
In preparation for using MTC packets to count cycles, record whether
decoding is between a PSB and PSBEND packets.
Signed-off-by: Adrian Hunter
Cc: Jiri Olsa
Link: http://lkml.kernel.org/r/20190520113728.14389-10-adrian.hun...@intel.com
Signed-off-by: Arnaldo Carvalho de
From: Alexey Budankov
When DWARF stacks were requested and at the same time that the user
specifies a register set using the --user-regs option the full register
context was being captured on samples:
$ perf record -g --call-graph dwarf,1024 --user-regs=IP,SP,BP --
stack_test2.g.O3
From: Arnaldo Carvalho de Melo
We forgot to update the perf.data file format document for the
HEADER_CLOCKID header, do it now from comments in the patch introducing
it.
Cc: Adrian Hunter
Cc: Alexander Shishkin
Cc: Alexey Budankov
Cc: Andi Kleen
Cc: Chong Jiang
Cc: Jiri Olsa
Cc: Namhyung
+0200)
are available in the Git repository at:
git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git
tags/perf-core-for-mingo-5.3-20190611
for you to fetch changes up to 04c41bcb862bbec1fb225243ecf07a3219593f81:
perf trace: Skip unknown syscalls when expanding strace like syscall groups
301 - 400 of 1189 matches
Mail list logo