Re: perf/x86/intel: Collecting CPU-local performance counters from all cores in parallel

2017-05-23 Thread Andi Kleen
Michael Edwards writes:
>
> Am I going about this wrong?

It seems like a reasonable optimization, but it's likely a lot of work.

> Is there some better way to pursue the
> high-level goal of gathering PMC-based statistics frequently and
> efficiently from all cores, without breaking everything e

perf/x86/intel: Collecting CPU-local performance counters from all cores in parallel

2017-05-22 Thread Michael Edwards
I'm working on a system-wide profiling tool that uses perf_event to gather CPU-local performance counters (L2/L3 cache misses, etc.) across all CPUs (hyperthreads) of a multi-socket system. We'd like for the monitoring process to run on a single core, and to be able to sample at frequent, regular
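
For readers unfamiliar with the setup described above, here is a minimal sketch of how a single monitoring process might open one CPU-wide counting event per CPU through the perf_event_open(2) syscall. It is not the code from the original post: the helper names (init_cache_miss_attr, open_counter_on_cpu, read_counter) are hypothetical, and the generic PERF_COUNT_HW_CACHE_MISSES event is assumed as a stand-in for the L2/L3 miss events mentioned. Opening CPU-wide events typically requires elevated privileges or a permissive perf_event_paranoid setting.

```c
#define _GNU_SOURCE
#include <assert.h>
#include <linux/perf_event.h>
#include <stdint.h>
#include <string.h>
#include <sys/syscall.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: describe a plain counting (non-sampling) event.
 * PERF_COUNT_HW_CACHE_MISSES is an assumed stand-in for the real events. */
static void init_cache_miss_attr(struct perf_event_attr *attr)
{
    memset(attr, 0, sizeof(*attr));
    attr->size = sizeof(*attr);
    attr->type = PERF_TYPE_HARDWARE;
    attr->config = PERF_COUNT_HW_CACHE_MISSES;
}

/* Hypothetical helper: open the event CPU-wide on one CPU.
 * pid == -1 with cpu >= 0 means "all tasks running on that CPU".
 * Returns a file descriptor, or -1 with errno set. */
static int open_counter_on_cpu(const struct perf_event_attr *attr, int cpu)
{
    assert(cpu >= 0);
    return (int)syscall(SYS_perf_event_open, attr, (pid_t)-1, cpu, -1, 0UL);
}

/* Hypothetical helper: read the current count from one event fd.
 * With the default read_format the kernel returns a single u64.
 * Returns 0 on success, -1 on failure. */
static int read_counter(int fd, uint64_t *value)
{
    ssize_t n = read(fd, value, sizeof(*value));
    return n == (ssize_t)sizeof(*value) ? 0 : -1;
}
```

A monitor would call open_counter_on_cpu() once per online CPU and then loop over the resulting descriptors with read_counter() at each sampling interval. As far as I understand the kernel's read path, each read of an event that lives on another CPU involves a cross-CPU call to that CPU, so the descriptors are serviced one after another rather than in parallel.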