subject:"perf measure for stalled cycles per instruction on newer Intel processors"

perf measure for stalled cycles per instruction on newer Intel processors

2020-10-15 Thread Or Gerlitz

Hi, Earlier Intel processors (e.g E5-2650) support the more of classical two stall events (for backend and frontend [1]) and then perf shows the nice measure of stalled cycles per instruction - e.g here where we have IPC of 0.91 and CSPI (see [2]) of 0.68: 9,568,273,970 cycles

Re: perf measure for stalled cycles per instruction on newer Intel processors

2020-10-15 Thread Andi Kleen

On Thu, Oct 15, 2020 at 05:53:40PM +0300, Or Gerlitz wrote: > Hi, > > Earlier Intel processors (e.g E5-2650) support the more of classical > two stall events (for backend and frontend [1]) and then perf shows > the nice measure of stalled cycles per instruction - e.g here where we > have IPC of 0.

Re: perf measure for stalled cycles per instruction on newer Intel processors

2020-10-18 Thread Or Gerlitz

On Thu, Oct 15, 2020 at 9:33 PM Andi Kleen wrote: > On Thu, Oct 15, 2020 at 05:53:40PM +0300, Or Gerlitz wrote: > > Earlier Intel processors (e.g E5-2650) support the more of classical > > two stall events (for backend and frontend [1]) and then perf shows > > the nice measure of stalled cycles pe

Re: perf measure for stalled cycles per instruction on newer Intel processors

2020-10-18 Thread Andi Kleen

> > Don't use it. It's misleading on a out-of-order CPU because you don't > > know if it's actually limiting anything. > > > > If you want useful bottleneck data use --topdown. > > So running again, this time with the below params, I got this output > where all the right most column is colored red