On 1/8/19 1:23 PM, Tapley, Mark via cctalk wrote: > Why so (why surprising, I mean)? Understood an unrolled loop executes > faster...
That can't always be true, can it? I'm thinking of an architecture where the instruction cache is slow to fill and multiple overlapping operations are involved and branch prediction assumes a branch taken. I'd say it was very close in that case. --Chuck