Paolo Bonzini wrote:
The primary icc/icl use of SSE/SSE2 masking operations, of course, is in the auto-vectorization of fabs[f] and conditional operations:I'm curious, does ICC support vector arithmetic like this?
sum = 0.f;
i__2 = *n;
for (i__ = 1; i__ <= i__2; ++i__)
if (a[i__] > 0.f)
sum += a[i__];
.... (Windows/intel asm syntax)
pxor xmm2, xmm2
cmpltps xmm2, xmm3
andps xmm3, xmm2
addps xmm0, xmm3
...
