Richard Mills <richardtmi...@gmail.com> writes:
> My guess is that, with the Intel compiler, at least, we are better off
> letting it do the unrolling.  I'm not sure about other compilers out there.

GCC and Clang seem to generate decent code so long as loop bounds are
known statically, though they are less aggressive about vectorizing than
Intel (and typically produce significantly smaller binaries as a
result).  I've had a hell of a time getting XL compilers to do anything
without nonsensical shenanigans.

Attachment: signature.asc
Description: PGP signature

Reply via email to