Re: Very simple SIMD programming

bearophile Wed, 24 Oct 2012 15:00:27 -0700

Manu:

The compiler would have to do some serious magic to optimisethat;flattening both sides of the if into parallel expressions, andthen applying the mask to combine...


I think it's a small amount of magic.

The simple features shown in that paper are fully focused on SIMDprogramming, so they aren't introducing things clearly notefficient.

I'm personally not in favour of SIMD constructs that areanything less than
optimal (but I appreciate I'm probably in the minority here).
(The simple benchmarks of the paper show a 5-15% performanceloss compared
to handwritten SIMD code.)
Right, as I suspected.

15% is a very small performance loss, if for the programmer thealternative is writing scalar code, that is 2 or 3 times slower:-)

The SIMD programmers that can't stand a 1% loss of performanceuse the intrinsics manually (or write in asm) and they ignore allother things.

A much larger population of system programmers wish to use modernCPUs efficiently, but they don't have time (or skill, this meanstheir programs are too much often buggy) for assembly-levelprogramming. Currently they use smart numerical C++ libraries,use modern Fortran versions, and/or write C/C++ scalar code (orFortran), add "restrict" annotations, and take a look at theproduced asm hoping the modern compiler back-ends will vectorizeit. This is not good enough, and it's far from a 15% loss.

This paper shows a third way, making such kind of programmingsimpler and approachable for a wider audience, with a smallperformance loss compared to handwritten code. This is whatlanguage designers do since 60+ years :-)


Bye,
bearophile

Re: Very simple SIMD programming

Reply via email to