Re: Any usable SIMD implementation?

Joe Duarte via Digitalmars-d Mon, 02 May 2016 02:18:54 -0700

On Saturday, 23 April 2016 at 10:40:12 UTC, Johan Engelen wrote:

On Monday, 18 April 2016 at 00:27:06 UTC, Joe Duarte wrote:
Someone else said talked about marking "Broadwell" and othergeneration names. As others have said, it's better to specifyfeatures. I wanted to chime in with a couple of additionalexamples. Intel's transactional memory acceleratinginstructions (TSX) are only available on some Broadwell partsbecause there was a bug in the original implementation(Haswell and early Broadwell) and it's disabled on most. Butthe new Broadwell server chips have it, and it's a big dealfor some DB workloads. Similarly, only some Skylake chips havethe Secure Guard instructions (SGX), which are very powerfulfor creating secure enclaves on an untrusted host.
Thanks, I've seen similar comments in LLVM code.

I have a question perhaps you can comment on?
With LLVM, it is possible to specify something like"+sse3,-sse2" (I did not test whether this actually results inSSE3 instructions being used, but no SSE2 instructions). Whatshould be returned when querying whether "sse3" feature isenabled?Should __traits(targetHasFeature, "sse3") == true mean thatimplied features (such as sse and sse2) are also available?

If you specify SSE3, you should definitely get SSE2 and plain oldSSE with it. SSE3 is a superset of SSE2 and includes all the SSE2instructions (more than 100 I think.)

I'm not sure about your syntax – I thought the hyphen meant toinclude the option, not remove it, and I haven't seen theaddition sign used for those settings. But I haven't done muchwith those optimization flags.

You wouldn't want to exclude SSE2 support because it's becomingthe bare minimum baseline for modern systems, the de facto FPunit. Windows 10 requires a CPU with SSE2, as do more and moreapplications on the archaic Unix-like platforms.

Re: Any usable SIMD implementation?

Reply via email to