Re: SIMD under LDC

12345swordy via Digitalmars-d-learn Mon, 04 Sep 2017 18:16:01 -0700

On Monday, 4 September 2017 at 23:06:27 UTC, Nicholas Wilson wrote:

On Monday, 4 September 2017 at 20:39:11 UTC, Igor wrote:
I found that I can't use the __simd function from core.simd under LDC
Correct, LDC does not support the core.simd interface.
and that it has ldc.simd, but I couldn't find how to implement the equivalent of this with it:
ubyte16* masks = ...;
foreach (ref c; pixels) {
        c = __simd(XMM.PSHUFB, c, *masks);
}
I see it has a shufflevector function, but it only accepts constant masks and I am using a variable one. Is this possible under LDC?
You have several options:
* write a regular for loop and let LDC's optimiser take care of the rest.
alias mask_t = ReturnType!(equalMask!ubyte16);
pragma(LDC_intrinsic, "llvm.masked.load.v16i8.p0v16i8")
ubyte16 llvm_masked_load(ubyte16* val, int alignment, mask_t mask, ubyte16 fallthru);
ubyte16* masks = ...;
foreach (ref c; pixels) {
        auto mask = equalMask!ubyte16(*masks, [-1,-1,-1, ...]);
        c = llvm_masked_load(&c,16,mask, [0,0,0,0 ... ]);
}
The second one might not work, because of type differences in LLVM, but should serve as a guide to hacking the `cmpMask` IR code in ldc.simd to do what you want it to.
BTW. Shuffling channels within pixels using DMD simd is about 5 times faster than with normal code on my machine :)
Don't underestimate ldc's optimiser ;)
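
For reference, the "regular for loop" option could look roughly like the sketch below. It assumes pixels are stored as 16-byte blocks and follows the PSHUFB rule (the low 4 bits of each mask byte select a source byte, a set high bit zeroes the output byte). The names shuffleBytes and shufflePixels are made up for illustration and are not part of ldc.simd or core.simd. Whether LDC turns the loop into a single pshufb depends on the compiler version and flags, so it is worth checking the generated assembly.

```d
import std.stdio : writeln;

/// Scalar equivalent of PSHUFB for one 16-byte block (hypothetical helper).
/// Output byte i is 0 if the high bit of mask[i] is set,
/// otherwise it is src[mask[i] & 0x0F].
ubyte[16] shuffleBytes(const ubyte[16] src, const ubyte[16] mask)
    pure nothrow @nogc @safe
{
    ubyte[16] result; // default-initialised to all zeroes
    foreach (i; 0 .. 16)
    {
        const m = mask[i];
        if (m & 0x80)
            result[i] = 0;
        else
            result[i] = src[m & 0x0F];
    }
    return result;
}

/// Applies the same variable mask to every block (hypothetical helper).
void shufflePixels(ubyte[16][] pixels, const ubyte[16] mask)
    pure nothrow @nogc @safe
{
    foreach (ref c; pixels)
        c = shuffleBytes(c, mask);
}

void main()
{
    ubyte[16] block = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15];
    // Swap the first and third channel of each 4-byte pixel.
    ubyte[16] mask = [2, 1, 0, 3, 6, 5, 4, 7, 10, 9, 8, 11, 14, 13, 12, 15];

    ubyte[16][2] pixels = [block, block];
    shufflePixels(pixels[], mask);
    writeln(pixels[0]);
}
```

Because the mask is an ordinary runtime value here, this sidesteps the constant-mask restriction of shufflevector, at the cost of relying on the optimiser for speed.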

I have seen cases where the compiler fails to optimize for SIMD.
