On Friday, 30 October 2015 at 21:29:47 UTC, Iakh wrote:
...

I got it to 1.5 times the running time of the C version using SSE2, but I couldn't get GDC to emit the correct aligned loads. When I used __builtin_assume_aligned, the optimizer started going really wrong.
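
For anyone not familiar with the builtin, here is roughly how it is normally used. This is only a minimal sketch in C (not the D code from this thread), assuming GCC or Clang. The function name sum_aligned16 and the float-summing loop are made up for illustration. Whether the compiler then emits aligned load instructions depends on the compiler version and flags.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not from the thread: sums n floats from a buffer
   that the caller guarantees is 16-byte aligned. */
float sum_aligned16(const float *data, size_t n)
{
    /* Debug-time check: promising an alignment that does not hold
       is undefined behaviour. */
    assert(((uintptr_t)data % 16) == 0);

    /* GCC/Clang builtin: returns the same pointer value, but lets the
       optimizer assume it is aligned to 16 bytes. */
    const float *p = __builtin_assume_aligned(data, 16);

    float total = 0.0f;
    for (size_t i = 0; i < n; ++i)
        total += p[i];
    return total;
}
```

The builtin changes nothing at run time. It is only a promise to the optimizer, so the buffer has to come from something that really guarantees the alignment, for example an array declared with _Alignas(16).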
