BTW: -march=native automatically implies -mtune=nativeThanks, I`ll remove mtune)
It would be really interesting if you could try writing the same code in c, both a scalar version and a version using gcc's vector instrinsics, to allow us to compare performance and identify areas for D to improve.