Sorry for the late answer. Most of the things above are correct: when
building for a specific architecture the compiler does wonders, give or
take a few years. But as Gilles pointed out, we are seeking the best
performance across different families of processors, so we helped the
compiler a little.
I
Florent,
With such specific hardware needs, the community would not be able to
provide any guarantee (compilation or testing) about these components. Are
you planning to join our GitHub CI, or at least the MTT tester, in order to
ensure correctness of the two proposed components?
Best,
George.
On Wed, Ja