On Tuesday, 24 September 2019 at 07:41:35 UTC, Martin
Tschierschke wrote:
Thank you, I found this too, but it is more an example of the
principle, but what is the use case?
It is only useful if the instruction set of the compiling
computer differ from target
hardware and by this you get
using host processor instruction set
???
If you don't want to ship 10 fine-tuned binaries for 10 different
CPUs (see `-mcpu=?`), you can use JIT to compile and tune
performance-critical pieces for the executing/target CPU. E.g.,
letting the auto-vectorizer exploit the full register width for
AVX-512 CPUs etc.