Federico Beffa writes:

> Out of curiosity, could you outline how OpenBLAS is optimized for a
> specific CPU architecture while being compiled on a different CPU (and
> hence allowing to be substituted)?

The Quick Install instructions[1] say that when OpenBLAS is compiled
with DYNAMIC_ARCH=1

   "All kernel will be included in the library and dynamically switched
    the best architecutre at run time."

It seems that, unlike ATLAS, OpenBLAS does not perform any self-tuning
at build time but relies on hand-optimised kernels (e.g. ones using
CPU-specific instructions).
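
To illustrate the general idea of run-time dispatch, here is a minimal
sketch in Python. It is not OpenBLAS code: the function names, the
"avx2" feature label and the kernel table are all hypothetical, and the
real library detects the CPU itself instead of taking a feature set as
an argument. The point is only that every kernel variant ships in the
same binary and one of them is picked when the program runs.

```python
# Illustrative sketch only; none of these names come from OpenBLAS.

def dot_generic(x, y):
    """Portable fallback kernel: plain loop, works on any CPU."""
    total = 0.0
    for a, b in zip(x, y):
        total += a * b
    return total


def dot_unrolled(x, y):
    """Stand-in for a hand-optimised kernel (here: 4-way unrolled loop)."""
    n = len(x)
    main = n - n % 4  # largest multiple of 4 not exceeding n
    total = 0.0
    for i in range(0, main, 4):
        total += (x[i] * y[i] + x[i + 1] * y[i + 1]
                  + x[i + 2] * y[i + 2] + x[i + 3] * y[i + 3])
    for i in range(main, n):  # leftover elements
        total += x[i] * y[i]
    return total


# All variants are present in the "library", ordered from most to least
# specialised. The feature labels are assumptions made for this example.
KERNELS = [
    ("avx2", dot_unrolled),
    ("generic", dot_generic),
]


def select_kernel(cpu_features):
    """Return the first kernel whose required feature the CPU reports."""
    for feature, kernel in KERNELS:
        if feature == "generic" or feature in cpu_features:
            return kernel
    return dot_generic


if __name__ == "__main__":
    # The feature set would come from CPU detection at start-up.
    kernel = select_kernel({"sse2", "avx2"})
    print(kernel.__name__, kernel([1, 2, 3, 4, 5], [1, 1, 1, 1, 1]))
```

Because the choice is made at run time, the machine that builds the
binary does not need to match the machine that runs it, which is what
makes the build substitutable.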

~~ Ricardo

[1]: https://github.com/xianyi/OpenBLAS/blob/80bf3e6a3525f558a9fde2514622313c991c091f/GotoBLAS_02QuickInstall.txt#L67
