https://gcc.gnu.org/bugzilla/show_bug.cgi?id=51119
--- Comment #27 from Thomas Koenig <tkoenig at gcc dot gnu.org> --- (In reply to Joost VandeVondele from comment #22) > I agree that inline should be faster, if the compiler is reasonably smart, > if the matrix dimensions are known at compile time (i.e. should be able to > generate the same kernel). I haven't checked yet. If the compiler turns out not to be reasonably smart, file a bug report :-)