On Monday, 2 March 2020 at 13:35:15 UTC, p.shkadzko wrote:
[snip]
Thanks. I don't have time right now to review this thoroughly. My recollection is that the dot product of two matrices is actually matrix multiplication, correct? It generally makes sense to defer to other people's implementation of this. I recommend trying lubeck's version against numpy. It uses a blas/lapack implementation. mir-glas, I believe, also has a version.
Also, I'm not sure if the fastmath attribute would do anything here, but something worth looking into.