Performance differences between SystemML LibMatrixMult and Breeze with native BLAS

fschueler Wed, 30 Nov 2016 14:54:38 -0800

Hi all,

I have run a very quick comparison between SystemML's LibMatrixMult andBreeze matrix multiplication using native BLAS (OpenBLAS throughnetlib-java). As per my very small comparison I get the result thatthere is a performance difference for dense-dense Matrices of size 1000x 1000 (our default blocksize) with Breeze being about 5-6 times fasterhere. The code I used can be found here:https://github.com/fschueler/incubator-systemml/blob/model_types/src/test/scala/org/apache/sysml/api/linalg/layout/local/SystemMLLocalBackendTest.scala

Running this code with 50 iterations each gives me for example averagetimes of:

Breeze:         49.74 ms
SystemML:   363.44 ms

I don't want to say this is true for every operation, but those resultslet us form the hypothesis that native BLAS operations can lead to asignificant speedup for certain operations which is worth testing withmore advanced benchmarks.

Btw: I am definitely not saying we should use Breeze here. I am morelooking at native BLAS and LAPACK implementations in general (asprovided by OpenBLAS, MKL, etc.).


Let me know what you think!
Felix

Performance differences between SystemML LibMatrixMult and Breeze with native BLAS

Reply via email to