On 19/5/22 16:50, kawakam...@fujitsu.com wrote:

Hello,

I am working on speeding up NumPy with the AArch64 SVE instruction set.
I could not find a numpy implementation for SVE. Is there already a test 
implementation or discussion about SVE support?

OpenBLAS apparently has SVE support from 0.3.20 [0], which was merged to NumPy yesterday.


So far we have only support for NEON and ASIMD in NumPy, see [1] for a description of the way we use intrinsics. Contributions to improve the implementations and the documentation would be welcome. We currently use travis CI to run our aarch64 tests, if there was a way to get access to more advanced machines that also would be good. Do you know of commercially available machines with SVE or SVE2 support?


Matti


[0] https://github.com/xianyi/OpenBLAS/blob/faf58d2b3ffb20fd334cab080700be564ef74ec2/Changelog.txt#L28

[1] https://numpy.org/devdocs/reference/simd/build-options.html



_______________________________________________
NumPy-Discussion mailing list -- numpy-discussion@python.org
To unsubscribe send an email to numpy-discussion-le...@python.org
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: arch...@mail-archive.com

Reply via email to