On 19/5/22 16:50, kawakam...@fujitsu.com wrote:
Hello,
I am working on speeding up NumPy with the AArch64 SVE instruction set.
I could not find a numpy implementation for SVE. Is there already a test
implementation or discussion about SVE support?
OpenBLAS apparently has SVE support from 0.3.20 [0], which was merged to
NumPy yesterday.
So far we have only support for NEON and ASIMD in NumPy, see [1] for a
description of the way we use intrinsics. Contributions to improve the
implementations and the documentation would be welcome. We currently use
travis CI to run our aarch64 tests, if there was a way to get access to
more advanced machines that also would be good. Do you know of
commercially available machines with SVE or SVE2 support?
Matti
[0]
https://github.com/xianyi/OpenBLAS/blob/faf58d2b3ffb20fd334cab080700be564ef74ec2/Changelog.txt#L28
[1] https://numpy.org/devdocs/reference/simd/build-options.html
_______________________________________________
NumPy-Discussion mailing list -- numpy-discussion@python.org
To unsubscribe send an email to numpy-discussion-le...@python.org
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: arch...@mail-archive.com