[Numpy-discussion] Re: Introducing Arm Optimized Routines

2023-02-08 Thread Chris Sidebottom
Hello again :-) Just as an update for the list, the first PR has now been raised to integrate Optimized Routines, demonstrating the performance improvements (sometimes 2x faster): https://github.com/numpy/numpy/pull/23171 Once we've achieved the initial milestone of getting these routines inte

[Numpy-discussion] Re: Introducing Arm Optimized Routines

2022-11-09 Thread Matti Picus
On 8/11/22 19:45, Chris Sidebottom wrote: Hi Matti, Thanks for your questions :-) This seems like it would improve performance on aarch64. Would the routines also work with the Apple silicon? Yip, I can't see a reason why that wouldn't be the case. If these are new routines, it would be

[Numpy-discussion] Re: Introducing Arm Optimized Routines

2022-11-09 Thread Chris Sidebottom
Hi Matti, Thanks for your questions :-) > This seems like it would improve performance on aarch64. Would the routines > also work with the Apple silicon? Yip, I can't see a reason why that wouldn't be the case. > If these are new routines, it would be better to implement them in terms of > th

[Numpy-discussion] Re: Introducing Arm Optimized Routines

2022-11-08 Thread Matti Picus
Thanks, this seems like it would improve performance on aarch64. Would the routines also work with the Apple silicon arm64? If these are new routines, it would be better to implement them in terms of the numpy universal intrinsics rather than adding a new submodule. Matti On 8/11/22 13:30,