If you’re using a typical consumer computer, you’ll get limitations of memory
bandwidth, which it seems will limit simple calculations on large amounts of
data. It would probably be worth ensuring one’s installation of PDL is compiled
with -O3 just in case; -O2 (the usual default) enables
This thread is interesting.
I was wondering if anyone has ever seen speedups of 2x or better with
PDL_AUTOPTHREAD_TARG > 2? I find it tends to max out at around 1.5-1.7x
whatever I set.
I know about overhead etc. but kind of feel for some of the basic stuff (e.g.
A=B*C for large arrays with