There isn't that much code in numpy that profits from modern x86
instruction sets; even the simple arithmetic loops are strided and thus
unvectorizable by the compiler. They have been vectorized manually in
1.8 using SSE2, and it is on my todo list to add runtime-detected AVX
support.
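
To illustrate the point about strided loops, here is a rough pure-Python model of what such an inner loop looks like. It is only a sketch, not numpy's actual code: the names strided_add and is_contiguous are made up for this example, and the real loops are written in C. The idea is that the byte steps are only known at run time, so the compiler cannot assume adjacent elements, and a hand-written fast path has to check for the contiguous case itself.

```python
import struct
from array import array

ITEM = struct.calcsize("d")  # size of one C double in bytes


def strided_add(a, a_step, b, b_step, out, out_step, n):
    """Model of a generic strided inner loop (hypothetical helper).

    The steps are in bytes and are only known at run time, which is
    what keeps a compiler from vectorizing the equivalent C loop.
    """
    for i in range(n):
        x = struct.unpack_from("d", a, i * a_step)[0]
        y = struct.unpack_from("d", b, i * b_step)[0]
        struct.pack_into("d", out, i * out_step, x + y)


def is_contiguous(*steps):
    """Run-time check a manual SIMD fast path would make (hypothetical)."""
    return all(step == ITEM for step in steps)


a = array("d", [1.0, 2.0, 3.0, 4.0])
b = array("d", [10.0, 20.0, 30.0, 40.0])
out = array("d", [0.0, 0.0])

# Read every second element of a and b: the input step is 2 * ITEM bytes,
# so the inputs are not contiguous and the generic loop has to be used.
strided_add(a, 2 * ITEM, b, 2 * ITEM, out, ITEM, 2)
print(list(out))
print(is_contiguous(2 * ITEM, 2 * ITEM, ITEM))
```

Only when all steps equal the item size can the loop be handed to a vectorized code path; otherwise it falls back to the element-by-element version above.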
On 26.11.201
Have you tried on an Intel CPU? I have both an i5 quad core and an i7 octo
core where I could run it over the weekend. One might expect some compiler
magic taking advantage of the advanced features, especially on the i7.
/David
On Nov 25, 2013 8:16 PM, "Julian Taylor"
wrote:
On 25.11.2013 02:32, Yaroslav Halchenko wrote:
>
> On Tue, 15 Oct 2013, Nathaniel Smith wrote:
>> What do you have to lose?
>
>>> btw -- fresh results are here http://yarikoptic.github.io/numpy-vbench/ .
>
>>> I have tuned benchmarking so it now reflects the best performance across
>>> multiple