Re: SSE2 basecase multiplication

2013-12-07 Thread Torbjorn Granlund
Vasili Burdo writes: I implemented basecase multiplication and squaring for x86 using SSE2 instructions and the Comba column-wise multiplication method. On Ivy Bridge (Intel Core i7 3517U), multiplication is 10-20% faster than the present GMP basecase MMX multiplication. Squaring is 5-10% faster th

SSE2 basecase multiplication

2013-12-05 Thread Vasili Burdo
Hi. I implemented basecase multiplication and squaring for x86 using SSE2 instructions and the Comba column-wise multiplication method. On Ivy Bridge (Intel Core i7 3517U), multiplication is 10-20% faster than the present GMP basecase MMX multiplication. Squaring is 5-10% faster than the GMP MMX version. However,
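
For readers unfamiliar with the Comba method mentioned above, here is a minimal sketch of column-wise multiplication in portable scalar C. It is not the SSE2 code from this thread and not GMP's mpn_mul_basecase; the function name comba_mul, the 32-bit limb size, and the little-endian limb order are assumptions chosen for illustration. Each output limb is produced by summing all partial products a[i]*b[k-i] of one column before anything is stored, so every result limb is written exactly once.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper for illustration only (not a GMP function).
 * Computes rp[0 .. an+bn-1] = ap[0 .. an-1] * bp[0 .. bn-1].
 * Limbs are 32 bits, least significant limb first; an >= 1, bn >= 1.
 * rp must not overlap ap or bp. */
static void comba_mul(uint32_t *rp,
                      const uint32_t *ap, size_t an,
                      const uint32_t *bp, size_t bn)
{
    uint64_t acc = 0;  /* low 64 bits of the running column sum   */
    uint32_t hi = 0;   /* counts overflows of acc past 64 bits    */

    /* Column k collects every product ap[i] * bp[j] with i + j == k. */
    for (size_t k = 0; k + 1 < an + bn; k++) {
        size_t first = (k >= bn) ? k - bn + 1 : 0;
        size_t last  = (k < an) ? k : an - 1;

        for (size_t i = first; i <= last; i++) {
            uint64_t p = (uint64_t)ap[i] * bp[k - i];
            acc += p;
            if (acc < p)   /* unsigned wrap-around: record the carry */
                hi++;
        }

        rp[k] = (uint32_t)acc;                      /* store finished limb */
        acc = (acc >> 32) | ((uint64_t)hi << 32);   /* shift sum one limb  */
        hi = 0;
    }

    /* What remains is the top limb of the product. */
    rp[an + bn - 1] = (uint32_t)acc;
}
```

The three-word accumulator (acc plus hi) is what lets a whole column be summed without intermediate stores; an SSE2 version would presumably compute the 32x32->64 products with PMULUDQ, but the details of the posted code are not shown in this listing.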