I've been doing some more work on the FFT. I found a way to get around
writing more new butterflies for the FFT. So now I have the FFT
working for power of two lengths.

The times are quite good but can be improved yet. Firstly MPIR just
uses mpn_mul_n to do pointwise mults in the FFT range. I'm just using
that at present because I don't have an optimised Nussbaumer
convolution. But that can be improved considerably. Secondly I have
not optimised the first layer of the FFT and the last layer of the FFT
for cache friendliness.

Anyhow, even with these two problems it is still beating MPIR right up
to 4Gbit integers.

More to do yet.

Bill.

-- 
You received this message because you are subscribed to the Google Groups 
"mpir-devel" group.
To post to this group, send email to mpir-devel@googlegroups.com.
To unsubscribe from this group, send email to 
mpir-devel+unsubscr...@googlegroups.com.
For more options, visit this group at 
http://groups.google.com/group/mpir-devel?hl=en.

Reply via email to