Here is the first stab at mul_basecase. It's basically just an inlined version of
mpn_zero(rp,xn);
rp[xn]=mpn_addmul_1(rp,xp,xn,yp[0]);
rp++;yp++;yn--;
while(yn>=1)
{rp[xn]=mpn_addmul_1(rp,xp,xn,yp[0]);
rp++;yp++;yn--;}
For the timings in the attached result file, I skipped the mpn_
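
For anyone who wants to check the idea without the assembler, the loop above can be sketched as standalone C. This is only an illustration, not the inlined code being timed: the names sketch_addmul_1 and sketch_mul_basecase and the 32-bit limb_t are assumptions made so the example compiles without GMP/MPIR headers, and the zeroing step is folded into a single loop over all limbs of y.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical 32-bit limb so that a 64-bit integer can hold a full product.
   The real mp_limb_t is normally the native word size. */
typedef uint32_t limb_t;

/* rp[0..n-1] += xp[0..n-1] * y; returns the carry-out limb. */
static limb_t sketch_addmul_1(limb_t *rp, const limb_t *xp, size_t n, limb_t y)
{
    limb_t carry = 0;
    for (size_t i = 0; i < n; i++) {
        /* A 32x32 product plus two 32-bit addends always fits in 64 bits. */
        uint64_t t = (uint64_t)xp[i] * y + rp[i] + carry;
        rp[i] = (limb_t)t;
        carry = (limb_t)(t >> 32);
    }
    return carry;
}

/* rp[0..xn+yn-1] = xp[0..xn-1] * yp[0..yn-1]; rp must not overlap the inputs. */
static void sketch_mul_basecase(limb_t *rp, const limb_t *xp, size_t xn,
                                const limb_t *yp, size_t yn)
{
    assert(xn >= 1 && yn >= 1);
    for (size_t i = 0; i < xn; i++)   /* zero the low xn limbs of the result */
        rp[i] = 0;
    for (size_t j = 0; j < yn; j++)   /* one addmul pass per limb of y */
        rp[j + xn] = sketch_addmul_1(rp + j, xp, xn, yp[j]);
}
```

Each pass writes its carry into a limb that no earlier pass has touched, which is why only the low xn limbs need zeroing up front.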
On Nov 28, 5:21 pm, [EMAIL PROTECTED] wrote:
> .align 16
> mpn_add_n:
> mov $0,carryreg
> .align 16
> mpn_add_nc:
> here is the usual stuff
>
> Here we either align it and have to skip over 16 bytes (which costs at least 1
> extra cycle), or we don't bother with the alignment for the _nc version, and
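
The relationship between the two entry points in the quoted fragment can be sketched in C: the plain add is just the carry-in variant with the carry register preset to zero. This is a sketch under assumptions, not the assembler being discussed; the names sketch_add_n and sketch_add_nc and the 32-bit limb_t are hypothetical, and C cannot express the fall-through or alignment trade-off itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical 32-bit limb so the carry can be read from a 64-bit sum. */
typedef uint32_t limb_t;

/* rp[0..n-1] = xp[0..n-1] + yp[0..n-1] + carry, where carry is 0 or 1.
   Returns the carry out of the top limb. */
static limb_t sketch_add_nc(limb_t *rp, const limb_t *xp, const limb_t *yp,
                            size_t n, limb_t carry)
{
    assert(carry <= 1);
    for (size_t i = 0; i < n; i++) {
        uint64_t t = (uint64_t)xp[i] + yp[i] + carry;
        rp[i] = (limb_t)t;
        carry = (limb_t)(t >> 32);
    }
    return carry;
}

/* Same operation with no carry-in: the C analogue of clearing the carry
   register and then running the _nc body. */
static limb_t sketch_add_n(limb_t *rp, const limb_t *xp, const limb_t *yp,
                           size_t n)
{
    return sketch_add_nc(rp, xp, yp, n, 0);
}
```

In the assembler version the extra entry point costs either the padding that has to be skipped or an unaligned _nc label, which is the choice the quoted text is weighing.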
On Nov 29, 1:28 am, "Bill Hart" <[EMAIL PROTECTED]> wrote:
> It's true these are not used much. But at this point we definitely
> don't want to break strict compatibility with GMP.
>
> I don't have an opinion on the other options though. Perhaps someone
> else has something meaningful...
>
> Bil
On Nov 29, 8:00 am, Cactus <[EMAIL PROTECTED]> wrote:
> On Nov 29, 1:28 am, "Bill Hart" <[EMAIL PROTECTED]> wrote:
>
>
>
> > It's true these are not used much. But at this point we definitely
> > don't want to break strict compatibility with GMP.
>
> > I don't have an opinion on the other option