On Friday 22 July 2011 17:39:55 Jason wrote:
> On Friday 22 July 2011 12:36:21 Jason wrote:
> > On Friday 22 July 2011 12:25:42 Bill Hart wrote:
> > > That's fantastic! Thanks for all your hard work on these Jason.
> > > 
> > > Bill.
> > > 
> > > On 22 July 2011 11:13, jason <ja...@njkfrudils.plus.com> wrote:
> > > > Hi
> > > > 
> > > > New assembler for the nehalem mpn_addadd, mpn_addsub and mpn_subadd.
> > > > They used to run at 3.5c/l and now run at 3.0c/l, which is optimal.
> > > > I'll check out how they run on the other Intel chips.
> > 
> > It's also an improvement on core2/penryn and nearly optimal there; it
> > probably just needs a shuffle. Sandybridge doesn't benefit though.
> > 
> > > > Note mpn_subadd really should be called mpn_subsub; I'll change it
> > > > later.
> > > > 
> > > > Jason
> > > > 
> > > > --
> > > > You received this message because you are subscribed to the Google
> > > > Groups "mpir-devel" group. To post to this group, send email to
> > > > mpir-devel@googlegroups.com. To unsubscribe from this group, send
> > > > email to mpir-devel+unsubscr...@googlegroups.com. For more options,
> > > > visit this group at http://groups.google.com/group/mpir-devel?hl=en.
> 
> I tried to do an mpn_addaddadd, i.e. x=y+z+u+v, but on Intel chips the
> scheduler can't really cope with it. Also, with 5 pointers you get so many
> L1 data cache bank conflicts that the code runs at different speeds
> depending on the relative differences between the pointers mod 64. But if
> we are mainly using it for toom then we could perhaps guarantee the
> relative differences. On the AMD chips I have a strange problem with my
> optimizer where it reports silly numbers for some functions, i.e. sumdiff
> and addadd. I have no idea why it's happening. I even reverted to an
> earlier svn version where I found the original fast addadd code, but it
> still gave silly figures, yet karaadd was fine?
> 
> Jason
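
For readers following the thread, here is a minimal portable C sketch of
what an addadd-style routine computes: a three-operand add, t = x + y + z,
done in one pass over the limbs. It is only a reference model and not the
assembler discussed above. The name ref_addadd_n, the 64-bit limb type and
the argument order are assumptions made for illustration and may not match
the MPIR prototypes.

```c
#include <stdint.h>
#include <stddef.h>

/* Hypothetical reference routine (not the MPIR assembler or its API):
   computes t = x + y + z over n 64-bit limbs, least significant limb
   first, and returns the carry out of the top limb (0, 1 or 2). */
static uint64_t ref_addadd_n(uint64_t *t, const uint64_t *x,
                             const uint64_t *y, const uint64_t *z, size_t n)
{
    uint64_t carry = 0;                /* carry into the current limb, 0..2 */
    for (size_t i = 0; i < n; i++) {
        uint64_t s = x[i] + y[i];
        uint64_t c = s < x[i];         /* wrap-around from x + y */
        uint64_t r = s + z[i];
        c += r < s;                    /* wrap-around from adding z */
        uint64_t out = r + carry;
        c += out < r;                  /* wrap-around from the incoming carry */
        t[i] = out;
        carry = c;                     /* x+y+z+carry < 3*2^64, so c <= 2 */
    }
    return carry;
}
```

Fusing the two additions means each output limb is written once instead of
storing an intermediate sum and reading it back, which is presumably where
the saving over two separate add passes comes from.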

New mpn_sumdiff for the nehalem. We didn't have one before, so we can say it
would have run at 4.0c/w, but now it runs at 3.6c/w (lost a little bit with
the feed-in). This code also benefits the core2 but not penryn or sandybridge;
probably just another trivial shuffle needed :)

Jason
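
As a point of reference, a sumdiff routine produces both x + y and x - y in
a single pass, so each input limb is loaded only once. The portable C sketch
below models that behaviour. It is not the nehalem assembler. The name
ref_sumdiff_n, the 64-bit limb type, the argument order and the
2*carry + borrow return encoding are assumptions made for illustration.

```c
#include <stdint.h>
#include <stddef.h>

/* Hypothetical reference routine (not the MPIR assembler or its API):
   computes s = x + y and d = x - y over n 64-bit limbs, least
   significant limb first. Returns 2*carry + borrow, an encoding
   assumed here for illustration. */
static unsigned ref_sumdiff_n(uint64_t *s, uint64_t *d,
                              const uint64_t *x, const uint64_t *y, size_t n)
{
    uint64_t cy = 0, bw = 0;           /* running carry and borrow, 0 or 1 */
    for (size_t i = 0; i < n; i++) {
        uint64_t a = x[i], b = y[i];   /* each input limb is read once */

        uint64_t sum = a + b;
        uint64_t c1 = sum < a;         /* wrap-around from a + b */
        uint64_t sum2 = sum + cy;
        uint64_t c2 = sum2 < sum;      /* wrap-around from the carry in */

        uint64_t diff = a - b;
        uint64_t b1 = a < b;           /* borrow from a - b */
        uint64_t diff2 = diff - bw;
        uint64_t b2 = diff < bw;       /* borrow from the borrow in */

        s[i] = sum2;
        d[i] = diff2;
        cy = c1 | c2;                  /* c1 and c2 are never both set */
        bw = b1 | b2;                  /* b1 and b2 are never both set */
    }
    return (unsigned)(2 * cy + bw);
}
```

Loading inputs into locals before storing also keeps the sketch correct when
an output aliases an input at the same index, e.g. s == x.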
