I figured out why 8 tables is optimal on this machine. The L1 cache is
128kb and at 2048x2048, 8 Gray tables with k = 5 fits exactly in half
the cache, which is probably optimal.

Bill.

On 19 May, 02:00, Martin Albrecht <[EMAIL PROTECTED]>
wrote:
> On Monday 19 May 2008, Bill Hart wrote:
>
>
>
> > Well, apparently there are speed gains right up to 8 Gray tables,
> > though I really have no idea why that is.
>
> > Here are the new times:
>
> > Magma Old New
>
> > 10000x10000:
> >    2.940s 3.13s 2.32s
>
> > 16384x16384:
> >    9.250s 12.96s 9.17s
>
> > 20000x20000:
> >    16.57s 22.43s 16.49s
>
> > 32000x32000:
> >    59.1s 90.2s 81.6s 65.7s
>
> > So now we beat Magma all the way up to 20000x20000.
>
> > I'll put the adjusted code in the directory:
>
> >http://sage.math.washington.edu/home/wbhart/m4ri3gray/
>
> > in just a moment. I also found a memory leak in my code which I fixed.
>
> > Bill.
>
> Cool, I'll take a look tomorrow. It seems we're closing in on the classical
> algorithm though. Although, you don't seem to decrease k anymore.
>
> As for the copying and 2^14 vs. 10^4: Maybe the extra rows/columns cost too
> many cycles. I'll try valgrind/callgrind/cachegrind on the code to see if
> that is true. Also maybe for 2^14 x 2^14 the submatrices are nicely aligned
> on cache lines. I'm widely speculating here.
>
> Martin
>
> --
> name: Martin Albrecht
> _pgp:http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x8EF0DC99
> _www:http://www.informatik.uni-bremen.de/~malb
> _jab: [EMAIL PROTECTED]
--~--~---------~--~----~------------~-------~--~----~
To post to this group, send email to sage-devel@googlegroups.com
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at http://groups.google.com/group/sage-devel
URLs: http://www.sagemath.org
-~----------~----~----~----~------~----~------~--~---

Reply via email to