> I think a better idea would be to explicitly force all matrices and all rows to be 128-bit aligned if the matrices are wide enough to benefit from SSE2. Then the combine function can always use SSE2 and there will be no need to check for alignment.
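For reference, here is a minimal sketch of what the quoted proposal might look like. It is not the actual code under discussion: the `aligned_mat` struct and the `aligned_mat_init` and `row_combine` names are hypothetical, and it assumes that "combine" means a word-wise XOR of one row into another (as for matrices over GF(2)). It also pads every matrix, whereas the proposal would only do so for matrices wide enough to benefit.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>
#ifdef __SSE2__
#include <emmintrin.h>
#endif

/* Hypothetical layout: every row starts on a 16-byte (128-bit) boundary
 * because the base pointer is 16-byte aligned and the row stride is an
 * even number of 64-bit words. */
typedef struct {
    size_t nrows;
    size_t stride;   /* 64-bit words per row, always a multiple of 2 */
    uint64_t *data;  /* 16-byte aligned, zero-initialised */
} aligned_mat;

/* Allocate an nrows x ncols bit matrix with padded, aligned rows.
 * Returns 0 on success, -1 on allocation failure. */
static int aligned_mat_init(aligned_mat *m, size_t nrows, size_t ncols)
{
    size_t words = 2 * ((ncols + 127) / 128); /* round up to 128 bits */
    if (words == 0)
        words = 2;
    size_t bytes = nrows * words * sizeof(uint64_t);
    if (bytes == 0)
        bytes = 16; /* keep the size a non-zero multiple of the alignment */
    m->data = aligned_alloc(16, bytes);
    if (m->data == NULL)
        return -1;
    memset(m->data, 0, bytes);
    m->nrows = nrows;
    m->stride = words;
    return 0;
}

/* Assumed meaning of "combine": XOR row s of src into row d of dst.
 * Because every row is 16-byte aligned, the aligned load/store
 * intrinsics can be used without any run-time alignment check. */
static void row_combine(aligned_mat *dst, size_t d,
                        const aligned_mat *src, size_t s)
{
    assert(dst->stride == src->stride);
    uint64_t *a = dst->data + d * dst->stride;
    const uint64_t *b = src->data + s * src->stride;
#ifdef __SSE2__
    for (size_t i = 0; i < dst->stride; i += 2) {
        __m128i x = _mm_load_si128((const __m128i *)(a + i));
        __m128i y = _mm_load_si128((const __m128i *)(b + i));
        _mm_store_si128((__m128i *)(a + i), _mm_xor_si128(x, y));
    }
#else
    /* Portable fallback when SSE2 is not available at compile time. */
    for (size_t i = 0; i < dst->stride; i++)
        a[i] ^= b[i];
#endif
}
```

The cost of this layout is up to 127 padding bits per row. A window into such a matrix only keeps the guarantee if it starts at a column offset that is a multiple of 128.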
That doesn't seem to make a noticeable difference for me (on C2D). However, I realised that the multiplications where the target matrix is a real matrix rather than a window (which has bad data locality) are faster. Copying everything over doesn't seem like a good idea, but it at least indicates an area for improvement.

Martin

--
name: Martin Albrecht
_pgp: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x8EF0DC99
_www: http://www.informatik.uni-bremen.de/~malb
_jab: [EMAIL PROTECTED]