I have committed the remaining transform functions written in SSE assembly.
That means all the transform_points* functions are now done in SSE assembly :)

These functions are faster than the FPU ones, but to get the most speed out of
SSE, the data the SSE instructions work on (transform matrix, src_vertices and
dst_vertices) must be 16-byte aligned.

I still have to find the best way to do that (while keeping the code as clean as possible!).
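
One option, purely as a sketch and not what is in the tree: over-allocate with
malloc() and round the pointer up to the next 16-byte boundary. The helper names
below (is_aligned16, malloc_aligned16, free_aligned16) are made up for
illustration and are not existing Mesa functions.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper (not a Mesa function):
 * returns nonzero if p sits on a 16-byte boundary. */
static int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15u) == 0;
}

/* Hypothetical helper: allocate `size` bytes on a 16-byte boundary.
 * It over-allocates and stores the pointer returned by malloc() in the
 * slot just before the aligned block, so it can be freed later. */
static void *malloc_aligned16(size_t size)
{
    void *raw = malloc(size + 15 + sizeof(void *));
    uintptr_t addr;
    void **aligned;

    if (raw == NULL)
        return NULL;

    /* Skip room for the stored pointer, then round down to a multiple of 16. */
    addr = ((uintptr_t)raw + sizeof(void *) + 15) & ~(uintptr_t)15;
    aligned = (void **)addr;
    aligned[-1] = raw;

    assert(is_aligned16(aligned));
    return aligned;
}

/* Hypothetical helper: release a block obtained from malloc_aligned16(). */
static void free_aligned16(void *p)
{
    if (p != NULL)
        free(((void **)p)[-1]);
}
```

The transform matrix and the vertex arrays could then be allocated through
malloc_aligned16() and released with free_aligned16(). Whether this is clean
enough for the rest of the code is exactly the open question.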

Bye,
   Andre.


_______________________________________________
Mesa-dev maillist  -  [EMAIL PROTECTED]
http://lists.mesa3d.org/mailman/listinfo/mesa-dev
