I have committed the remaining transform-functions written in SSE-assembly.
That means now all the transform_points* functions are done in SSE-assembly :)
These functions are faster than the fpu-ones, but to get the most speed out of the
SSE the data, the SSE-instructions
are working on (transform-matrix, src_vertices and dst_vertices) must be 16byte
aligned.
I have to find a way how to do it best (while keeping the code as clean as possible !).
Bye,
Andre.
_______________________________________________
Mesa-dev maillist - [EMAIL PROTECTED]
http://lists.mesa3d.org/mailman/listinfo/mesa-dev