At 11:03 PM 2/28/2002 +0100, Guillermo Ballester Valor wrote:
>The memory bottleneck was the first thing I thought, and I was near to
>discard the idea when I realized that the trig bata would be the same, and
>the required memory access would be less than double the single stream scheme.

Well, that would be true if SSE2 had a multiply vector by scalar instruction.
That is, to multiply two values by the same trig value, you must either load
two copies the trig value or add instructions to copy the value into both 
halves
of the SSE2 register.

_________________________________________________________________________
Unsubscribe & list info -- http://www.ndatech.com/mersenne/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers

Reply via email to