> At 11:03 PM 2/28/2002 +0100, Guillermo Ballester Valor wrote:
> >The memory bottleneck was the first thing I thought, and I was near to
> >discard the idea when I realized that the trig bata would be the same, and
> >the required memory access would be less than double the single stream
> > scheme.
>
> Well, that would be true if SSE2 had a multiply vector by scalar
> instruction. That is, to multiply two values by the same trig value, you
> must either load two copies the trig value or add instructions to copy the
> value into both halves
> of the SSE2 register.
>

Yes, I was thinking in copy the trig value from a half to other, although I 
don't know how would be the cost.
_________________________________________________________________________
Unsubscribe & list info -- http://www.ndatech.com/mersenne/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers

Reply via email to