> At 11:03 PM 2/28/2002 +0100, Guillermo Ballester Valor wrote:
> >The memory bottleneck was the first thing I thought of, and I was close to
> >discarding the idea when I realized that the trig data would be the same, and
> >the required memory access would be less than double the single stream
> >scheme.
>
> Well, that would be true if SSE2 had a multiply vector by scalar
> instruction. That is, to multiply two values by the same trig value, you
> must either load two copies of the trig value or add instructions to copy the
> value into both halves of the SSE2 register.
>
Yes, I was thinking of copying the trig value from one half of the register to the other, although I don't know what that would cost.

_________________________________________________________________________
Unsubscribe & list info -- http://www.ndatech.com/mersenne/signup.htm
Mersenne Prime FAQ      -- http://www.tasam.com/~lrwiman/FAQ-mers
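
For what it is worth, the "copy from one half to the other" idea can be sketched with the SSE2 intrinsics from <emmintrin.h>. This is only an illustration of the technique under discussion, not code from any existing FFT implementation: the function name mul_pair_by_trig and its argument layout are made up for the example, and it says nothing about how many cycles the extra instruction costs on a given CPU.

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2 intrinsics */

/*
 * Hypothetical helper (name and signature invented for this sketch):
 * multiply two doubles, one from each data stream, by a single trig value.
 * SSE2 has no "vector times scalar" multiply, so the trig value is loaded
 * once into the low half of a register and then copied into the high half.
 */
void mul_pair_by_trig(const double pair[2], const double *trig, double out[2])
{
    assert(pair && trig && out);

    /* Load one copy of the trig value: t = { *trig, 0.0 } */
    __m128d t = _mm_load_sd(trig);

    /* Copy the low half into the high half: t = { *trig, *trig } */
    t = _mm_unpacklo_pd(t, t);

    /* Load the two stream values (unaligned load, no alignment assumed) */
    __m128d v = _mm_loadu_pd(pair);

    /* One packed multiply handles both streams */
    _mm_storeu_pd(out, _mm_mul_pd(v, t));
}
```

With pair = {3.0, 8.0} and a trig value of 0.5, out holds {1.5, 4.0}. The other option mentioned in the quoted text, storing two copies of each trig value in memory, would replace the first two steps with a single 16-byte load at the price of doubling the trig table.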
