Denis Serenyi writes:
 > I don't think there will be a problem with releasing my source code. 
 > That is, if it works and is truly a performance win :)

Cool!

 > There are some PDF docs available on Intel's web site that have sample 
 > code for an SSE bcopy, and give performance results (in particular, 
 > "Block Copy Using Pentium III Streaming SIMD Extensions"). It seems to 
 > be about 60 - 80% faster than using MMX instructions. However, when you 
 > use SSE to store data in the destination memory location, you bypass the 
 > processor's caches. So, if you were to touch the data soon after the 
 > bcopy, it is no win at all.

Hey, that's great!  The copies I care about are in situtations where
the data is not touched until much later, so the normal copy is
typically a big loose because it blows out the cache..

Good luck,

Drew


To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-hackers" in the body of the message

Reply via email to