Re: The "right way" to handle alignment of pointer targets in the compiler?

Tim Prince Sat, 02 Jan 2010 19:49:30 -0800

Benjamin Redelings I wrote:

Thanks for the information!

Here are several reasons (there are more) why gcc uses 64-bit loads bydefault:1) For a single dot product, the rate of 64-bit data loads roughlybalances the latency of adds to the same register. Parallel dot products(using 2 accumulators) would take advantage of faster 128-bit loads.2) run-time checks to adjust alignment, if possible, don't pay off forloop counts < about 40.3) several obsolete CPU architectures implemented 128-bit loads by pairsof 64-bit loads.4) 64-bit loads were generally more efficient than movupd, prior tobarcelona.

In the case you quote, with parallel dot products, 128-bit loads wouldbe required so as to show much performance gain over x87.

Re: The "right way" to handle alignment of pointer targets in the compiler?

Reply via email to