Re: [fpc-devel] Vectorization

Adriaan van Os Tue, 12 Dec 2017 01:25:58 -0800

J. Gareth Moreton wrote:

I created a Wiki page to plan things out: 
http://wiki.lazarus.freepascal.org/Vectorization

As a side bar, note what Intel writes about Optimization in the Intel Math Kernel Library DeveloperReference

<https://software.intel.com/en-us/mkl-developer-reference-c-performance-enhancements>

Performance Enhancements

The Intel Math Kernel Library has been optimized by exploiting both processor and system featuresand capabilities. Special care has been given to those routines that most profit fromcache-management techniques. These especially include matrix-matrix operation routines such as dgemm().

In addition, code optimization techniques have been applied to minimize dependencies of schedulinginteger and floating-point units on the results within the processor.


The major optimization techniques used throughout the library include:

• Loop unrolling to minimize loop management costs

• Blocking of data to improve data reuse opportunities

• Copying to reduce chances of data eviction from cache

• Data prefetching to help hide memory latency

• Multiple simultaneous operations (for example, dot products in dgemm) to 
eliminate stalls due to
arithmetic unit pipelines

• Use of hardware features such as the SIMD arithmetic units, where appropriate

Regards,

Adriaan van Os
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel

Re: [fpc-devel] Vectorization

Reply via email to