Hi,
Chapter 12.15.2 of the spec explains that void initialization of
a static array can be faster than default initialization. That
seems logical, because the array entries don't need to be set to
NaN first. However, when I ran some tests for my matrix
implementation, the default-initialized array turned out to be
quite a bit faster.
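For reference, the two strategies I'm comparing look roughly like this (a minimal sketch, not my actual matrix code; `N` is a stand-in for my real dimensions):

```d
enum N = 4;

void defaultInit()
{
    // Default initialization: every entry is set to float.init (NaN).
    float[N * N] m;
    m[0] = 1.0f; // touch m so it isn't optimized away
}

void voidInit()
{
    // Void initialization: entries are left uninitialized, so per
    // spec 12.15.2 this should skip the NaN fill.
    float[N * N] m = void;
    m[0] = 1.0f;
}
```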
The code (and the disassembly) is at
https://gist.github.com/wolframw/73f94f73a822c7593e0a7af411fa97ac
I compiled with dmd -O -inline -release -noboundscheck -mcpu=avx2
and ran the tests with the m array default-initialized in one
run and void-initialized in the other.
The results:
Default-initialized: 245 ms, 495 μs, and 2 hnsecs
Void-initialized: 324 ms, 697 μs, and 2 hnsecs
What the heck?
I've also inspected the disassembly and found an interesting
difference in the benchmark loop (annotated with "start of loop"
and "end of loop" in both disassemblies). It looks to me like the
compiler partially unrolled the loop in both cases, but in the
default-initialized case it discards every second multiplication
result and stores only every other one to the sink matrix. In the
void-initialized version, every result seems to be stored to the
sink matrix.
I don't see how such a difference can be caused by the different
initialization strategies. Is there something I'm not considering?
Also, if the compiler is smart enough to figure out that it can
discard some of the results, why doesn't it just do away with the
entire loop and run the multiplication only once? Since both
input matrices are immutable and opBinary is pure, the result is
guaranteed to be the same on every iteration, isn't it?
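To illustrate what I mean, here is a stripped-down sketch of the situation (hypothetical `Mat` type with a placeholder element-wise "multiply"; my real opBinary does a proper matrix product, but the purity argument is the same):

```d
struct Mat
{
    float[4] data;

    // pure: the result depends only on the two operands
    Mat opBinary(string op : "*")(in Mat rhs) const pure
    {
        Mat r;
        foreach (i; 0 .. 4)
            r.data[i] = data[i] * rhs.data[i]; // placeholder "multiply"
        return r;
    }
}

void main()
{
    immutable Mat a = Mat([1, 2, 3, 4]);
    immutable Mat b = Mat([5, 6, 7, 8]);

    Mat sink;
    // a and b never change and opBinary is pure, so every iteration
    // computes the same value; in principle the compiler could hoist
    // the multiplication out of the loop entirely.
    foreach (i; 0 .. 1_000)
        sink = a * b;
}
```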