I was curious if binary sizes had decreased because of the changes Ilya had been making to try and scope imports better and make them more selective:

https://github.com/D-Programming-Language/phobos/pulls?utf8=%E2%9C%93&q=is%3Apr+author%3A9il+clean

Hello world (void main(){ import std.stdio; writefln("hello roboto"); }) size went from 464 KB to 548 KB when going from 2.066.1 to 2.067.1 on linux/x86, an increase of 18% (dmd -O -release main.d). I used nm to try and find some of the symbols using the most space (command taken from SO):

nm -B -r --size-sort --print-size -t d main

I noticed that the symbol taking up the third-most space was _d_arraysetlengthT, which wasn't in the older executable generated by 2.066.1. Disassembling the newer executable (objdump -rD main), it appears that it's called from exactly one function, std.uni.GcPolicy.realloc, which is in turn only called from one templated struct's member function, std.uni.CowArray.length. That instantiated function isn't called from anywhere else in the binary.

The templated struct std.uni.CowArray is only instantiated by the templated struct std.uni.InversionList in the source, but I'm not sure why neither is instantiated in the older executable and a diff of the two versions of std.stdio doesn't produce anything that stands out. None of this appears to be used when the binary is run, as having gdb break on _d_arraysetlengthT does nothing.

But std.uni isn't actually imported directly by std.stdio, where does it come from? Nearest I can tell from adding the -v flag to dmd, std.stdio has a couple scoped, selective imports to some functions from std.utf. std.utf has exactly _one_ scoped, selective import of std.string.format in its UTFException class, and std.string has several selective imports from std.uni, including one at module scope.

I tried commenting out that single selective import of std.string.format in std.utf and the same binary compiled and ran fine without any imports of std.string or std.uni, plus it was now 36 KB smaller. :)

I realize executable size may not be a priority, but this exploration shows how easy it is to get a bunch of template garbage pulled in to executables (I know this is not news for some). Perhaps the binary would have been twice as big if not for Ilya's work! Maybe this isn't considered something that should be fixed at the compiler level, but rather by properly working with the linker to remove these, as David did with --gc-sections for ldc. Either way, some kind of dashboard that charts binary sizes for dmd PRs can't come soon enough, so we can keep better tabs on this.

Reply via email to