System programming in D (Was: The God Language)

Vladimir Panteleev Thu, 29 Dec 2011 03:22:23 -0800

On Thursday, 29 December 2011 at 09:16:23 UTC, Walter Brightwrote:

Are you a ridiculous hacker? Inline x86 assembly that thecompiler actually understands in 32 AND 64 bit code, hex stringliterals like x"DE ADB EEF" where spacing doesn't matter, theability to set data alignment cross-platform with type.alignof= 16, load your shellcode verbatim into a string like so: autostr = import("shellcode.txt");

I would like to talk about this for a bit. Personally, I thinkD's system programming abilities are only half-way there. Notethat I am not talking about use cases in high-level applicationcode, but rather low-level, widely-used framework code, whereevery bit of performance matters (for example: memory copyroutines, string builders, garbage collectors).

In-line assembler as part of the language is certainly neat, andin fact coming from Delphi to C++ I was surprised to learn thatC++ implementations adopted different syntax for asm blocks.However, compared to some C++ compilers, it has severelimitations and is D's only trick in this alley.

For one thing, there is no way to force the compiler to inline afunction (like __forceinline / __attribute((always_inline)) ).This is fine for high-level code (where users are best left withPGO and "the compiler knows best"), but sucks if you need aguarantee that the function must be inlined. The guarantee isn'tjust about inlining heuristics, but also implementationcapabilities. For example, some implementations might not be ableto inline functions that use certain language features, and yourcode's performance could demand that such a short function mustbe inlined. One example of this is inlining functions containingasm blocks - IIRC DMD does not support this. The compiler shouldfail the build if it can't inline a function tagged with@forceinline, instead of shrugging it off and failing silently,forcing users to check the disassembly every time.

You may have noticed that GCC has some ridiculously complicatedassembler facilities. However, they also open the way to thepossibilities of writing optimal code - for example, creatingcustom calling conventions, or inlining assembler functionswithout restricting the caller's register allocation with apredetermined calling convention. In contrast, DMD is veryconservative when it comes to mixing D and assembler. One time Ifound that putting an asm block in a function turned what weresingle instructions into blocks of 6 instructions each.

D's lacking in this area makes it impossible to create languagefeatures that are on the level of D's compiler built-ins. Forexample, I have tested three memcpy implementations recently, butnone of them could beat DMD's standard array slice copy (despitethat in release mode it compiles to a simple memcpy call). Why?Because the overhead of using a custom memcpy routine negated itsperformance gains.

This might have been alleviated with the presence of sane macros,but no such luck. String mixins are not the answer: trying totranslate macro-heavy C code to D using string mixins is stringescape hell, and we're back to the level of shell scripts.

We've discussed this topic on IRC recently. From what Iunderstood, Andrei thinks improvements in this area are not"impactful" enough, which I find worrisome.

Personally, I don't think D qualifies as a true "systemprogramming language" in light of the above. It's more of acompiled language with pointers and assembler. Before youdisagree with any of the above, first (for starters) I'd like toinvite you to translate Daniel Vik's C memcpy implementation toD: http://www.danielvik.com/2010/02/fast-memcpy-in-c.html . Itdoesn't even use inline assembler or compiler intrinsics.

System programming in D (Was: The God Language)

Reply via email to