Re: C++ compiler vs D compiler

Timon Gehr via Digitalmars-d Sat, 03 Oct 2015 08:46:08 -0700

On 10/03/2015 12:45 PM, Abdulhaq wrote:

Perhaps the answer to this is obvious, but what's harder to write from
scratch - a C++ compiler or a D compiler? :-)


We know Walter wrote a C++ compiler single handedly, does anyone else
recall the C++ Grandmaster qualification, the free course where
participants get to write a complete C++ compiler from scratch? I think
it's dead now but I can't find any real info about that despite a
serious google. What's the chances of anyone single-handedly writing a D
compiler from scratch in the future? I know deadalnix is writing SDC in
D - sounds interesting.

I have also started a similar project four years ago (it's currentlyjust an incomplete compiler front-end), but I have not been able to workmuch on it during the last year. (It only compiles with DMD 2.060though, because of issues similar to what I discuss below.)

Is the D language well enough documented /
specified for a complete D implementation to be even possible (as things
stand now)?

Well, not really. The main impediment to a fully formal specification isthe interplay of forward references and the meta-programming system.(DMD just does a best-effort kind of thing, where the common casescenarios for which there were bug reports work, but in general thesemantics of the resulting code can depend on things like the order thatmodules are passed on the command line, or perfectly valid code isrejected with a "forward reference error".) The documentation justspecifies that everything works, but there is no consistent way tointerpret it.


E.g.:

static if(!is(typeof(x))) enum y=2;
static if(!is(typeof(y))) enum x=2;

Arbitrarily abstruse examples can be constructed, e.g. this is from mytest suite:


struct TestInvalidInheritance{
    class A{ int string; } // error
    template Mixin(string s){
        mixin("alias "~s~" Mixin;");
    }
    class D: Mixin!({D d = new E; return d.foo();}()){
        int foo(int x){ return 2;}
        string foo(){ return "X"; }
    }
    class E: D{
        override int foo(int x){ return super.foo(x); }
        override string foo(){ return "A"; }
    }
}

(I currently accept this code if the declaration of int string in classA is removed. Otherwise the code is analyzed until the point when it isclear that A is in D's superclass chain, and it is also clear that thesymbol 'string' which was necessary to resolve in order to discover thisfact was resolved incorrectly. The compiler then gives up and prints anerror message:


example.d:3:18: error: declaration of 'string' is invalid
    class A{ int string; } // error
                 ^─────
example.d:9:9: note: this lookup on subclass 'D' should have resolved to it
        string foo(){ return "X"; })
        ^─────

I and SDC have different ways to deal with those kinds of examples, andI think the SDC way does not work (unless things have changed since Ihave looked at it). It assumes that declarations can be ordered and thatit is fine to depend on the order of declarations.

My implementation is designed to be independent of declaration order andto reject the cases where there is no single obvious and consistentinterpretation of the program. The drawbacks currently are:

- It is overly conservative in some cases, especially when string mixinsare involved. E.g., the following code has only one consistentinterpretation, but it is rejected (as is any reordering of thosedeclarations):


enum x = "enum xx = q{int y = 0;};";

struct SS{
    mixin(xx);
    mixin(x);
}

- The current implementation is somewhat slow. IIRC, N-fold recursivetemplate instantiation currently runs in Ω(N²). It's clear that thisneeds to be improved. If this is to be adopted as the official solution,it should not make the compiler any slower, at least in the common case.

There are also some other, more minor issues. For example, when thelanguage specification speaks about "memory safety", it is reallyunclear what this means, as the language designers seem to think it thatit is fine to have undefined behaviour in a section of code that is"verified memory safe".

Re: C++ compiler vs D compiler

Reply via email to