Solving the spurious forward/cyclic reference errors in DMD

Elie Morisse via Digitalmars-d Sun, 16 Jul 2017 18:21:20 -0700

Timon, any update on this? What insights did you gain with your frontend?


I recently reported two cases without a simple fix:


https://issues.dlang.org/show_bug.cgi?id=17656
https://issues.dlang.org/show_bug.cgi?id=17194#c1

and have seen a lot more referencing errors with Calypso, especially when this gets enabled:
https://github.com/Syniurge/Calypso/commit/1e1ae319e32120bd9ef0009716ddabed92f69ac2

Calypso makes its mapped C++ symbols go through the same importAll -> semantic1,2,3 route that D symbols take. Ultimately this is mostly useless work that should be skipped; it currently works this way only because I wasn't yet familiar with the DMD source code when I started. But what this hard and thankless work has also been doing (and many large libraries are blocked by this) is exposing a seemingly infinite number of bogus forward/circular/misc referencing DMD errors. Those errors boil down to semantic calls getting triggered at the wrong time, on symbols that the caller doesn't really depend upon.

Most of the time, the semantic() call on the LHS of DotXXXExp, inside AggregateDeclaration.determineSize, etc. is there in case there are:

 - mixins to expand
 - attributes whose members have to be added to the parent symtab
 - templates to instantiate, if the LHS is a template

These are (AFAIK) the only cases where the symtab of the LHS or the aggregate may get altered, and if I understand correctly that's what the semantic call is checking before searching for the RHS or determining the aggregate fields and then its size.
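To make the three cases concrete, here is a minimal sketch. The names A, T, x, y and z are made up for illustration and do not come from DMD or Calypso; the point is only that each member on the right of the dot becomes visible after the left-hand side has been processed.

```d
// Illustration only: A, T and the member names are hypothetical.
struct A
{
    // 1. String mixin: `x` is not in A's symbol table until the mixin
    //    has been expanded.
    mixin("enum x = 1;");

    // 2. Attribute block: `y` is declared inside the block, but has to
    //    be added to A's own symbol table.
    static
    {
        immutable int y = 2;
    }
}

// 3. Template: `z` only exists once T has been instantiated.
template T(int n)
{
    enum z = n + 3;
}

// Each lookup on the right of the dot needs the members of the
// left-hand side to be known first.
static assert(A.x == 1);
static assert(A.y == 2);
static assert(T!(4).z == 7);

void main() {}
```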

So would splitting semantic() into determineMembers() preceding the rest of semantic() be worth exploring? The thing is, this would help in most cases but I can imagine scenarios where simply splitting may not be enough. Example:


enum E { OOO = S.UUU }

import std.conv;

string genMember1() { return "enum G8H9 = " ~ (cast(int)E.OOO).to!string ~ ";"; }

string genMember2() { return "enum UUU = 1;"; }

struct S {
    mixin(genMember1());
    mixin(genMember2());
}

We'll have S.determineMembers -> E.OOO.semantic -> S.determineMembers, and although in this case the value of OOO may be interpreted as 1, at this point the compiler can't easily know whether the mixins will generate zero, one or more UUU members. To mitigate the problem, determineMembers() could be made callable multiple times (recursively), each time starting from where the previous (ongoing) call left off, so in this particular case the second S.determineMembers call would expand the second mixin to enum UUU = 1. But then how does the compiler know and react if genMember1 generates a new UUU member? OK, a second UUU enum will error, but what if UUU was a function and genMember1() generates a more specialized overload of UUU? I.e.:


enum E { OOO = S.UUU(1) }

import std.conv;

string genMember1() { return "static int UUU(int n) { return n; } enum G8H9 = " ~ (cast(int)E.OOO).to!string ~ ";"; }
string genMember2() { return "static int UUU(int n, int a = 5) { return n + 5; }"; }


struct S {
    mixin(genMember1());
    mixin(genMember2());
}

At this point, well, it's getting a bit contrived, so maybe it's not really worth finding a solution to make this compile (but ideally the compiler should still warn the user).
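The resumable determineMembers() idea described above can be modelled with a toy sketch. This is not DMD code: Aggregate, Expander, addMixin, lookup and demo are hypothetical names, and a mixin is reduced to a callback that returns the member names it declares. It only shows the control flow of a nested call picking up where the ongoing call left off.

```d
import std.stdio : writeln;

// Toy model only, not DMD code: every name below is hypothetical.

// A "mixin" is modelled as a callback returning the member names it declares.
alias Expander = string[] delegate();

final class Aggregate
{
    Expander[] pending;   // mixins in declaration order
    size_t next;          // index of the first mixin not yet started
    bool[string] symtab;  // members known so far

    void addMixin(Expander e) { pending ~= e; }

    // Re-entrant: a nested call starts after the mixin currently being
    // expanded instead of reporting a cycle.
    void determineMembers()
    {
        while (next < pending.length)
        {
            // Advance first, so that a nested call skips this mixin.
            auto expand = pending[next++];
            foreach (name; expand())
                symtab[name] = true;
        }
    }

    bool lookup(string name)
    {
        determineMembers();
        return (name in symtab) !is null;
    }
}

// Mirrors the first example: the first mixin needs S.UUU, which only the
// second mixin declares.
bool demo()
{
    auto s = new Aggregate;
    bool sawUUU = false;

    s.addMixin(delegate string[]() { sawUUU = s.lookup("UUU"); return ["G8H9"]; });
    s.addMixin(delegate string[]() { return ["UUU"]; });

    s.determineMembers();
    return sawUUU && s.lookup("G8H9");
}

void main()
{
    writeln(demo()); // prints "true"
}
```

In this model the nested lookup finds UUU because the second mixin is expanded while the first one is still in progress. It cannot see anything the first mixin adds afterwards, which is exactly the open question with a later, more specialized overload.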

Should I try splitting semantic() and make a PR? It might be a lot of work, so I'd like to know if this makes sense first.
