Re: DConf 2014 Day 1 Talk 4: Inside the Regular Expressions in D by Dmitry Olshansky

Dicebot via Digitalmars-d-announce Sat, 14 Jun 2014 08:10:30 -0700

On Thursday, 12 June 2014 at 16:42:38 UTC, Dmitry Olshansky wrote:

It's always nice to ask something on D NG, so many good answers I can hardly choose whom to reply to ;) So this is kind of a broadcast.
Yes, the answer seems spot on - reflection! But allow me to retort.
I'm not talking about a completely stand-alone generator. Just as well the generator tool could be written in D using the same exact sources as your D program does. Including the static introspection and type-awareness. Then the generator itself is a library + "an invocation script" in D.
The Q is specifically of CTFE in this scenario, including not only obvious shortcomings of design, but fundamental ones of compilation inside of compilation. Unlike proper compilation it has nothing persistent to back it up. It feels backwards, a bit like C++ TMP but, of course, much-much better.
1)
Reflection. It is less of an issue for pure DSL solutions because those don't provide any good reflection capabilities anyway, but other code generation approaches have very similar problems.
By doing all code generation in a separate build step you potentially lose many of the guarantees of keeping various parts of your application in sync.
Use the same sources for the generator. In essence all is the same, just relying on separate runs and linkage, not mixin. Necessary "hooks" to link to later could indeed be generated with a tiny bit of CTFE.
Yes, deeply embedded stuff might not be that easy. The scope and damage is smaller though.
2)
Moving forward. You use traditional reasoning of DSL generally being something rare and normally stable. This fits most common DSL usage but the tight in-language integration D makes possible brings new opportunities of using DSL and code generation casually all over your program.
Well, I'm biased by heavy-handed ones. Say I have a (no longer) secret plan of doing a next-gen parser generator in D. Needless to say swaths of non-trivial code generation. I'm all for embedding nicely but I see very little _practical_ gains in CTFE+mixin here EVEN if CTFE wouldn't suck. See the point above about using the same metadata and types as the user application would.

Consider something like the REST API generator I have described during DConf. There is different code generated in different contexts from the same declarative description - both for server and client. Right now the simple fact that you import the very same module from both gives a solid 100% guarantee that API usage between those two programs stays in sync.
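
A minimal sketch of that guarantee, assuming a hypothetical `Api` interface as the shared declarative description. Everything is in one file here; in practice `Api` would sit in a module imported by both programs. The names `Api`, `clientPaths`, `Client` and `Server` are made up for illustration and are not taken from the generator shown at DConf:

```d
// Hypothetical shared declarative description of the API.
interface Api
{
    int add(int a, int b);
    int negate(int x);
}

// Hypothetical CTFE generator: emits one path constant per API method
// as D source text, using compile-time reflection on the interface.
string clientPaths(I)()
{
    string code;
    foreach (name; __traits(allMembers, I))
        code ~= "enum path_" ~ name ~ " = \"/" ~ name ~ "\";\n";
    return code;
}

// Client side: generated declarations are mixed in at compile time.
struct Client
{
    mixin(clientPaths!Api());
}

// Server side: must implement every method of the same description.
class Server : Api
{
    int add(int a, int b) { return a + b; }
    int negate(int x) { return -x; }
}

// Checked by the compiler, not by a build script.
static assert(Client.path_add == "/add");

void main()
{
    import std.stdio : writeln;

    Api api = new Server;
    writeln(Client.path_add, " -> ", api.add(2, 3));
}
```

If a method is added to `Api`, `Server` stops compiling until it implements it, and `Client` picks up the new constant on the next build, with no build script involved.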

In your proposed scenario there will be two different generated files imported by server and client respectively. A tiny typo in writing your build script will result in a hard to detect run-time bug while the code itself still happily compiles.

You may keep convenience but losing guarantees hurts a lot. To be able to verify static correctness of your program / group of programs the type system needs to be aware how generated code relates to the original source.

Also this approach does not scale. I can totally imagine you doing it for two or three DSLs in a single program, probably even a dozen. But something like 100+? Huge mess to maintain. According to my experience all build systems are incredibly fragile beasts, trusting them with something that impacts program correctness and won't be detected at compile time is just too dangerous.

I totally expect programming culture to evolve to the point where something like 90% of all application code is being generated in a typical project. D has a good base for promoting such a paradigm switch and reducing any unnecessary mental context switches is very important here.
This was pretty much the point I was trying to make with my DConf talk (and have probably failed :) )
I liked the talk, but you know ... 4th or 5th talk with CTFE/mixin I think I might have been distracted :)
More specifically this bright future of 90%+ concise DSL driven programs is undermined by the simple truth - no amount of improvement in CTFE would make generators run faster than an optimized standalone tool invocation. The tool (library written in D) may read D metadata just fine.
I heard D build times are an important part of its adoption so...

Adoption - yes. Production usage - less so (though still important). Difference between 1 second and 5 seconds is very important. Between 10 seconds and 1 minute - not so much.

JIT will probably be slower than stand-alone generators but not that much slower.

It might solve most of _current_ problems, but I foresee fundamental issues of "no global state" in CTFE that in say 10 years from now would look a lot like `#include` in C++.

I hope 10 years from now we will consider having global state in RTFE a stone age relic :P

A major one is there is no way for the compiler to not recompile generated code as it has no knowledge of how it might have changed from the previous run.

Why can't we merge basic build system functionality akin to rdmd into the compiler itself? It makes perfect sense to me as the build process can benefit a lot from being semantically aware.
