D Language Foundation October Monthly Meeting Summary

Mike Parker via Digitalmars-d-announce Sun, 31 Dec 2023 03:16:44 -0800

The D Language Foundation's monthly meeting for October 2023 took place on Friday the 13th at 15:00 UTC. It lasted around one hour and thirty minutes. I was unable to attend, so thanks to Razvan for running it and to Dennis for recording it.

Two attendees were first-timers: Adam Wilson and Luís Ferreira. I invited Adam along after a conversation we had at DConf. Luís was originally supposed to attend the quarterly meeting the week before as a representative of Weka but had been unable to make it, so I invited him to this one.


## The Attendees

The following people attended the meeting:

* Andrei Alexandrescu
* Walter Bright
* Luís Ferreira
* Timon Gehr
* Martin Kinkelin
* Dennis Korpel
* Átila Neves
* Razvan Nitu
* Adam D. Ruppe
* Steven Schveighoffer
* Adam Wilson

### Razvan

Razvan got started with an issue he had recently encountered with the inliner that he didn't know how to resolve. Normally, when module A imports module B and calls a function from B, then you'll end up with a linker error if B's object file isn't handed off to the linker, e.g., when compiling with `A.d` on the command line and omitting `B.d`. However, when the called function is inlined, no linker error is raised. He suspected this was an optimization.

He'd found that this behavior breaks building with BetterC with the inline flag. He explained that the inliner runs after Semantic 3, but since B is not the root module some extra semantic stuff is done, and then you end up with `TypeInfo` errors. His first instinct was that this was a hack and that the inliner shouldn't be doing any semantic analysis. But if we change this, it might cause linker errors elsewhere.

Walter said this was the first he'd heard of this. Why not just link in module B or add it as a root file? And why was it a particular problem for BetterC? To the former, Razvan said that's the way to fix the error. Then he asked if Walter agreed that the inliner shouldn't be doing any semantic analysis. Walter said he didn't know as he hadn't looked into it. He didn't remember that the inliner was doing semantic analysis.

Razvan then asked if everyone agreed that compilation with and without the inline flag should yield the same result. Walter said not necessarily. If a function is inlined, you don't need to link it in. And to link it, you have to add it as a root module. So he didn't see why this was a problem and why it was a particular problem for BetterC.

Steve said that the inliner had to semantically analyze the inlined code. So he asked if Razvan was talking about some other semantic analysis taking place, like on code that wasn't used. Razvan explained he was talking about the analysis the inliner has to do on inlined code when the imported module isn't a root module.

Martin mentioned some LDC linker flags that require extra semantic analysis on things that would otherwise be linked externally. He said if something is a root module, it's going to be analyzed anyway, but if you just compile module A and it decides to inline another function that would otherwise not require semantic analysis, then yes... He wasn't sure he understood what the problem was here, but he could confirm that they'd had a problem a couple of weeks before. They were getting linker errors related to inlining, but it had nothing to do with separate compilation. He said that was a topic for later.

Walter noted that if the inlined function from module B called any other functions from module B, then you'd see linker errors if B wasn't linked. Was that the problem? Martin said it wasn't the problem in their case. It was a unit test runner that included all of the code, so there shouldn't be any undefined symbols.

Walter asked Razvan to clarify what sort of linker errors he was seeing. Razvan said it had to do with `TypeInfo` generation in BetterC code. The fundamental issue was that semantic analysis was being done when it shouldn't and `TypeInfo` was being generated. If you compile with `-betterC` but without `-inline`, you don't see the errors, but if you compile with both, you see them.

Walter said that, okay, it was a `TypeInfo` issue. To figure that out, he'd have to trace through the code to understand why a `TypeInfo` was being generated in module B in that case. He couldn't think of a reason off the top of his head.

Razvan suggested an alternative fix would be to only inline functions from root modules. Walter said that would never work. You wouldn't be able to have header-only libraries very well. He said the correct fix was to determine why a `TypeInfo` was being generated when it shouldn't be.

Martin said LDC didn't have these kinds of issues with `TypeInfo` because it had a different emission strategy for them. He suggested DMD take the same route. For classes, the `TypeInfo` is always generated in the module which contains the class declaration. That was the same as DMD as far as he knew. Where they diverge is with structs. The special `TypeInfo` members which are added to the struct are all emitted into the object file that contains the struct, but the actual `TypeInfo` is emitted lazily from the codegen layer whenever it's accessed. So they avoid all the trouble with `TypeInfo` being needed at CTFE and such because it's generated lazily in each module that needs it, i.e., each module that accesses it in actual code that's executed at run time. All of the speculative instantiation stuff could be untangled by just doing it in the codegen layer rather than as part of semantic analysis.

Steve was trying to understand the failure: if you're calling a function with BetterC that needs `TypeInfo`, and it links without `-inline`, then that must mean that the function being called isn't really BetterC, because it needs the `TypeInfo`. Dennis thought it came up when you want to call a CTFE-only function in a library, e.g., a Phobos function that uses GC only for CTFE, and with inlining it's inserting the Phobos code into your BetterC code. He said that basically what it comes down to is that BetterC is a bunch of hacks and the front-end inliner is a bit of a hack, and in the long term they should both go. Steve summed it up as "BetterC can't use CTFE when a CTFE function uses runtime features, which is a long-standing problem."

Walter asked Razvan if there was a Bugzilla issue for this. Razvan said there was and there was also a PR. Walter asked if the PR worked, and Razvan said it didn't.

(__UPDATE__: Both [the Bugzilla issue](https://issues.dlang.org/show_bug.cgi?id=24153) and [the pull request](https://github.com/dlang/dmd/pull/15627) have since been closed, as the issue is no longer reproducible.)


### Dennis

Dennis wanted to know what the future of the error interface is going to be in DMD. They'd been working to use an error sink so that it wasn't just printing directly to the console, but the interface is a thin wrapper around `printf`. Some issues with it had come up and attempts to fix them had fallen short because the interface is too limiting. Walter wanted a simple interface and had rejected proposals that would replace it with one that's complex. So Dennis was wondering if there was a place to meet in the middle: make it slightly more complex to tackle the issues they'd encountered. He noted that one of our goals was to improve error messages.

Walter said he agreed with the goal. With his recent pull requests, he'd been trying not to have multiple interfaces to the error message handler. He cited the diagnostic error message printer as another thing they were trying to simplify. He recalled agreeing with Iain to remove the complexity that had been added to the error sink, but nothing had happened yet. The purpose for the error sink was twofold: to simplify the interface, and to make it usable with DMD-as-a-library. Then the library can provide its own interface to do what it wants. So he wanted to keep that simple. DMD-as-a-library was going to be simple so we could do the LSP implementation simply.

This was followed by a discussion about issues with the current implementation, the use of `toChars` and `toPrettyChars`, decisions about truncation and formatting, etc. A big point here was that Walter said all the custom formatting should be done up front, before the message gets to the error sink. Syntax highlighting and things like that should be done by whatever the error sink calls. But the error sink itself shouldn't be making any decisions about formatting and highlighting.
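To make that division of labor concrete, here is a minimal sketch. The names `Diagnostic`, `Sink`, `CollectingSink`, and `reportMismatch` are hypothetical and invented for illustration; this is not DMD's actual `ErrorSink` interface. It assumes only that the caller formats the message up front and that the sink passes it along untouched.

```d
import std.format : format;
import std.stdio : writeln;

// Hypothetical record of one diagnostic; not a DMD type.
struct Diagnostic
{
    string file;
    uint line;
    string message; // already fully formatted by the caller
}

// Hypothetical sink interface: it receives finished messages and makes
// no formatting or highlighting decisions of its own.
interface Sink
{
    void error(string file, uint line, string message);
}

// A library client (e.g., a language server) could simply collect them.
final class CollectingSink : Sink
{
    Diagnostic[] diagnostics;

    void error(string file, uint line, string message)
    {
        diagnostics ~= Diagnostic(file, line, message);
    }
}

// The caller does all custom formatting before handing the message over.
void reportMismatch(Sink sink, string file, uint line, string expected, string got)
{
    sink.error(file, line, format("expected `%s`, got `%s`", expected, got));
}

void main()
{
    auto sink = new CollectingSink;
    reportMismatch(sink, "app.d", 12, "int", "string");
    writeln(sink.diagnostics[0].message);
}
```

A console front end and an LSP server could then each supply their own `Sink` and decide independently how to present the stored messages.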

The outcome was that Dennis said he would experiment with a new `toChars` method and see how far he could get.


### Timon
Timon had nothing to bring up this time.

### Adam W.

Adam started by saying he had spoken with me at DConf, and I had invited him to the meeting to talk about how MS handles their release stuff for .NET editions. He summarized how it works:


* It's a one-year release cycle that ends in November.

* They have a three-month planning phase with the first preview coming in February.

* They do seven total previews.
* Those are followed by two release candidates.

He said that during the three-month planning phase, they still push out library fixes even for features they've decided to cut. They do feature requests and such right on GitHub and also list their focus areas there. And they have pretty strict rules about what can and cannot be done and when. So in the preview phase features can be added at any time even if it's running late. The language is finalized before RC1, and then RC1 is bug fixes only, and RC2 is polish and critical bug fixes only.

They're very focused on forward compatibility. If a feature is in there now, they have to support it even if they remove it from later editions. In terms of new language features, the compiler and the library are tied together, so you can't necessarily build the latest version of the library with an older version of the compiler, but you can go the other way.

When they release, they do multiple articles on their blog describing all the new stuff, written by the developers who wrote the features. They push a lot of stuff out on Twitter and get YouTube creators involved. The final release is done on the Monday night before their annual .NET Conf. Then on Tuesday morning at the start of the conference, everyone gets to download the new release. He had talked with me at one of the BeerConf sessions in London about this and suggested that once we get going with editions, we consider tying each release to DConf.

Walter noted it's always difficult getting people to write blog articles about this stuff. Adam agreed. He said I'd wear my fingers out if I tried to write it all, but suggested that I could spend some time interviewing the people who write the features. He said we could pull some of Adam Ruppe's writing into the mainstream. And he said he'd been doing some more writing lately himself, so he'd be willing to contribute on that front.

Walter said articles are a great marketing tool, but the most effective ones are from users. He cited all the articles out there from Rust programmers, like "I wrote TicTacToe in Rust" or "I wrote a text editor in Rust" or "I wrote some boring conventional thing in Rust". Many of them had no merit, but the constant drumbeat of articles about Rust appearing on social media was effective. They don't even have to be very substantive. Just something that gives a constant presence out there.

He said that another effective approach was responding to programming articles you see on social media with articles about how D solves whatever problem the article was solving. He gave an example of a discussion he'd seen on Hacker News about fallthrough in switch statements, so he wrote a little post about how D handles it with `goto case`. It got a lot of upvotes. He avoids criticizing other languages in that kind of writing. He just says, here's our solution and maybe they should adapt their solution to it.
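For readers unfamiliar with the feature, a short sketch of explicit fallthrough follows. The function `stagesFrom` is a made-up example, not taken from Walter's post. In D, falling off the end of a non-empty `case` is rejected by the compiler, so fallthrough has to be spelled out with `goto case`.

```d
import std.stdio : writeln;

// Hypothetical helper: counts how many stages run when starting at `level`.
int stagesFrom(int level)
{
    int count = 0;
    switch (level)
    {
        case 1:
            ++count;
            goto case;   // explicit fallthrough into the next case (case 2)
        case 2:
            ++count;
            goto case 3; // explicit jump to a specific case
        case 3:
            ++count;
            break;
        default:
            break;
    }
    return count;
}

void main()
{
    writeln(stagesFrom(1)); // 3
    writeln(stagesFrom(2)); // 2
    writeln(stagesFrom(3)); // 1
    writeln(stagesFrom(9)); // 0
}
```

Deleting either `goto case` statement turns the accidental fallthrough into a compile-time error instead of a silent bug.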

So if more D users were doing that kind of thing, or periodically tweeting out three lines about what they're working on with relevant hashtags, maybe throwing some polls out there, that sort of thing drives engagement.

This led to some discussion about SEO, hashtags, the ineffectiveness of ads, etc.

To wrap up, Adam said he'd been working on an ImportC article based on the stuff he'd done at DConf.


### Luís

__CTFE integer overflows__

Luís opened with an issue Weka had run into with constant folding integers at compile time: there's no way to know if an integer is going to overflow. They'd like to have a warning for that. He's working on a linter using DMD-as-a-library and as a plugin for LDC. He'd like to have a warning for that in the linter, but there's no way to hook the constant folder to do what they want.

Walter said that the problem with doing integer overflow checks is that sometimes you want integer overflow. Luís agreed and said that we could have a way to ignore the checks when overflow is really wanted, but most of the time it isn't wanted. He said for the compiler, at least for compile-time stuff, we could do what Clang and GCC do for sanitizing signed integer overflow checks. At run time, we can use whatever sanitizers GCC and LLVM support.

He said he'd opened a PR about this and Dennis told him this was defined behavior. He agreed with that, but there were some use cases where it wasn't wanted. He understood that Walter didn't like warnings. Even if this was something that wasn't going to be upstreamed in the compiler, he'd like a way to query if a constant has been poisoned somehow and then it can be linted afterward. It fitted in with the work Razvan was doing on DMD-as-a-library.

Walter said that first of all, sometimes you do want integer overflow. Second, integer adds happen in a bunch of places that aren't in the source, e.g., adding on the offset of a struct member. Should those places be checked for integer overflow? He didn't think that was clear. There were a lot of issues around integer overflow that he hadn't resolved. Another problem was that it didn't fit in DMD's backend. Having different behavior at compile time and run time was not ideal.

Luís said that if DMD could do it at compile time, they could check it at run time. Ideally, they wanted both. Martin noted that Weka already had their own fork of the compiler. If Luís already had the PR, then they could just implement it on their side. (Dennis lost his Jitsi connection in the middle of this, so whatever else Martin said was lost with it. Dennis got back in fairly quickly, but Walter was speaking then).

Walter said that they'd have different behavior running through constant folding vs. run time. He thought that was not a nice thing to have, but if Weka were okay with that, then they could implement it in their fork of the compiler. And he said Steven had pointed out that they could use `checkedint` explicitly. Some of those were done in the DMD source. Places that were vulnerable to overflow have an explicit check. Timon noted that floating-point behavior at CTFE is different from run time. Walter said that was a known issue that was difficult to fix. He'd made a PR to fix it and it broke a lot of stuff, so he backed off of it.
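As an illustration of both halves of this discussion, the sketch below shows constant folding wrapping silently and an explicit check using `adds` from druntime's `core.checkedint`. The helper `tryAdd` is a hypothetical name introduced here; it is not code from DMD or Weka.

```d
import core.checkedint : adds;
import std.stdio : writeln;

// Constant folding wraps without any diagnostic: this is defined behavior in D.
enum int wrapped = int.max + 1;
static assert(wrapped == int.min);

// Hypothetical helper: adds two ints and reports whether the sum fit.
bool tryAdd(int a, int b, out int result)
{
    bool overflow = false;
    // `adds` sets the flag to true on overflow and never clears it,
    // so one flag can accumulate across a chain of operations.
    result = adds(a, b, overflow);
    return !overflow;
}

void main()
{
    int r;
    writeln(tryAdd(1, 2, r), " ", r); // true 3
    writeln(tryAdd(int.max, 1, r));   // false
}
```

The explicit check costs an extra flag test at each call site, which is why it tends to be reserved for spots known to be vulnerable rather than applied everywhere.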

This led to a bit of a side discussion on floating-point differences, how `real` is explicitly specced as implementation dependent, that you should use doubles if you want portability... Walter said we could replace all the floating-point calculations with our own emulator, but then it runs like a pig. At some point, you just have to live with the differences.

The discussion got back to integer overflow, scenarios that are susceptible to it, how you should explicitly check in those cases, the performance cost of having it always enabled, and so on. It went on until Luís noted that LDC has a flag to enable overflow checks at run time. Walter suggested that they should also then be able to have a flag to enable them at compile time, and thought that would be a good idea. He suggested Luís talk to Martin about it (Martin had to leave the meeting just a few minutes earlier). Luís agreed.


__Attribute inference bug__

Luís's second issue was a bug with attribute inference that manifests during separate compilation. There were some cases where it wasn't happening correctly. When the issue showed up with something in Phobos, he didn't have a way to fix it. It didn't affect their main codebase too badly because they compiled it with one compiler invocation. But some of their projects weren't compiled with the same build system, and that was where it became an issue. When working on their laptops, compiling with one compiler invocation used too much memory. They needed to be able to compile with multiple invocations, but this bug was blocking them.

Walter said that was usually caused by forward reference issues. Luís agreed and said he'd tried to fix it but had been unable to. Walter said Dennis had some ideas about that. He'd wanted to swap the default. Dennis said he'd given it a try but had run into issues. When you queried the type of a function, then it eagerly needed to know the attributes.

In the interest of time, further discussion of this issue was pushed off to later.


__Semantic analysis in AST nodes__

Next, Luís had a topic related to DMD-as-a-library. He said he'd been working on some linting rules for an LDC lint. He'd found that a lot of AST methods did semantics under the hood. He thought it would be cool if we could have a project to split those out. This was part of the big refactor of the compiler.

Walter said he'd slowly been working toward minimizing the AST functions, pulling out non-virtual functions that didn't need to be there. It was a time-consuming process, but that was the direction he was headed.

Luís said the main issue from LDC lint's perspective was they wanted to query the AST, but not mutate it. Currently, some queries, like when testing for the presence of `@nogc`, would run semantics if the attribute was not yet found on that call. He didn't want to run semantics in those query functions. If it was a forward reference, just let him know.

He went on to explain that LDC lint was using LDC's AST, and if he mutated the AST between semantics and codegen and something was relying on it not being mutated, then he was going to have undefined behavior on his side. Átila said that const should help with that. Luís agreed. But from a refactoring perspective, the semantics should be separated from the queries. You need to know that if you're querying something, it isn't going to mutate. Átila noted that was exactly what const was for. Some people complain that it's too strict, but this was the point of it.
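A minimal sketch of what "queries are const, semantics are not" could look like in D follows. `FuncNode`, `Answer`, `runSemantic`, `isNogc`, and `queryNogc` are hypothetical names for illustration only; they do not correspond to DMD's or LDC's real AST classes. The query reports "unknown" rather than triggering analysis, which mirrors Luís's request to be told about a forward reference instead of having semantics run behind his back.

```d
import std.stdio : writeln;

enum Answer { unknown, no, yes }

// Hypothetical stand-in for an AST node; not a real compiler class.
class FuncNode
{
    private bool analyzed;
    private bool nogc;

    // Mutating step: stands in for running semantic analysis.
    void runSemantic(bool inferredNogc)
    {
        nogc = inferredNogc;
        analyzed = true;
    }

    // Pure query: `const` guarantees it cannot modify the node,
    // so it cannot run semantics as a side effect.
    Answer isNogc() const
    {
        if (!analyzed)
            return Answer.unknown;
        return nogc ? Answer.yes : Answer.no;
    }
}

// A linter that only receives const nodes can call queries,
// but a call to `node.runSemantic(...)` here would not compile.
Answer queryNogc(const FuncNode node)
{
    return node.isNogc();
}

void main()
{
    auto f = new FuncNode;
    writeln(queryNogc(f)); // unknown
    f.runSemantic(true);
    writeln(queryNogc(f)); // yes
}
```

Because const is transitive in D, the guarantee also covers everything reachable through the node, which is the strictness Átila referred to.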

Walter said the idea about separating things so that only const functions are available in the AST was a pretty good one.


### Steven

Steve started by noting that in D1, you could cast an int array literal to a ubyte array to set the type, e.g., `cast(ubyte[])[1,2,3,4]`. He said someone in Discord had shown that `cast(ubyte[])[10000,2,3,4]` did the same thing, but that the 10,000 ended up as whatever the truncated ubyte value of 10,000 was. He remembered this being just a means to set the type, not so you could cast away information. He thought this was weird and wondered if it was something we should address. Walter asked him to file a Bugzilla issue, and Steve said there should already be one. (No one posted a link in the chat, and I was unable to find it with a few different search terms.)
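The sketch below illustrates the distinction, assuming the element-wise behavior the language specification describes for casts of array literals. It contrasts a literal cast with a cast of an ordinary `int[]` variable, which reinterprets the underlying memory instead.

```d
import std.stdio : writeln;

void main()
{
    // Casting an array *literal* converts each element,
    // so the result is a four-element ubyte[].
    auto bytes = cast(ubyte[]) [1, 2, 3, 4];
    static assert(is(typeof(bytes) == ubyte[]));
    assert(bytes.length == 4);

    // Casting an int[] *variable* reinterprets the same memory,
    // so the length is recomputed in bytes.
    int[] ints = [1, 2, 3, 4];
    auto raw = cast(ubyte[]) ints;
    assert(raw.length == 4 * int.sizeof);

    // The case Steve raised: an out-of-range element is accepted and
    // truncated to its low 8 bits rather than rejected.
    auto odd = cast(ubyte[]) [10000, 2, 3, 4];
    writeln(odd);
}
```

Writing `ubyte[] b = [1, 2, 3, 4];` sets the type without a cast, and the compiler rejects an element such as 10000 that does not fit in a `ubyte`.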

Next, he said he'd found a significant flaw in his compile-time associative array code, which Dennis had merged into DMD for him. The `hashOf` function was doing things differently than the `toHash` on `TypeInfo`. This could cause the hashes coming out of them to be different sometimes, and that would result in an incorrect representation at run time. He said he thought he had a solution for it.

Next, he reported that he had a bunch of new students in his homeschool coding class. He'd started rewriting [his website that talks about it](https://codingcat.club). He said this kind of thing was useful for bringing in people who had never programmed, and that D was a really good first language. He was hoping to get that more completely filled out and maybe publish some videos to go along with it.

Finally, he brought up code-d, [the Visual Studio Code extension for D](https://github.com/Pure-D/code-d) maintained by Jan Jurzitza (Webfreak). Steve said that it was great when it worked, but there were a lot of weird things that caused it to break. He thought it would be important to have the DLF sponsor Jan to add some stuff to it. Átila wondered what that would look like since Jan was already working on it anyway. Steve wondered if there was anything we could do to help with debugger support or anything like that. Walter suggested that as a start, we could bring it under the DLF GitHub umbrella and encourage people to help out with it. Steve said he would bring it up with Jan.

Luís said Weka was sponsoring work on serve-d, a core component of code-d, and he talked about some of the work he'd done on it. Steve had experienced some serve-d crashes, so there was a bit of discussion about that.

(__NOTE__: Had I attended this meeting, I would have noted that we raised something like $3000 for Jan back in 2018. It was to be paid out for specific milestones. Once we hit the goal, Jan asked me to delay the payments for a while, and ultimately told me they were motivated just by working on D and not by the money. I recall we used some of the money to get Jan to DConf 2019, and Jan talked about putting some bounties on bugs.

I also would have reminded everyone that one of our major goals right now is to strengthen the ecosystem. We're absolutely willing to throw some money at code-d and any other important projects in our ecosystem where that money can help get something done. We have over $11,000 sitting [in our OpenCollective account](https://opencollective.com/dlang) that can be used for this sort of thing. Jan or anyone working on a key D project is welcome to reach out to me to discuss possibilities: bug bounties, contract work for specific tasks, etc.

What is an important, or key, D project other than code-d? That's one of the things we need to sort out. Until then, if you think your D project is important to the D ecosystem and you have a specific need for some financial support, please get in touch with me at soc...@dlang.org and we can talk about it.

For now, if you'd like to contribute to the D community in some way, helping improve code-d is a high-impact way to do it.)


### Adam R.

__Mac issues__

First, Adam brought up the state of D on Mac. He said it was okay, but not great. It felt unfinished. With LDC, you had all the architectures, but not the latest language features. With DMD, you had the language features, but not the architectures. And lately, regular users had been encountering linker issues, and he'd found some codegen bugs. It was good enough for him, and he made it work, but it wasn't as good as it could be.

Luís said that Weka was going to support AArch64 and would probably have some changes for upstream.

Walter asked Adam if the codegen problems were with DMD, LDC, GDC, or all of them. Adam said that DMD had the codegen bug. Walter asked for a link. Adam said he didn't have it right now. Walter asked if Adam could email it to him. Adam said he would. He said to reproduce it, you had to build a GUI application. He said he could try to make a smaller one. Walter said that if Adam could isolate it to one function that he could compile and look at the generated code, that would be even better. He said it was often relocations or the fixups that were wrong on the Mac because they did weird things in it.


__String interpolation__

Next, he brought up string interpolation, saying we'd been working on this for years. He said John and Andrei had written [a pretty good proposal](https://github.com/John-Colvin/YAIDIP) (YAIDIP) that hit real-world issues they were having at Symmetry, and it was a pity that the D language was so stagnant. Átila said the issue was that John and Andrei had never finished the proposal. Adam said the implementation worked and we shouldn't put up so much useless red tape when we could have just moved on and been productive. Átila said he hadn't been aware there was an implementation of it. The last he'd heard they were still working on it. Adam said he'd written an implementation for the other DIP he'd worked on and had withdrawn it in favor of YAIDIP, but the core of it was essentially the same. Átila asked where this implementation was. Adam said it was in a DMD PR somewhere.

(__NOTE__: Had I attended, I would have noted as a reminder that we've got a pause on new features at the moment. One of our major goals is stabilizing the language and the library. We plan to start looking at new features again, and launch a new streamlined DIP process, once we've finalized the editions proposal.)


### Átila

Átila started by saying he had done some work on editions the week before but had been too busy to make more progress this week. Instead, he had put together and given a talk about D at CERN (he used to work there), hoping to capitalize on the fact that they might build a new accelerator and are thinking of switching languages.

Next, he said he'd discovered that a one-line file with `import std.file` takes 200ms to compile, and that was nuts. He needed to figure out at some point exactly what the problem was. It was just semantic analysis from the import alone. He wasn't even generating the object file. On the same machine, he also tried a C++ compile with just `#include <iostream>` and that took 400ms. He said that twice as fast as C++ was nowhere near good enough. Walter agreed.

Átila wasn't sure what the next steps were. He thought we needed to solve the problem of circular imports in Phobos anyway, but he wasn't 100% convinced that would make a big dent in these build times. Steve wondered if it had to do with CTFE running things when it didn't have to. Átila said sure. It was one thing if you had a static foreach over a million items. That was going to take a while and there was nothing you could do about it besides having a better CTFE engine. But other than doing stupid things, it shouldn't be taking so long. He didn't even use any of the symbols from the imported file.

Walter noted that templates weren't semantically analyzed just by importing them. They had to be expanded. So something in `std.file` was expanding the templates. Átila suspected it had something to do with `std.uni`. He remembered something like this on something else he was working on that imported it.


### Walter

Walter said he'd been refactoring the front end with the goals of simplifying it and making it easier to understand and easier to implement DMD-as-a-library. He said he would defer discussion of that to the upcoming meeting that Razvan was in charge of. He thought Luís's idea of having const AST functions was a good one.

He'd managed to minimize the dependencies of several modules. That made the compiler easier to work with. He planned to continue with that. Luís noted that the expression module was one of the biggest files in DMD. Walter said that one was on his list. He'd been looking at splitting it into two files.

Aside from that, he was still doing the constant work of bug fixing and trying to stabilize the language.

(__NOTE__: The meeting Walter referred to was a kind of focused workgroup we'd scheduled to sort out some decisions about the implementation of DMD-as-a-library. There ended up being two of them in October, and they were held in place of our normal planning sessions. I sent invitations to Jan Jurzitza, Prajawal SN, and Luís Ferreira. I'll include some info about those meetings in a combined October/November planning update.)


### Martin, Andrei

Both Martin and Andrei had to leave the meeting before they had a turn.


## Conclusion

As the meeting wrapped up and Razvan asked if anyone had anything to add, Timon said he'd recently participated in a couple of programming contests. One thing he'd noticed was that his D solutions were usually the most succinct among all the contestants. Walter encouraged him to write up a paragraph or two about it in the forums and publicize it elsewhere.


Our next monthly meeting took place on November 10 at 16:00 UTC.
