[rust-dev] statement-expressions and block-terminators

Graydon Hoare Tue, 23 Nov 2010 14:34:26 -0800

Hi,

Some of you may have noticed that in the rewrite from rustboot to rustc we're becoming substantially more expression-language-ish. This is mostly a result of me yielding to the preferences of other developers (and LLVM's semantics), as well as some hint that things get much easier in syntax extensions and calculating compile-time-constants if we permit more "statement-ish" forms as expressions. Particularly conditionals.

We've run into a (common, seen in many other languages) sort of problem along the way here, which is that some expressions are implicitly ignored (or must be, due to being in an ignored context) whereas others are not. We have a nil-type (), but we don't always have sensible rules for forcing things to have the nil type by context.

This email is a poll of alternative solutions. I'll give two example cases and ask people for their input on which modification of the rules feels best.


Example case that does compile:

  A:  auto x = if (foo()) { 10; } else { 11; };

Example case that does not compile:

  B:  if (foo()) { 10; } else { "hello"; }

We can write this in rust at the moment, but in the rustc typechecking rules it will fail to compile, because 'if' is an expression-statement, expressions have types, and the types of the two branches (judged as the last statement's expression value, if it's an expression, or else nil) are of different types.

Here are some approaches to solving this example. Please pick the one you like the most:

(1) Kick all branchy expressions out of the expression grammar, put them back in the statement grammar. Case B will compile, and case A must be rewritten like so:


  A:  auto x = { auto t = 11; if (foo()) { t = 10; }; t; };

This is the C-with-GNU-extensions model.
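For readers more used to present-day Rust syntax (which differs from the 2010 syntax in this email), here is a rough sketch of what the statement-style rewrite of case A amounts to. The function name and the `cond` parameter are hypothetical stand-ins for the email's `foo()`; this only illustrates the shape of option (1), not how rustc worked at the time.

```rust
/// Option (1) style: the conditional is a plain statement that assigns into
/// a temporary, instead of an expression that yields a value.
/// `cond` is a hypothetical stand-in for the email's `foo()`.
fn pick_statement_style(cond: bool) -> i32 {
    // Start with the "else" value, as in the rewritten case A.
    let mut t = 11;
    if cond {
        t = 10;
    }
    // Hand back the temporary explicitly, statement-style.
    return t;
}
```

Calling `pick_statement_style(true)` yields 10 and `pick_statement_style(false)` yields 11; the cost of the approach is the extra mutable temporary.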

(2) Hoist all statements up into the expression language and make semicolon into a sequencing operator, with a trailing-semi ignored by the parser. Then we need to rewrite only the second case to force unit types in the to-be-ignored differing branches.


  B:  if (foo()) { 10; () } else { "hello"; () }

Though we'd also be *allowed* to rewrite the first case to drop the semicolons:


  A:  auto x = if (foo()) { 10 } else { 11 };

This is the OCaml approach.
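As a loose illustration in present-day Rust syntax (not the 2010 syntax above, and not OCaml's actual rules), forcing unit explicitly at the end of each branch looks roughly like the sketch below. The helpers `ten` and `hello` and the `cond` parameter are hypothetical; they stand in for the literal values and `foo()` from case B.

```rust
// Hypothetical helpers standing in for the values 10 and "hello" in case B.
fn ten() -> i32 { 10 }
fn hello() -> &'static str { "hello" }

/// Case B with an explicit `()` closing each branch, so both branches
/// agree on the unit (nil) type even though the ignored values differ.
fn ignore_both(cond: bool) -> () {
    if cond { ten(); () } else { hello(); () }
}

/// Case A with the semicolons dropped: each branch yields its value.
fn pick_expression_style(cond: bool) -> i32 {
    if cond { 10 } else { 11 }
}
```

The explicit `()` is the price of treating the semicolon purely as a sequencing operator: the author has to say that a branch's value is being thrown away.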

(3) A slightly weaker form of (2), which is to reformulate blocks with the following grammar:


    block ::=  { [ stmt ; ]* expr? }

In other words, every block becomes a brace-enclosed sequence of semicolon-terminated statements, followed by an optional expr. If the expr is missing, it is implied as (). In this case we'd be rewriting only the first case:


  A:  auto x = if (foo()) { 10 } else { 11 };

This is similar to the OCaml rule in practice, except that it makes the presence or absence of the final semicolon in a block equivalent to ending the block with the nil type. This is a possible hazard (especially during refactoring or editing) to users who want to write a value-producing block but accidentally semicolon-terminate the last expression; but it's not a huge hazard since the typechecker will tell them the value they produced is of nil type. It just might be hit a lot.
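Blocks in present-day Rust behave much like the grammar in (3), so the rule can be sketched in current syntax. This is only an illustration: the helpers `ten` and `hello`, the `cond` parameter, and the function names are hypothetical, and the 2010 syntax in this email differs.

```rust
// Hypothetical helpers standing in for the values 10 and "hello".
fn ten() -> i32 { 10 }
fn hello() -> &'static str { "hello" }

/// Case A: no semicolon after the final expression, so each block
/// yields its value and the `if` as a whole has an integer type.
fn pick(cond: bool) -> i32 {
    let x = if cond { 10 } else { 11 };
    x
}

/// Case B: each branch ends in a semicolon, so the trailing expr is
/// missing and both blocks are implied to be `()`, which typechecks.
fn ignore(cond: bool) {
    if cond { ten(); } else { hello(); }
}

/// The hazard described above: a stray final semicolon turns a
/// value-producing block into one of unit (nil) type.
fn trailing_semicolon_is_nil() -> () {
    let unit: () = { ten(); };
    unit
}
```

If the annotation in `trailing_semicolon_is_nil` were `i32` instead of `()`, the typechecker would reject it, which is the "it will tell them" safety net mentioned above.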

(4) Statically determine the contexts in which an expression's value "will be used" in an outer expression, and only typecheck those contexts. This permits both of the examples to compile as-is, but it's the most unorthodox approach, and poses a refactoring hazard as code may become type-invalid when nested into an expression context that "uses" its previously-ignored result. Again, as in (3) the typechecker will catch these cases, but they might happen more or less often than those in (3).

We can't think of any other options. Significant whitespace is not an option :)

Personally my knee-jerk reaction is to embrace (1) since I like statements anyway, but I can see plausible arguments for the other 3. Can I get a show of hands? We have to pick something.


-Graydon
_______________________________________________
Rust-dev mailing list
[email protected]
https://mail.mozilla.org/listinfo/rust-dev
