Re: Rehabilitating switch -- a scorecard

Brian Goetz Wed, 19 May 2021 04:13:22 -0700

So, here's another aspect of switches rehabilitation, this time in termsof syntactic rewrites. By way of analogy with lambdas, there's asequence of


    x -> e                 // parens elided in unary lambda

is-shorthand-for

    (x) -> e               // types elided

is-shorthand-for

    (var x) -> e // explicit request for inference

is-shorthand-for

    (<actual type> x) -> e // explicit types

That is, there is a canonical (lowest) form, and the various shorthandsform a chain of embeddings. The chain shape reduces cognitive load onthe user, because instead of thinking "there are seven forms of lambda",they can instead think there is single canonical form, with progressiveoptions for leaving things out / mushing things together.


We get more of a funnel with the syntax of switch:

    case L, J, K -> X;

is-shorthand-for

    case L, J, K: yield X; // expression switch, X is an expression
    case L, J, K: X;         // expression switch, X is a block
    case L, J, K: X; break;  // statement switch

and

    case L, J, K: X;

is-shorthand-for

     case L:
     case J:
     case K:
         X;



On 5/17/2021 5:36 PM, Brian Goetz wrote:

This is a good time to look at the progress we've made with switch. When we started looking at extending switch to support patternmatching (four years ago!) we identified a lot of challenges derivingfrom switch's C legacy, some of which is summarized here:
http://cr.openjdk.java.net/~briangoetz/amber/switch-rehab.html
We had two primary driving goals for improving switch: switches asexpressions, and switches with patterns as labels. In turn, thesepushed on a number of other uncomfortable aspects of switch: fallthrough, totality, scoping, and null handling.
Initially, we were unsure we would be able to rehabilitate switch tosupport these new requirements without being forever bogged down bythe mistakes of the past. Bit by bit, we have chipped away at thenegative aspects of switch, while respecting the existing code thatdepends on those aspects. I think where we've landed is, in manyways, better than we could have initially hoped for.
Throughout this exercise, there were periodic calls for "just toss itand invent something new" (which we sometimes called "snitch",shorthand for "new switch"*), and no shortage of people's attempts todesign their ideal switch construct. We resisted this line of attack,because we believed having two similar-but-different constructs livingside by side would be more annoying (and confusing) to users than arehabilitated, albeit more complex, construct.
The first round of improvements came with expression switches. Thiswas the easy batch, because it didn't materially change the set ofquestions we could ask with switch, just the form in which we askedthe question. This brought the following improvements:
- Switches as expressions. Many existing switch statements are inreality modeling expressions, in a more roundabout and less safe way. Expressing it directly is simpler and less error-prone. - Checked totality. The compiler enforces that a switch expressionis exhaustive (because, expressions must be total). In the case ofenum switches, a switch that covers all the cases needs no defaultclause, and the compiler inserts an extra case to catch novel valuesand throw (ICCE) on them. (Eventually the same will be true forswitches on sealed classes as well.) - A fallthrough-free option. Switches now give us a choice betweentwo styles of _switch blocks_, the old willy-nilly style, and the newsingle-consequent (arrow) style. Switches that choose arrow-styleneed not reason about fallthrough.
Unfortunately, it also brought a new asymmetry; switch expressionsmust be total (and you get enhanced type checking for this), butswitch statements cannot be. This is a shame, since the improved typechecking for totality is one of the best things about the improvementsin switch, as a switch that is total by virtue of actually coveringall the cases acts as a tripwire against new enum constants /permitted subtypes being added later, rather than papering it overwith a catch-all. We explored several ways to explicitly add backtotality checking, but this always felt like a hack, and requires theprogrammer to remember to ask for this checking.
Our resolution here offers a path to true healing with minimal userimpact, by (temporarily) carving out the semantic space of oldstatement switches. A "legacy switch" is a statement switch on anumeric primitive or its box, enum, or string, and which contains nopattern labels (i.e., a statement switch that is valid today.) Likeexpression switches, we will require non-legacy statement switches tobe exhaustive, and warn on non-exhaustive legacy switches. (To makethe warning go away, just insert a "default: " or "default: break" atthe bottom of the switch; not painful.) After some time, we should beable to make this warning an error, which again is easy to mitigatewith a single line. In the end, all switch constructs will be totaland type-checked for exhaustiveness, and once done, the notion of"legacy switch" can be garbage-collected.
Looking ahead to patterns in switch, we have several legacyconsiderations to navigate:
- Fallthrough and bindings. While fallthrough is not inherentlyproblematic (though the choice of fallthrough-by-default wasunfortunate), if a case label introduces a pattern variable, thenfallthrough to another case (at least one that doesn't introduce thesame pattern variable with the same type) makes little sense, and suchfallthrough has been outlawed. - Scoping. The block of a switch is one big scope, rather than eachcase label group being its own scope. (Again, one might call this ahistorical error, since there's little good that comes from this.) With case labels introducing variable declarations, this could havebeen a big problem, if one case polluted later cases (forcing users topick unique names for each binding in a switch statement), but flowscopoing solves that one. - Nulls. In Java 1.0, switching over reference types was notpermitted, so we didn't have to worry about this. In Java 5,autoboxing and enums meant we could switch over some reference types,but for all of these, null was a "silly" value so we didn't care aboutNPEing on null. In Java 7, when we added string switch, we could haveconceivably allowed `case null`, but instead chose to follow theprecedent set by Java 5. But once we introduce switches over anytype, with richer patterns, eagerly NPEing on null becomes much moreproblematic. We've navigated this by say that switches can NPE onnull if they have no nullable cases; nullable cases are those thatexplicitly say "null", and total patterns (which always come lastsince they dominate all others.) The old rule of "switches throw onnull" becomes "switches throw on null, except when they say 'casenull' or the bottom case is total." Default continues to mean what italways did -- "anything not already matched, except null."
The new treatment of null actually would have fallen out of thedecisions on totality, had we not gotten there already via anotherpath. Our notion of totality accounts for "remainder", which includesthings like novel subclasses of sealed types that did not exist atcompile time, which it would not be reasonable to ask users to writecode to deal with, and null fits into this treatment as well. We typecheck that a switch is sufficiently total, and then insert extra codeto catch "silly" values that are not otherwise handled, includingnull, and throw. (This also enables DA analysis to truly trust switchtotality.)
Where we land is a single unified switch construct that can be eithera statement or an expression; that can use either old-style flow(colon) or the more constrained flow style (arrow); whose case labelscan be constant, patterns (including guarded patterns), or a mix ofthe two; which can accept the legacy null-hostility behavior, or canoverride it by explicitly using nullable case labels; and which arealmost always type checked for totality (with some temporary, legacyexceptions.) Fallthough is basically unchanged; you can getfallthrough when using the old-style flow, but becomes less importantas fallthrough is (mostly) nonsensical in the presence of patterncases with bindings, and the compiler prevents this misuse. Thedistinction between "legacy" switches and pattern switches istemporary, with a path to getting to "all switches are total" over time.
I think we've done a remarkable job at rehabilitating this monster.
*Someone actually suggested using the syntax "new switch", on thebasis that new was already a keyword. Would not have aged well.

Re: Rehabilitating switch -- a scorecard

Reply via email to