Re: Switch labels (null again), some tweaking

Brian Goetz Wed, 28 Apr 2021 13:12:13 -0700

Armed with this observation, here's a restack of the terminology and rules, but with no semantic differences from the current story, just new names.


The basic idea is:
 total on T with no remainder -> total on T
 total on T with remainder R -> covers T, excepting/modulo R



Concepts:

 - A pattern P can be _total_ on a type T. Total patterns match every element of the value set of T, including null.
 - A set of patterns P* can _cover_ a type T, excepting/modulo R*. This means that every element in the value set of T that is not matched by some pattern in R* is matched by some pattern in P*.

 - If P is total on T, { P } covers T (with no exceptions.)

Base cases:
 - (type patterns) The type pattern `T t` is total on all U <: T
 - (var patterns) The var pattern `var x` is total on all T

 - (default) The label `default` corresponds to a pattern that covers all types, modulo null.
 - (sealing) For an abstract sealed type S permitting T0..Tn, the set of patterns { T0, T1, ... Tn } covers S, modulo null and novel subtypes of S.
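
To make "total, including null" concrete, here is a minimal sketch, assuming JDK 21 or later (where record patterns are final). The names `NestedTotalDemo` and `Box` are invented for illustration. It only exercises the nested position, where `var x` and `String s` are total on the component type `String` and so match a null component. In the language as eventually shipped, a null at the top level of a switch is handled by `case null` rather than by a total pattern, so this should not be read as a rendering of every rule above.

```java
final class NestedTotalDemo {
    // Hypothetical one-slot record, for illustration only.
    record Box<T>(T contents) {}

    // `var x` is total on the component type, so a null component still matches.
    static String describe(Box<String> box) {
        return switch (box) {
            case Box<String>(var x) -> "box holding " + x;
        };
    }

    // `String s` is total on String as well; the record pattern itself
    // still requires a non-null Box.
    static boolean matchesTypePattern(Box<String> box) {
        return box instanceof Box<String>(String s);
    }
}
```

Calling `describe(new Box<>(null))` goes through the single case rather than throwing, which is the "including null" part of totality seen one level down.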


Induction cases:

 - (lifting) For a deconstruction pattern D(T), { D(Q) : Q in Q* } covers D (modulo D(R*) and null) iff Q* covers T modulo R*.



Construct restrictions:

 - The pattern on the RHS of instanceof may not be total on the type of the LHS.
 - In a (non-legacy) switch on x : T, the set of patterns in the case labels must cover T, excepting some remainder. It throws on the remainder.
 - In a pattern assignment `P = e`, where `e : T`, P must cover T. It throws on the remainder.

This is just a restack of the terms, but I think it eliminates the most problematic part, which is 'total with remainder'. Now, total means total. Total with remainder becomes "covers, excepting".





On 4/28/2021 2:13 PM, Brian Goetz wrote:

I think you're right that the terminology is currently biased for mathematicians, which is a necessary initial phase to prove that the language is right, but needs to get to the part where the terminology makes sense to Joe Java.
The notion of "total with remainder" is indeed confusing, and we need to find a better way to say it, but we come by it honestly, because the "remainder" corresponds to cases that no reasonable Java developer would want to be forced to write out, such as "novel enum value".
I think we can make things slightly (but not enough) better by using "total" for single patterns only, and using "exhaustive" for sets of patterns, since that's what drives switch exhaustiveness.
But that's only a start.

On 4/28/2021 2:09 PM, Maurizio Cimadamore wrote:
I think I got the two main fallacies which led me down the wrong path:
1. there is a distinction between patterns that are total on T, and patterns that are total on T, but with a "remainder"
2. there is a distinction between a pattern being total on T, and a set of patterns being total (or exhaustive) on T
This sent me completely haywire, because I was trying to reason in terms of what a plain pattern instanceof would consider "total", and then translate the results to nested patterns in switch - and that didn't work, because two different sets of rules apply there.
Maurizio

On 28/04/2021 18:27, Brian Goetz wrote:
I think part of the problem is that we're using the word "total" in different ways.
A pattern P may be total on type T with remainder R. For example, the pattern `Soup s` is total on `Soup` with no remainder.
A _set_ of patterns may be total on T with remainder R as well. (The only way a set of patterns is total is either (a) one of the patterns in the set is already total on T, OR (b) sealing comes into play.) Maybe this should be called "exhaustive" to separate from pattern totality.
Switch exhaustiveness derives from set-totality.
Instanceof prohibits patterns that are total without remainder, for two reasons: (1) it's silly to ask a question which constant-folds to `true`, and (2) the disagreement between traditional `instanceof Object` and `instanceof <total pattern>` at null would likely be a source of bugs. (This was the cost of reusing instanceof rather than creating a new "matches" operator.)
Foo x = ...
if (x instanceof Bar)
The instanceof will not be considered total, and therefore be accepted by the compiler (sorry to repeat the same question - I want to make sure I understand how totality works with sealed hierarchies).
If the RHS of an `instanceof` is a type (not a type pattern), then this has traditional `instanceof` behavior. If Foo <: Bar, then this is in effect a null check.
If the RHS is a _pattern_, then the pattern must not be total without remainder. If Foo <: Bar, `Bar b` is total on Foo, so the compiler says "dumb question, ask a different one."
If the RHS is a non-total pattern, or a total pattern with remainder, then there's a real question being asked. So in your Lunch-permits-Soup example, you could say
    if (lunch instanceof Soup s)
and this matches _on non-null lunch_. Just like the switch. The only difference is switch will throw on unmatched nulls, whereas instanceof says "no, not an instance", but that's got nothing to do with patterns, it's about conditional constructs.
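
A small sketch of that instanceof behavior, assuming JDK 17 or later for sealed types and type patterns. The class `InstanceofNullDemo`, the `flavor` component and both method names are made up for illustration; only `Lunch` and `Soup` come from the example above.

```java
final class InstanceofNullDemo {
    // Hypothetical hierarchy mirroring the Lunch-permits-Soup example.
    sealed interface Lunch permits Soup {}
    record Soup(String flavor) implements Lunch {}

    // Pattern on the RHS: matches only a non-null Soup, and never throws.
    static String viaPattern(Lunch lunch) {
        if (lunch instanceof Soup s) {
            return "soup: " + s.flavor();
        }
        return "no match";
    }

    // Plain type on the RHS: with a static type of Lunch this is in
    // effect a null check.
    static boolean viaType(Lunch lunch) {
        return lunch instanceof Lunch;
    }
}
```

With a null argument, `viaPattern` falls through to "no match" and `viaType` returns false; neither throws, which is the conditional-construct behavior described above.
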
If that's the case, I find that a bit odd - because enums kind of have the same issue (but we have opted to trust that a switch on an enum is total if all the constants known at compile time are covered) - and, to my eyes, if you squint, a sealed hierarchy is like an enum for types (e.g. sum type).
OK, let's talk about enums.  Suppose I have:

    enum Lunch { Soup }

and I do

    switch (lunch) {
        case Soup -> ...
    }
What happens on null? It throws, and it always has. The behavior for the sealed analogue is the same; the `Soup s` pattern matches the non-null lunches, and if null is left unhandled elsewhere in the switch, the switch throws. No asymmetry.
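
The enum half of that claim can be checked with a short sketch, assuming JDK 14 or later for switch expressions. `EnumSwitchNullDemo` and its two methods are hypothetical names; the enum is the one from above.

```java
final class EnumSwitchNullDemo {
    enum Lunch { Soup }

    // Covers every constant known at compile time, so no default is needed.
    static String describe(Lunch lunch) {
        return switch (lunch) {
            case Soup -> "soup";
        };
    }

    // Reports the result, or the name of the exception the switch threw.
    static String outcome(Lunch lunch) {
        try {
            return describe(lunch);
        } catch (NullPointerException e) {
            return "NullPointerException";
        }
    }
}
```

`outcome(Lunch.Soup)` yields "soup", while `outcome(null)` reports the NullPointerException thrown by the switch itself.
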
Anyway, backing up - this below:

```

switch (lunch) {
    case Box(Soup s):
        System.err.println("Box of soup");
        break;

    case Bag(Soup s):
        System.err.println("Bag of soup");
        break;

    /* implicit */
    case Box(null), Bag(null): throw new NPE();
}
```
is good code, which says what it means. I think the challenge will be to present error messages (e.g. if the user forgot to add case Box(null)) in a way that makes it clear to the user as to what's missing; and maybe that will be enough.
The challenge here is that we don't want to force the user to handle the "silly" cases, such as:
    Boolean bool = ...
    switch (bool) {
        case true -> ...
        case false -> ...
        case null -> ... // who would be happy about having to write this case?
    }

and the similarly lifted:

    Box<Boolean> box = ...
    switch (box) {
        case Box(true) -> ...
        case Box(false) -> ...
        case Box(null) -> ... // who would be happy about having to write this case?
        case Box(novel) -> ... // or this case?
        case null ->           // or this case?
    }
So we define the "remainder" as the values that "fall into the cracks between the patterns." Users can write patterns for these, and they'll match, but if not, the compiler inserts code to catch these and throw something.
The benefit is twofold: not only does the user not have to write the stupid cases (imagine if Box had ten slots, would we want to write the 2^10 partial null cases?), but because we throw on the remainder, DA can treat the switch as covering all boxes, and be assured there are no leaks.
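
A rough sketch of the remainder in action, assuming JDK 21 or later, where record patterns in switch are final. The names `RemainderDemo`, `Salad`, `describe` and `outcome` are invented for illustration, and the exception types noted afterwards are what I understand JDK 21 to throw, not something this thread specifies.

```java
final class RemainderDemo {
    // Hypothetical sealed hierarchy and wrapper, for illustration only.
    sealed interface Lunch permits Soup, Salad {}
    record Soup() implements Lunch {}
    record Salad() implements Lunch {}
    record Box<T>(T contents) {}

    // { Box(Soup), Box(Salad) } covers Box<Lunch>; the remainder
    // (null and Box(null)) has no case written for it.
    static String describe(Box<Lunch> box) {
        return switch (box) {
            case Box<Lunch>(Soup s) -> "box of soup";
            case Box<Lunch>(Salad s) -> "box of salad";
        };
    }

    // Reports either the result or the simple name of whatever was thrown.
    static String outcome(Box<Lunch> box) {
        try {
            return describe(box);
        } catch (RuntimeException e) {
            return e.getClass().getSimpleName();
        }
    }
}
```

The switch compiles without a default or any null case. On JDK 21, `outcome(new Box<>(null))` reports MatchException and `outcome(null)` reports NullPointerException, so nothing in the remainder falls through silently.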
