We characterize patterns by their /applicability/ (static type checking), /unconditionality/ (can matching be determined without a dynamic check, akin to the difference between a static and a dynamic cast), and /behavior/ (under what conditions does it match, and what bindings do we get?).

       Currently shipping

As currently shipping, we have one kind of pattern: type patterns for reference types. We define the useful term “downcast convertible” to mean there is a cast conversion that is not unchecked. So |Object| and |ArrayList| are downcast-convertible to each other, as are |List| and |ArrayList|, as are |List<String>| and |ArrayList<String>|, but not |List<?>| to |ArrayList<String>|.

A type pattern |T t| for a ref type T is /applicable to/ a ref type U if U is downcast-convertible to T.

A type pattern |T t| is /unconditional/ on |U| if |U <: T|.

A type pattern |T t| matches a target x when the pattern is unconditional, or when |x instanceof T|; if so, its binding is |(T) x|.
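The shipped behavior can be seen in a small sketch (class and method names here are hypothetical, for illustration only): the pattern |ArrayList<?> a| is applicable to |Object| because |Object| is downcast-convertible to |ArrayList|, but it is not unconditional on |Object|, so matching requires a dynamic check; on success the binding is the cast value.

```java
import java.util.ArrayList;
import java.util.List;

public class TypePatternDemo {
    static String describe(Object x) {
        if (x instanceof ArrayList<?> a) {    // conditional match; binds a = (ArrayList<?>) x
            return "ArrayList of size " + a.size();
        } else if (x instanceof List<?> l) {  // broader type pattern, tried next
            return "other List of size " + l.size();
        }
        return "not a List";                  // includes null: instanceof rejects null
    }

    public static void main(String[] args) {
        System.out.println(describe(new ArrayList<>(List.of(1, 2, 3))));
        System.out.println(describe(List.of(1, 2)));
        System.out.println(describe("hello"));
    }
}
```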


       Record patterns

In the next round, we will add /record patterns/, which bring in /nested patterns/.

A record pattern |R(P*)| is applicable to a reference type U if U is downcast-convertible to R. A record pattern is never unconditional.

A record pattern |R(P*)| matches a target |x| when |x instanceof R|, and each component value of |x| matches the corresponding nested pattern |P_i|. Matching against components is performed using the /instantiated/ static type of the component.
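Since record patterns have not yet shipped, here is a rough hand-desugaring of what matching against a record pattern like |Point(int px, int py)| does: the |instanceof R| test, followed by extracting each component and matching it against its nested pattern. |Point| and |describe| are hypothetical names for illustration.

```java
public class RecordPatternDesugar {
    record Point(int x, int y) {}

    // Roughly equivalent to:  if (o instanceof Point(int px, int py)) ...
    static String describe(Object o) {
        if (o instanceof Point p) {   // the `x instanceof R` part of the match
            int px = p.x();           // nested pattern `int px` is unconditional on int,
            int py = p.y();           // so component extraction cannot fail to match
            return "point (" + px + ", " + py + ")";
        }
        return "no match";
    }

    public static void main(String[] args) {
        System.out.println(describe(new Point(1, 2)));
        System.out.println(describe("hello"));
    }
}
```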

Record patterns also drag in primitive patterns, because records can have primitive components.

A primitive type pattern |P p| is applicable to, and unconditional on, the type P. A primitive type pattern matches a target x when the pattern is unconditional, and its binding is |(P) x|.

Record patterns also drag in |var| patterns as nested patterns. A |var| pattern is applicable to, and unconditional on, every type U; when matched against a target |x| whose static type is |U|, its binding is |x| (think: identity conversion).

This is what we intend to specify for 19.


       Primitive patterns

Looking ahead, we’ve talked about how far to extend primitive patterns beyond exact matches. While I know that this makes some people uncomfortable, I am still convinced that there is a more powerful role for patterns to play here, and that is: as the cast precondition.

A language that has casts but no way to ask “would this cast succeed” is deficient; either casts will not be used, or we would have to tolerate cast failure, manifesting as either exceptions or data loss / corruption. (One could argue that for primitive casts, Java is deficient in this way now (you can make a lossy cast from long to int), but the monomorphic nature of primitive types mitigates this somewhat.) Prior to patterns, users have internalized that before a cast, you should first do an |instanceof| to the same type. For reference types, the |instanceof| operator is the “cast precondition” operator, with an additional (sensible) opinion that |null| is not deemed to be an instance of anything, because even if the cast were to succeed, the result would be unlikely to be usable as the target type.

There are many types that can be cast to |int|, at least under some conditions:

 * Integer, except null
 * byte, short, and char, unconditionally
 * Byte, Short, and Character, except null
 * long, but with potential loss of precision
 * Object or Number, if it’s not null and is an Integer

Just as |instanceof T| for a reference type T tells us whether a cast to T would profitably succeed, we can define |instanceof int| the same way: whether a cast to int would succeed without error or loss of precision. By this measure, |instanceof int| would be true for:

 * any int
 * Integer, when the instance is non-null (unboxing)
 * any reference type that is cast-convertible to Integer and is
   |instanceof Integer| (unboxing)
 * byte, short, and char, unconditionally (types that can be widened to
   int)
 * Byte, Short, and Character, when non-null (unboxing plus widening)
 * long when in the range of int (narrowing)
 * Long when non-null, and in the range of int (unboxing plus narrowing)

This table can be generated simply by looking at the set of cast conversions — and we haven’t talked about patterns yet. This is simply the generalization of |instanceof| to primitives. If we are to allow |instanceof int| at all, I don’t think there is really any choice of what it means. And this is useful in the language we have today, separate from patterns:

 * asking if something fits in the range of a byte or int; doing this
   by hand is annoying and error-prone
 * asking if casting from long to int would produce truncation; doing
   this by hand is annoying and error-prone
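These by-hand checks can be sketched as plain helper methods today; this is only an approximation of what a built-in |instanceof int| would subsume, and the method names are hypothetical.

```java
public class IntCastPrecondition {
    // long -> int: narrowing succeeds without truncation iff the value is in int range
    static boolean fitsInInt(long v) {
        return v >= Integer.MIN_VALUE && v <= Integer.MAX_VALUE;
    }

    // reference -> int: needs a non-null Integer to unbox;
    // instanceof already rejects null, matching the table above
    static boolean fitsInInt(Object o) {
        return o instanceof Integer;
    }

    public static void main(String[] args) {
        System.out.println(fitsInInt(42L));            // in int range
        System.out.println(fitsInInt(1L << 40));       // cast would truncate
        System.out.println(fitsInInt((Object) null));  // null never unboxes
    }
}
```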

Doing this means that

|if (x instanceof T) ... (T) x ... |

becomes universally meaningful, and captures exactly the preconditions for when the cast succeeds without error, loss of precision, or null escape. (And as Valhalla is going to bring primitives more into the world of objects, generalizing this relationship will become only more important.)

And if we’ve given meaning to |instanceof int|, it is hard to see how the pattern |int x| could behave any differently than |instanceof int|, because otherwise, we could not refactor the above idiom to:

|if (x instanceof T t) ... t ... |

Extending instanceof / pattern matching to primitives in this way is not only a sensible generalization, but failing to do so would expose gratuitous asymmetries that would be impediments to refactoring:

 * Cannot necessarily refactor |int x = 0| to |let int x = 0|. While
   this may seem non-problematic on the surface, as soon as |let|
   acquires any other feature besides “straight unconditional pattern
   assignment”, such as let-expressions, it forces users into a bad
   choice between “can use let, or can use assignment conversion, but
   not both.”

 * Loss of duality between |new X(args)| and |case X(ARGS)|. The
   duality between construction and deconstruction patterns (and
   similarly for static factories/patterns, builders/“unbuilders”, and
   collection literals/patterns) is a key part of the story; we take
   things apart in the same way we put them together. Any gratuitous
   divergence becomes an avoidable sharp edge.

Since these are related to assignment and method invocation, let’s ask: how do these conversions line up with assignment and method invocation conversions?

There are two main differences between the safe cast conversions and assignment context. One has to do with narrowing: “if it’s a constant expression and in range” is the best approximation assignment can make, while a context that accepts partial patterns can be more discriminating about the actual value, and so it should be. The other is the treatment of null: again, because of the totality requirement, assignment throws when unboxing a null, but pattern matching in a partial context can deal with it more gracefully, and simply decline to match.
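The null-unboxing difference is easy to demonstrate with the language as it stands; |tryUnbox| is a hypothetical helper for illustration.

```java
public class NullUnboxDemo {
    // Assignment context must be total, so unboxing null throws;
    // a partial pattern context could simply decline to match instead.
    static String tryUnbox(Integer boxed) {
        try {
            int i = boxed;   // unboxing conversion in assignment context
            return "unboxed " + i;
        } catch (NullPointerException e) {
            return "NPE: assignment has no way to decline";
        }
    }

    public static void main(String[] args) {
        System.out.println(tryUnbox(5));
        System.out.println(tryUnbox(null));
    }
}
```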

There are also some small differences between the safe cast conversions and method invocation context. There is the same issue with unboxing null (throws in (loose) invocation context), and method invocation context makes no attempt to do narrowing, even for literals. This last seems mostly a historical wart, which now can’t be changed because it would either potentially change (very few) overload selection choices, or would require another stage of selection.

What are the arguments against this interpretation? They seem to be various flavors of “ok, but, do we really need this?” and “yikes, new complexity.”

The first argument comes from a desire to treat pattern matching as a “Coin”-like feature, strictly limiting its scope. (As an example of a similar kind of pushback, in the early days it was asked: “Does pattern matching have to be an expression? Couldn’t we just have an ‘ifmatch’ statement?” See the answer here: http://mail.openjdk.java.net/pipermail/amber-dev/2018-December/003842.html) This is the sort of question we get a lot; there’s a natural tendency to try to “scope down” features that seem unfamiliar. But I think it’s counterproductive here.

The second argument is largely a red herring, in that this is /not/ new complexity, since these are exactly the rules for successful casts. In fact, not doing it might well be perceived as new complexity, since it results in more corner cases where refactorings that seem like they should work, do not, because of conversions.
