Re: Evolving CONSTANT_Class

Brian Goetz Mon, 15 Jun 2020 12:29:39 -0700

Here's a table listing all the type-flavored uses (where "X" means"allowed here" and "~" means "maybe not essential, but the semanticswould be clear"):

More specifically, in the first two columns X means "allowed now", andin the later columns, X means "proposed." Note too that the proposedSpecies column is identical to the proposed Class name column.

The primitive column is interesting as we probably are going totranslate away all of these to some sort of `Qint` type when they appearin these places, so in the JVM, are probably not needed.

Another way to handle it is to distinguish between a *species*, whichis a class-like entity, and a *species type*. It's helpful to rememberthat there may be inline types of species (that is, a "Q envelope" ofa species).

I think this is a fruitful direction; I can have `ArrayList[T] extendsList[T]` where it is a class-like use, and I can have `Foo[T].x` whereit is a type-like use.

1) Treat everything in the class/interface table as a degenerate useof a type. A class name is always interpreted as an L type.

Given that a specializable class Foo<T> gives rise to species Foo[x] andFoo[y], _and_ a class type Foo such that Foo[t] <: Foo for all t, theduality between class and type here seems inevitable.

- When a Class constant is viewed as a type (for (1) that's always,for (2) that's for type-flavored references), the implicit L envelopeis a historical wart. Do we also support explicit L descriptors? Do wetry to migrate the world away from the implicit envelopes?

I would love to migrate away, but I suspect the cost/benefit isn'tthere. Historical warts are OK.

- Should we add primitive types? How are they spelled? (The standarddescriptor syntax for primitives is already interpreted as a bareclass name.)

Given the way we are thinking for translation, where there is going tobe some Q type that stands in for primitives when used in class-ycontexts (if for no other reason than the double-slot thing), I don'tthink this is needed.

- How do we handle type variables, both top-level and nested? Eitherwe embed constant pool pointers in Utf8 entries (yuck!), or we need toextend Class constants to support references both to Utf8 entries andto [some new thing].

This is the stringy-vs-tree problem we've been wrestling with for a longtime. The solution to this problem seems to hinge on the solution tothat one.

- Should we revisit "naked" descriptor references, allowing them topoint to either bare Utf8 entries or Class constants andMethodType/[something else] constants? Do we try to migrate the worldaway from naked descriptor references?


I think this may well fall out of the "trees vs strings" discussion.

I'm appealing here to a design principle that seems to have driven the original constant 
pool design: Class constants are for things that get resolved (and can be cached); 
descriptor strings are little more than fancy names. This principle doesn't always get 
followed: the verifier sometimes loads classes named by descriptors; array type class 
constants resolve their element types without a separate entry; more recently, 
StackMapTables use Class constants to represent types, and MethodTypes resolve method 
descriptors "as if" there were class constants for all of the parameter types. 
But I think these, especially the recent ones, are mistakes, and I still think the 
original notion is a useful separation of concerns that we should try to follow in our 
design.

The tension that comes up here is that we want to be able to matchdescriptors between clients and declarations. I don't want to inventone way to describe class constants for species, and another way toembed species in descriptors.

Now, it may be possible (depending on our translation strategy) that wedon't need to embed species in descriptors, because we're just going toerase descriptors, and put the specialization information somewhereelse, for the VM to use opportunistically. That would make thesplitting strategy more appealing.

- For bare descriptors (type of a field), it's fine to use something like 
"LList[QVal;];". Or maybe it's useful to describe descriptors in terms of Class/Species 
constants. In any case, there's still a need to figure out how to parameterize a descriptor with 
live constants ("LList[$T];"), but I think this can be set aside as a separate problem.


This is the one I'm alluding to above.

So I think we need CONSTANT_SpecializedMethodref, which has 1) a pointer to a 
Methodref constant, and 2) pointers to some resolvable constants (typically, 
but maybe not exclusively, representing types). (Caveat: there are some details 
about the interaction between type arguments, overriding, and method resolution 
that I'm hand-waving about. Maybe the encoding will be stacked a little 
differently.)

We've been around this merry go round a few times too, going back andforth between cramming stuff into the descriptor string and putting themethod types somewhere else. Again, the translation story (can we leavedescriptors alone) impinges on this.

Don't forget that when you have a local generic class nested in ageneric method, the method args implicitly parameterize the nestedclass. Which means that when we refer to a species of the local class,we have to supply the type arguments for both the method and for thelocal class (and any other enclosing classes.) Again, there is alump/split choice here; we can smoosh together the arguments, or providea trail of witnesses to the enclosing arguments. If we choose thelatter, then it might be mix of C_SMRef and C_Species.

Re: Evolving CONSTANT_Class

Reply via email to