Re: Concerns about the plan for `==`

Brian Goetz Fri, 24 Jun 2022 08:08:46 -0700

I don't have an answer for you, but I can add some information to the mix.

Currently there are _nine_ "implementations" of `==`; one for references, and one for each of the eight primitives. Regardless of whether or not they are perfect tests of substitutability (curse you, floating point), the eight primitive `==` functions are highly domain-specific. They can be so because the primitives are monomorphic. In a sense, we've allowed primitives to "overload" `==` because monomorphism means we can define `==` with full knowledge of the domain, and without worry about non-well-definedness or the various other problems of `equals` in extensible class hierarchies (as EJ exhaustively catalogued).
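To make the floating-point aside concrete, here is a minimal sketch in today's Java showing where primitive `==` on `double` and `Double.equals` disagree. The class and method names are made up for illustration.

```java
public class PrimitiveEqualsDemo {
    /** Primitive double ==: IEEE 754 numeric comparison. */
    static boolean primitiveEq(double a, double b) {
        return a == b;
    }

    /** Double.equals: compares the bit patterns from Double.doubleToLongBits. */
    static boolean boxedEquals(double a, double b) {
        return Double.valueOf(a).equals(Double.valueOf(b));
    }

    public static void main(String[] args) {
        System.out.println(primitiveEq(Double.NaN, Double.NaN)); // false
        System.out.println(boxedEquals(Double.NaN, Double.NaN)); // true
        System.out.println(primitiveEq(0.0, -0.0));              // true
        System.out.println(boxedEquals(0.0, -0.0));              // false
    }
}
```

Neither answer is a perfect substitutability test in every context, which is the sense in which each primitive `==` is tuned to its own domain.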

What's being proposed here is that we evolve `Object==` from "compare identities" to a case analysis, to account for the fact that Object will describe more things:


    case (IdentityObject a, IdentityObject b) -> identity==(a, b)
    case (ValueObject a, ValueObject b) -> (isNull(a) == isNull(b))
                                           && (type(a) == type(b))
                                           && (state(a) == state(b))
    default -> false

Just as `identity==` was the best we could do as a default on polymorphic identity objects, this is the best we can do on polymorphic mixed identity/value objects. (There's a whole digression into overloading `==` on value types, but I'm not going to go there right now.)
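As a rough illustration only, the case analysis can be simulated in today's Java. Everything here is an assumption made for the sketch: `ValueObject` is a hypothetical marker interface (not a JDK type), `state()` is a made-up way to expose fields, and boxed primitives compared with `equals` stand in for primitive `==` (which differs for floating point).

```java
public class ObjectEqSketch {
    /** Hypothetical stand-in for the proposed ValueObject; not a real JDK type. */
    interface ValueObject {
        Object[] state(); // field values, exposed only for this sketch
    }

    record Point(int x, int y) implements ValueObject {
        public Object[] state() { return new Object[] { x, y }; }
    }

    record Line(Point from, Point to) implements ValueObject {
        public Object[] state() { return new Object[] { from, to }; }
    }

    /** True for the boxes that stand in for primitive fields in state(). */
    static boolean isBoxedPrimitive(Object o) {
        return o instanceof Integer || o instanceof Long || o instanceof Short
            || o instanceof Byte || o instanceof Float || o instanceof Double
            || o instanceof Character || o instanceof Boolean;
    }

    /** Simulated Object==, following the case analysis in the text. */
    static boolean objectEq(Object a, Object b) {
        if (a == null || b == null) {
            return a == b;                        // equal only if both are null
        }
        if (isBoxedPrimitive(a) && isBoxedPrimitive(b)) {
            return a.equals(b);                   // approximation of primitive ==
        }
        if (a instanceof ValueObject va && b instanceof ValueObject vb) {
            if (a.getClass() != b.getClass()) {
                return false;                     // type(a) == type(b)
            }
            Object[] sa = va.state();
            Object[] sb = vb.state();
            for (int i = 0; i < sa.length; i++) { // state(a) == state(b), recursively
                if (!objectEq(sa[i], sb[i])) {
                    return false;
                }
            }
            return true;
        }
        if (a instanceof ValueObject || b instanceof ValueObject) {
            return false;                         // default: mixed value/identity
        }
        return a == b;                            // identity==
    }

    public static void main(String[] args) {
        Line l1 = new Line(new Point(1000, 2000), new Point(0, 0));
        Line l2 = new Line(new Point(1000, 2000), new Point(0, 0));
        System.out.println(objectEq(l1, l2));                     // true
        System.out.println(objectEq(new Object(), new Object())); // false
        System.out.println(objectEq(new Point(1, 2), "x"));       // false
    }
}
```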

While we're not making the problem of "`==` is unreliable" better, and arguably making it incrementally worse by making it work in more cases that look a little like the cases in which it is unreliable, we *are* making something better here: you can now use `.equals()` everywhere. One of the complaints about `==` is that sometimes you use `==` and sometimes you use `.equals()` and sometimes you can accidentally use one where you should use the other. But this is because you couldn't previously use `.equals()` on primitives, so an `equals()` method would necessarily do things like:


    boolean equals(Object o) {
        return o instanceof Foo f
            && f.size == this.size
            && f.name.equals(this.name);
    }

What stinks here is that at each point, you have to ask yourself "equals, or `==`?" Now you can have a fixed rule: always say `.equals()`:


    boolean equals(Object o) {
        return o instanceof Foo f
            && f.size.equals(this.size)   // works on int!
            && f.name.equals(this.name);
    }

(The equals method on primitives is monomorphic so will JIT away, for anyone worried about the performance.)
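Since `f.size.equals(...)` does not compile for an `int` field in today's Java, one way to approximate the uniform rule right now is `java.util.Objects.equals`, which accepts `int` operands through autoboxing. This is a sketch under that assumption, not the proposed feature itself; the `Foo` class is filled in hypothetically around the fields used in the example above.

```java
import java.util.Objects;

public class UniformEqualsDemo {
    public static final class Foo {
        final int size;
        final String name;

        public Foo(int size, String name) {
            this.size = size;
            this.name = name;
        }

        @Override
        public boolean equals(Object o) {
            return o instanceof Foo f
                && Objects.equals(f.size, this.size)   // int operands are autoboxed
                && Objects.equals(f.name, this.name);  // also null-safe
        }

        @Override
        public int hashCode() {
            return Objects.hash(size, name);
        }
    }

    public static void main(String[] args) {
        Foo a = new Foo(1000, "widget");
        Foo b = new Foo(1000, new String("widget"));
        Foo c = new Foo(1001, "widget");
        System.out.println(a.equals(b)); // true
        System.out.println(a.equals(c)); // false
    }
}
```

The cost is boxing at each call site, so this trades some performance for the single "always use equals" rule.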

It is a little sad because we had to resolve the problem by using the unfortunate spelling all the time, because `==` got the good name, but that's not a new problem. But it means the cognitive load can disappear if we train ourselves to uniformly use `.equals()`.

We will surely have about a million calls to make `===` or `eq` or something else sugar for `.equals()`. We can consider that, but I don't think it's essential to do that now.



On 6/15/2022 1:51 PM, Kevin Bourrillion wrote:

What I think I understand so far:
The current plan for `==` for all bucket 2+ types (except the 8 _primitive_ types, as I still use the word) is to have it perform a fieldwise `==` comparison: identity equality for bucket 1 fields, what it's always done for primitive fields, and of course recurse for the rest.
If we consider that the broadest meaning of `a == b` has always been "a and b are definitely absolutely indistinguishable no matter what", then this plan seems to compatibly preserve that, which makes sense for purposes of transition.
What concerns me:
It's good for transition, at least on the surface, but it's a bad long-term outcome.
Users hunger for a shorter way to write `.equals()`, and they will think this is it. I would not underestimate the pushback they will experience to writing it out the long way in cases where `==` at least *seems* to do the right thing. Because in some number of cases, it *will* do the same thing; specifically, if you can recurse through your fields and never hit a type that overrides equals().
This is extremely fragile. A legitimate change to one type can break these expectations for all the types directly or indirectly depending on it, no matter how far away.
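A small sketch of that fragility in today's Java, with a hand-written `fieldwiseEq` standing in for the proposed `==` (an assumption of this example; records do not behave this way under `==` today). `Name` is a hypothetical type whose only field is a `String`, an identity type that overrides `equals()`.

```java
public class FragileEqDemo {
    record Name(String value) {}

    /** Stand-in for the proposed ==: fieldwise, using identity for reference fields. */
    static boolean fieldwiseEq(Name a, Name b) {
        return a.value() == b.value();
    }

    public static void main(String[] args) {
        String s = "kevin";
        Name a = new Name(s);
        Name b = new Name(s);
        Name c = new Name(new String(s)); // same characters, distinct String identity

        System.out.println(fieldwiseEq(a, b)); // true
        System.out.println(a.equals(c));       // true
        System.out.println(fieldwiseEq(a, c)); // false
    }
}
```

The fieldwise comparison agrees with `equals()` only while both names happen to share one `String` instance, which is exactly the kind of accident a distant change can undo.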
In supporting our Java users here, there's no good stance we can take on it: if we forbid this practice and require them to call `.equals`, we're being overzealous. If we try to help them use it carefully, at best users will stop seeing `Object==Object` as a code smell (as we have spent years training them to do) and then will start misusing it even for reference types again.
btw, why did I say it's good for transition "on the surface"? Because for any class a user might migrate to bucket 2+, any existing calls to `==` in the wild are extremely suspect and *should* be revisited anyway; this is no less true here than it is for existing synchronization etc. code.
What's an alternative?:
I'm sure what I propose is flawed, but I hope the core arguments are compelling enough to at least help me fix it.
The problem is that while we /can/ retcon `==` as described above, it's not behavior anyone really /wants/. So instead we double down on the idea that non-primitive `==` has always been about identity and must continue to be. That means it has to be invalid for bucket 2+ (at compile-time for the .val type; failing later otherwise?).
This would break some usages, but again, only at sites that deserve to be reconsidered anyway. Some bugs will get fixed in the process. And at least it's not the language upgrade itself that breaks them, only the specific decision to move some type to a new bucket. Lastly, we don't need to break anyone abruptly; we can roll out warnings as I proposed in the email "We need help to migrate from bucket 1 to 2".
A non-record class that forgets to override equals() from Object even upon migrating to bucket 2+ is also suspect. If nothing special is done, it would fail at runtime just like any other usage of `Foo.ref==Foo.ref`, and maybe that's fine.
Again, I'm probably missing things, maybe even big things, but I'm just trying to start a discussion. And if this can't happen I am just searching for a solid understanding of why.
