Re: Null checking in Beam

Jan Lukavský Fri, 22 Jan 2021 01:18:10 -0800

Hi,

I'll give my two cents here.

I'm not 100% sure that the 1-5% of bugs are as severe as other types of bugs. Yes, throwing NPEs at the user is not very polite. On the other hand, many of these actually boil down to user errors. Then we might ask what a correct solution would be. If we manage to figure out what the actual problem is and tell the user what specifically is missing or going wrong, that would be just awesome. On the other hand, if a tool used for avoiding "unexpected" NPEs forces us to code

  Object value = Objects.requireNonNull(myNullableObject); // or similar using Preconditions
  value.doStuff();

instead of just

  myNullableObject.doStuff()

what we actually did is a) made a framework happy, and b) changed the line at which the NPE is thrown by 1 (and yes, internally prevented the JVM from throwing SIGSEGV at itself, but that is a deeply technical thing). Nothing changed semantically, from the user's perspective.
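
A minimal sketch of this comparison, using hypothetical names (NullCheckDemo, plain, checked and explained are illustrative only, not Beam code). All three variants still throw NullPointerException on null input; only the one that passes a message to Objects.requireNonNull tells the user what is missing:

```java
import java.util.Objects;

// Hypothetical example class; not part of Beam.
final class NullCheckDemo {

  // Plain dereference: the JVM throws NullPointerException on this line.
  static int plain(String myNullableObject) {
    return myNullableObject.length();
  }

  // requireNonNull without a message: still an NPE, thrown one line earlier.
  static int checked(String myNullableObject) {
    String value = Objects.requireNonNull(myNullableObject);
    return value.length();
  }

  // requireNonNull with a message: still an NPE, but it names what is missing.
  static int explained(String myNullableObject) {
    String value =
        Objects.requireNonNull(myNullableObject, "myNullableObject must be set");
    return value.length();
  }

  // Runs the action and returns the NPE it throws, or null if none was thrown.
  static NullPointerException catchNpe(Runnable action) {
    try {
      action.run();
      return null;
    } catch (NullPointerException e) {
      return e;
    }
  }

  public static void main(String[] args) {
    System.out.println(catchNpe(() -> plain(null)) != null);   // true
    System.out.println(catchNpe(() -> checked(null)) != null); // true
    // prints "myNullableObject must be set"
    System.out.println(catchNpe(() -> explained(null)).getMessage());
  }
}
```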

Now, given that the framework significantly raises compile time (due to all the checks), causes many "bugs" to be reported by static code analysis tools (because the framework adds @Nonnull default annotations everywhere, even where Beam's code actually accounts for nullability of a field), and given how much we currently suppress these checks ($ git grep BEAM-10402 | wc -l -> 1981), I'd say this deserves a deeper discussion.


 Jan


On 1/20/21 10:48 PM, Kenneth Knowles wrote:

Yes, completely sound nullability checking has been added to the project via checkerframework, based on a large number of NPE bugs (1-5% depending on how you search, but many other bugs likely attributable to nullness-based design errors) which are extra embarrassing because NPEs were essentially solved, even in practice for Java, well before Beam existed.

Checker framework is a pluggable type system analysis with some amount of control flow sensitivity. Every value has a type that is either nullable or not, and certain control structures (like checking for null) can alter the type inside a scope. The best way to think about it is to consider every value in the program as either nullable or not, much like you think of every value as either a string or not, and to view method calls as inherently stateful/nondeterministic. This can be challenging in esoteric cases, but usually makes the overall code health better anyhow.

Your example illustrates how problematic the design of the Java language is: the analysis cannot assume that `getDescription` is a pure function, and neither should you. Even if it is aware of boolean-short-circuit it would not be sound to accept this code. There is an annotation for this in the cases where it is true (like proto-generated getters): https://checkerframework.org/api/org/checkerframework/dataflow/qual/Pure.html

The refactor for cases like this is trivial so there isn't a lot of value to thinking too hard about it.

if (statusCode.equals(Code.INVALID_ARGUMENT)) {
  @Nullable String desc = statusCode.toStatus().getDescription();
  if (desc != null && desc.contains("finalized")) {
    return false;
  }
}

To a casual eye, this may look like a noop change. To the analysis it makes all the difference. And IMO this difference is real. Humans may assume it is a noop and humans would be wrong. So many times when you think/expect/hope that `getXYZ()` is a trivial getter method you later learn that it was tweaked for some odd reason. I believe this code change makes the code better. Suppressing these errors should be exceptionally rare, and never in normal code. It is far better to improve your code than to suppress the issue.
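
To make the non-purity point concrete, here is a minimal sketch. FlakyStatus and GetterDemo are hypothetical names, not Beam or gRPC classes, and the getter is deliberately contrived to return a different value on its second call. The check-then-call-again form throws, while the read-once form does not:

```java
// Hypothetical stand-in for a status object; not the gRPC or Beam API.
final class FlakyStatus {
  private int calls = 0;

  // Deliberately impure: returns a description on the first call, null afterwards.
  String getDescription() {
    return (calls++ == 0) ? "already finalized" : null;
  }
}

final class GetterDemo {
  // Check-then-call-again: the second call may return something else.
  static boolean doubleCall(FlakyStatus status) {
    return status.getDescription() != null
        && status.getDescription().contains("finalized");
  }

  // Read once into a local: the null check and the use see the same value.
  static boolean singleCall(FlakyStatus status) {
    String desc = status.getDescription();
    return desc != null && desc.contains("finalized");
  }

  public static void main(String[] args) {
    System.out.println(singleCall(new FlakyStatus())); // true
    try {
      doubleCall(new FlakyStatus());
    } catch (NullPointerException e) {
      // Reached because the second getDescription() call returned null.
      System.out.println("NPE from the second getDescription() call");
    }
  }
}
```

A real getter is rarely this adversarial, but the analysis has no way to rule it out unless the method is annotated as pure.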

It would be very cool for the proto compiler to annotate enough that new-and-improved type checkers could make things more convenient.

Kenn

On Mon, Jan 11, 2021 at 8:53 PM Reuven Lax <re...@google.com> wrote:
    I can use that trick. However I'm surprised that the check appears
    to be so simplistic.

    For example, the following code triggers the check on
    getDescription().contains(), since getDescription returns a
    Nullable string. However even a simplistic analysis should realize
    that getDescription() was checked for null first! I have a dozen
    or so cases like this, and I question how useful such a simplistic
    check will be.

    if (statusCode.equals(Code.INVALID_ARGUMENT)
        && statusCode.toStatus().getDescription() != null
        && statusCode.toStatus().getDescription().contains("finalized")) {
      return false;
    }


    On Mon, Jan 11, 2021 at 8:32 PM Boyuan Zhang <boyu...@google.com> wrote:

        Yeah it seems like the checker is enabled:
        https://issues.apache.org/jira/browse/BEAM-10402. I used
        @SuppressWarnings({"nullness"}) to suppress the error when I
        think it's not really a concern.

        On Mon, Jan 11, 2021 at 8:28 PM Reuven Lax <re...@google.com> wrote:

            Has extra Nullable checking been enabled in the Beam
            project? I have a PR that was on hold for several months,
            and I'm struggling now with the compile failing due to complaints
            about assigning something that is nullable to something
            that is not nullable. Even when the immediate control flow
            makes it absolutely impossible for the variable to be null.

            Has something changed here?

            Reuven
