Re: [rust-dev] Exceptions without exceptions (was Re: Writing cross-platform low-level code)

Graydon Hoare Sat, 19 Nov 2011 11:41:09 -0800

On 19/11/2011 4:02 AM, David Rajchenbach-Teller wrote:

Let's start with a little safety vocabulary, not specifically related to
Rust. It is quite common to have two distinct words: "exceptions" for
exceptional but expected behavior – that you typically want to catch –

Heh. I appreciate the attempt to pick such terminology neutrally, but"exception" and "catch" very plainly points towards the moderntyped-exceptions-with-unwind-semantics implementation.

There are many other systems for error management: error codes, monads,conditions-and-restarts, signals-and-handlers, etc. etc.Unwinding-and-catching is a very specific strategy.

I point all this out because, quite a long while ago (before publishinganything), the Rust-on-paper design I worked on had a non-unwind-basedcondition system; errors represented an opportunity for adynamically-scoped handler to offer recovery, *or fail*.

and "failures" for issues that are beyond the control of the developer
and can at best be contained by shutting down a system and relaunch it.
To attain a high level of safety, having a manner of dealing with both
it is a Good Thing (tm).

Fortunately, these two concepts mostly map to your points 1. or 2.:

Agreed. There are at least expected-and-possibly-handled things andunexpected-or-definitely-can't-handle things. 1. and 2. :)

For category 2., as you mention, Rust has the mechanism of tasks and
failures. The non-spawnability of closures has me a little worried, but
I am sure that most/all useful cases can be encoded without this feature
and since my current hands-on experience with Rust tasks and failures is
essentially non-existing, I feel incompetent to discuss these in depth.

Closures will be spawnable when we've implemented unique closures.That's a WIP, not a permanently-missing-feature.

However, this whole thread was about category 1 and, more precisely,
about library-design of category 1, rather than language-design.

Ok.

For reporting exceptional behaviors, the "wonderful tag things" you
mention are necessary, and a sufficient *language* mechanism, but
stopping the *library* design at this point is inviting either the same
mess as Haskell or OCaml or the same mess as mozilla-central. Both
Haskell and OCaml have around 6 distinct – and largely type-incompatible
– manners of reporting exceptions. This does not even take into account
the fact that both OCaml and Haskell sum types are (or can be made) more
flexible/powerful than Rust tags, something we probably do not want in
Rust. On the other side, mozilla-central has only one mechanism, which
has a fixed set of exceptions, and new exceptions can only be added by
rebuilding the whole platform.

In order to avoid both pitfalls, I advocate that we need to decide of a
standard manner of reporting exceptions very early in the development of
Rust – ideally before anybody starts writing or using any library that
makes heavy use of exceptions, such as an IO library.

Yes, and I am proposing one: pass *in* your handlers (or symbolic codesindicating handler-strategy) and have the callee handle *at the site ofthe condition*. Sorry if I wasn't clear enough about the implied use oftags I meant, up-thread.

I'm serious. I've read and understood what you wrote above, so I'll askyou do the courtesy of reading and understanding the followingparagraphs fully as well. I'm not writing them not to clobber you withthe Obvious Superiority of my own beliefs -- they may well be wrong --just to clarify exactly what I'm suggesting, what I'd do asalternatives, and why.

The old Rust condition system was modeled on the condition system inMesa. You named "signals" as global items and gave a syntactic form torouting a given signal to a locally-defined "handler" in the caller,much as you would a try/catch block. The difference is that a handler inthis scheme is a typed function-like definition dangling after theprotected block. It reads like so:



try {
  os::open(fname);
} handle os::file_not_exist(str filename) -> file {
  ret os::create(fname);
}

So the recovery logic remains off the main code path, like a modern"catch block", but with a fn-like signature: arguments and its own"recovery value" return type. At signal occurrence, the originating sitewould invoke the signal by item name; this would cause the runtime tofind the innermost installed handler via either"head-of-a-task-local-list" search, or by static code-range search ofthe caller stack, similar to C++-unwinding, and call it. The handlerwould either return the typed recovery value, or fail. Failure to locatea handler at all, of course, also generates a fail.

This is a nice pleasant scheme half way between Liskov's CLU exceptionsand lisp's restarts. It has the positive property that it's orientedtowards handling at the signalling site rather than unwinding when youactually intend to continue; but it introduces fewer moving parts thanthe lisp system.

During early review, someone -- I think possibly Brendan? -- pointed outsome retrospective comments -- I think possibly from Lampson? -- on thesystem in Mesa. The retrospective was somewhat damning: not of thesystem in particular, but of the whole notion of splitting the recoverypath off into a slow-to-invoke secondary handler (as in Mesa but also asin most modern exception systems).

The retrospective reasoning, IIRC (working from memory here; if Brendanor whoever pointed it out is reading I'd appreciate a pointer to theoriginal text) went like this:


  - The conditions you expect to generate, the author of the callee
    code necessarily can enumerate in their own mind. They invoke the
    signals when things go wrong, after all!

  - The set of plausible-and-useful recoveries for any given signal is
    really quite small and predictable; that same author of the callee
    can mentally enumerate all the ways they could expect to be told to
    recover anyways.

  - If a signal is frequent enough to make it into the API this way,
    it's frequent enough that you're going to be invoking the handler
    regularly. Having that invocation be hundreds of times slower is
    undesirable.

  - Having the recovery logic at a distance from the origin and
    duplicated for each caller who wants to follow a given pattern
    actually leads to buggier, more fragile and less likely recovery.

  - The above points combined to -- quite naturally and without
    stated intention -- make the programmers using the system gradually
    shift any API they designed from using signals to using flags
    or variants that described the recovery mode they wanted into
    any subsystem with predictable signals.

So they eventually removed the remaining uses of the signal system(mostly bitrotted) and were happier for it.

I found this argument compelling. So much so I'm probably exaggeratingor mis-stating the arguments a bit. But it lead me to reflect a bit moreon the *realistic* uses of exceptions I've seen in programs, and foundmyself unable to debate it: most catch clauses I see do one of a smallnumber of very predictable things: ignore, retry, hard-fail, log, or tryone of a very small number of alternative strategies to accomplishingthe initial goal (create the file rather than open it, say) that theauthor of the callee could very well have predicted and codified in asmall tag-set of extra arguments.

So I removed the condition system from the design docs, and neverimplemented it. I propose structuring the libraries along these lines.That is, to have the callee authors actually think a bit about what anunusual-return means, which ways there might plausibly be for recoveringfrom it, and take a tag or vec-of-tags carrying the preferred recoverystrategy.

If this fails to hold together and we really, really have to revive somekind of structured at-a-distance recovery system, I'm going to suggestgoing back to the Mesa-like signal scheme I sketched out earlier (andabove). The main (substantial!) advantage it offers is that recoverypaths cause no actual unwinding-or-destruction -- recovery occurseffectively "at the signal site" -- so there's no question ofperturbation of the typestate. Unwinding still only happens duringfailure. The handler is invoked like any other function and if itsucceeds the unwinder is never even involved. IMO it's much tidier thantry/throw/catch and/or monads-by-macros.


-Graydon
_______________________________________________
Rust-dev mailing list
Rust-dev@mozilla.org
https://mail.mozilla.org/listinfo/rust-dev

Re: [rust-dev] Exceptions without exceptions (was Re: Writing cross-platform low-level code)

Reply via email to