Re: std.experimental.checkedint is ready for comments!

tsbockman via Digitalmars-d Wed, 15 Jun 2016 11:36:33 -0700

On Wednesday, 15 June 2016 at 16:40:19 UTC, Andrei Alexandrescuwrote:

Thanks for this work.
[...]
I think there are a few considerable issues with the proposal,but also that all are fixable.

Your message was very long, so for the moment I'm going to filterit down to just the high-level design criticism. (The rest isunimportant unless/until we reach consensus on the design,anyway.)

* The opening documentation states this proposal is concernedwith adding checking capabilities to integral types. It is afull-blown package with 6 modules totaling 4690 lines as wccounts. (For comparison: std.experimental.allocator, whichoffers many facilities and implements a number of difficultstructures and algorithms, has 11831 lines.) That's a highbudget and a high imposition on users for adding checks tointegral types. Such extensive style of coding goes against theD style we want to promote.
[...]
The budget I'd  establish for this is one parameterized type in
one module of manageable size. Several parameterized types would
be okay if they characterize distinct abstractions and use thesame backend. Anything else is an indication of a design runawry.


This can be summarized as, "It's too big and complicated."

`checkedint` as it stands is, I believe, fairly close to thesmallest implementation possible in D today, within theconstraints of the features demanded by the community in pastdiscussions, coupled with my high-level design goals.

If you want something shorter or simpler, you will have to cutfeatures or compromise the design in other ways. (Or improve theD language to facilitate a more elegant design.)


Some features and design goals that combine to motivate my design:

1) Checked types must signal an error (somehow) whenever theirbehaviour

      deviates from that of an ideal mathematical integer.

2) It should be possible to recover from errors - using`assert(false)` ora deliberate divide-by-zero to crash the program is baddesign unlessthe condition that triggers it is never supposed to happen,ever.

3) Performance (with respect to both speed and memory use)should be asclose as possible to that of the built-in machine integertypes.

4) The API should minimize verbosity and ceremony, becauseotherwise hardlyanyone will use it - people generally prefer convenienceover safety.

5) Writing generic code that works correctly with both checkedand unchecked

      types must be easy.

6) The API must make safe usage easy, and (accidental) unsafeusage hard,because people generally don't pay much attention to thedocs (even ifthey're good). A false sense of security is worse than noneat all.


   7) The API must be usable in `nothrow @nogc` code.

8) The number of distinct template instantiations generated innatural usemust be finite, to prevent excessive combinatorialexplosion whenchecked types are used in public APIs. (Templates that arejust aliases,and small functions that always inline don't count againstthis.)

Also it is worrisome that one type wasn't enough (in spite ofextensiveparameterization with policies) and two are needed, with subtledifferences
between them that need to be summarized in a table.

The reason for the `SmartInt` versus `SafeInt` split is that withD's current semantics

(4) and (5) conflict.

`SmartInt` prioritizes (4); `SafeInt` and `DebugInt` prioritize(5).

Getting to the design: the root of the problem is a byzantinedesign that is closed to extension.

The design was closed deliberately because of (8). Template bloatis a major concern, even with the current finite design.

I want `checkedint` to be usable in public APIs, and thatrequires some standardization of error handling and base types tobe enforced upon the users. Otherwise, everyone will choosesomething different and all template instantiations involvinginteger types will become practically single-use.

Looking at the IntFlagPolicy, it offers three canned behavior:throws, asserts, and noex.

The choice of policies is motivated by the naturalincompatibility of (2), (4), (6), and (7). I built in enoughvariety to allow people to choose their own priorities amongthose goals, and no more because of (8).

* One of the first things I looked for was establishing boundsfor numbers, like Smart!(int, 0, 100) for percentage. For allits might, this package does not offer this basic facility, andfrom what I can tell does not allow users to enforce it viapolicies.

Here you are suggesting adding even more complexity to a designthat you have already de-facto rejected as overly complex. Asdiscussed earlier in this very thread, I studied adding supportfor arbitrary bounds and decided not to pursue that right nowbecause implementing it in a performant way would greatlyincrease the complexity of `checkedint`, and make the templatebloat problem much worse.

Also, this suggests that other types should be considered, howabout Smart!bool and Smart!double?

Neither `bool` nor `double` has the kind of severe-but-fixablesafety and correctness issues that the machine integer types do,which motivates the `checkedint` design. No doubt someone canmake up some sort of meaning to attach to those symbols, but itwill most likely have nothing to do with `SmartInt`.

* Not sure why divPow2 is needed, why not some enforcedOp!"<<"

Because (although similar) a bit shift is actually semanticallydifferent than dividing or multiplying by a power of two. The bitshift implies different rules for rounding and overflow.

I realized in testing that even for `SmartInt`, the bit shiftsemantics are still useful sometimes, and decided that it wasbetter not to confuse the two. A `smartOp` version of the bitshifts is necessary because the built-in bit shifts have someundefined behaviour that needs to be fixed.

etc. Same about pow, why not enforcedOp!"^^"?

`pow()` exists as a free function to satisfy (5), and because the`^^` and `^^=` operators both have language-level bugs thatcurrently make their use incompatible with (6).

I see little value in free functions such as e.g. abs() becausethey are trivial one-liners. I understand the need forcompleteness, but it seems a good aspiration for consistency isbeing marred by a bunch of code pulp that really does nothinginteresting. Probably not worth it.

`checkedint`-aware versions of functions like `abs()` arenecessary to satisfy (4) and (6) together.

The hook may have state (e.g. hysteresis, NaN flag, errorstate, etc) so that's why it may be embedded within Checked.

The early iterations of `checkedint` worked this way (although Ihad no plans to support user-defined hooks). I implemented anddebugged it, and thought it was about ready to submit many monthsago.

Then I actually tried *using* it, and hated it. The problem withusing a NaN state, is that in nothrow code you have to manuallycheck it before *every* call to an external(non-`checkedint`-aware) function, or you may accidentally loseit.

As a result, safe NaN-based APIs violate (4), while concise APIsviolate (6). The IEEE-inspired sticky flags feature is mysolution to this problem, and it is far more pleasant to workwith in practice - as well as faster and more memory efficient.

Meanwhile, in code that allows exceptions, there is no reason topay the speed and memory penalty of carting around the NaN stateto check later - just throw on the spot whenever something goeswrong.

Re: std.experimental.checkedint is ready for comments!

Reply via email to