Re: What are best practices around toString?

tsbockman via Digitalmars-d-learn Sat, 01 Oct 2022 01:31:20 -0700

On Friday, 30 September 2022 at 13:11:56 UTC, christian.koestlinwrote:

Dear Dlang experts,
up until now I was perfectly happy with implementing`(override) string toString() const` or something to get nicelyformatted (mostly debug) output for my structs, classes andexceptions.

Human beings read extremely slowly compared to how quickly the GCcan allocate and free `string`s as needed, so there is no need tocomplicate your code with more text formatting strategies unlessyou want to generate this debug output far faster than a humancan actually read it.

But recently I stumbled uponhttps://wiki.dlang.org/Defining_custom_print_format_specifiersand additionallyhttps://github.com/dlang/dmd/blob/4ff1eec2ce7d990dcd58e5b641ef3d0a1676b9bb/druntime/src/object.d#L2637 which at first sight is great, because it provides the same customization of an objects representation with less memory allocations.
When grepping through phobos, there are a bunch of "different"signatures implemented for this, e.g.
```d
...
phobos/std/typecons.d: void toString(DG)(scope DG sink)const
...
phobos/std/typecons.d: void toString(DG, Char)(scope DGsink, scope const ref FormatSpec!Char fmt) const
...
phobos/std/typecons.d: void toString()(scope voiddelegate(const(char)[]) sink, scope const ref FormatSpec!charfmt)
...
phobos/std/sumtype.d: void toString(this This, Sink,Char)(ref Sink sink, const ref FormatSpec!Char fmt);
...
```
to just show a few.

The `FormatSpec` parameter only belongs there if you're actuallygoing to do something useful with it in your `toString`implementation. Even if you are going to use it, you shouldprobably still provide a convenience overload with a defaultspecifier.

Furthermore, when one works with instances of struct, objectsor exceptions a `aInstance.toString()` does not "work" when oneonly implements the sink interface (which is to be expected),whereas a `std.conv.to!string` or a formatted write with `%s`always works (no matter what was used to implement thetoString).



I generally do something like this:

```D
struct A {
    string message;
    int enthusiasm;

    void toString(DG)(scope DG sink) scope const @safe
        if(is(DG : void delegate(scope const(char[])) @safe)
        || is(DG : void function(scope const(char[])) @safe))
    {
        import std.format : formattedWrite;
        sink(message);
        sink(" x ");
        formattedWrite!"%d"(sink, enthusiasm);
        sink("!");
    }
    string toString() scope const pure @safe {
        StringBuilder builder;

toString(&(builder.opCall)); // Find the exact stringlength.

        builder.allocate();
        toString(&(builder.opCall)); // Actually write the chars.
        return builder.finish();
    }
}
```

So, the first `toString` overload defines how to format the valueto text, while the second overload does memory management andforwards the formatting work to the first.


`StringBuilder` is a utility shared across the entire project:

```D
struct StringBuilder {
private:
    char[] buffer;
    size_t next;

public:

void opCall(scope const(char[]) str) scope pure @safe nothrow@nogc {

        const curr = next;
        next += str.length;
        if(buffer !is null)
            buffer[curr .. next] = str[];
    }
    void allocate() scope pure @safe nothrow {
        buffer = new char[next];
        next = 0;
    }

void allocate(const(size_t) maxLength) scope pure @safenothrow {

        buffer = new char[maxLength];
        next = 0;
    }
    string finish() pure @trusted nothrow @nogc {
        assert(buffer !is null);
        string ret = cast(immutable) buffer[0 .. next];
        buffer = null;
        next = 0;
        return ret;
    }
}
```

The first formatting pass to find the required buffer length canbe skipped if you can somehow pre-calculate the maximum possiblelength, or if you prefer the common strategy of repeatedlyre-allocating the buffer with exponentially increasing size usedby the likes of `std.array.Appender`. Since the API for`toString` remains the same regardless, you are free to choosethe best strategy for each type.

Re: What are best practices around toString?

Reply via email to