On Monday, 2 June 2014 at 00:39:48 UTC, Jonathan M Davis via Digitalmars-d wrote:
It's my understanding that the current design of std.json is considered
to be poor, but I haven't used it, so I don't know the details.
But if it's as slow as you're finding it to be, then I think that
supports the idea that it needs a redesign. The question then is what a
new std.json should look like and who would do it. And that pretty much
comes down to an interested and motivated developer coming up with and
implementing a new design and then proposing it here. Until someone
takes up that torch, we'll be stuck with what we have. Certainly,
there's no fundamental reason why we can't have a lightning-fast
std.json. With ranges and slices, parsing in D should in general be
faster than in C/C++ (and definitely faster than in Haskell or Python),
and if it isn't, that indicates that the implementation (if not the
whole design) of that code needs to be redone.

I know that vibe.d uses its own JSON implementation, but I don't know
how much of it is part of the public API and how much is simply used
internally: http://vibed.org

- Jonathan M Davis

I implemented a JSON library myself which parses JSON and generates JSON objects similar to how std.json does. I wrote it largely because of the poor API in the standard library at the time, but I think by this point nearly all of those concerns have been alleviated.

At the time I benchmarked it against std.json and vibe.d's implementation, and they were all pretty equivalent in terms of performance; mine settled just slightly ahead of std.json. If there are any major performance gains to be made, I believe we will have to completely rethink how we go about parsing JSON. I suspect transparent character encoding and decoding (dchar ranges) might be one potential source of trouble.

In terms of API, I wouldn't go completely for an approach based on serialising to structs. Having a tagged union type is still helpful for situations where you just want to get at some JSON data quickly and do something with it. I have thought a great deal about writing data *to* JSON strings, however, and I have an idea for this I would like to share.

First, you define by convention a function writeJSON which takes some value and an output range, and writes the value's JSON representation directly to that output range. The library defines writeJSON overloads for the standard types.

void writeJSON(OutputRange)(JSONValue value, ref OutputRange outRange);
void writeJSON(OutputRange)(string value, ref OutputRange outRange);
void writeJSON(OutputRange)(int value, ref OutputRange outRange);
void writeJSON(OutputRange)(bool value, ref OutputRange outRange);
void writeJSON(OutputRange)(typeof(null) value, ref OutputRange outRange);
// ...
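
To make the convention concrete, here is a minimal sketch of what the string overload might look like (my sketch, not settled code; the escaping is deliberately incomplete, and a real version would also handle control characters and \u sequences):

import std.range.primitives : put;

void writeJSON(OutputRange)(string value, ref OutputRange outRange) {
    // Quote and escape the string, writing straight into the output range.
    outRange.put('"');
    foreach(char c; value) {
        switch (c) {
            case '"':  outRange.put(`\"`); break;
            case '\\': outRange.put(`\\`); break;
            case '\n': outRange.put(`\n`); break;
            default:   outRange.put(c);
        }
    }
    outRange.put('"');
}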

You define one additional writeJSON function, which takes any input range of type T and writes a JSON array of Ts. (So string[] will write an array of strings, int[] an array of ints, etc.)

void writeJSON(InputRange, OutputRange)(InputRange inRange, ref OutputRange outRange) {
    outRange.put('[');  // a JSON array needs its brackets and commas
    bool first = true;
    foreach(ref value; inRange) {
        if (!first) outRange.put(',');
        first = false;
        writeJSON(value, outRange);
    }
    outRange.put(']');
}

Add a convenience function which takes variadic arguments alternating string, T, string, U, and so on. Call it, say, writeJSONObject; a sketch follows.
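
A minimal sketch of how writeJSONObject could be implemented, assuming keys are always strings and using a compile-time foreach over the argument tuple (the names and the evenness constraint here are illustrative, not a settled design):

void writeJSONObject(OutputRange, Args...)(ref OutputRange outRange, Args args)
if (Args.length % 2 == 0) {
    outRange.put('{');
    foreach(i, arg; args) {
        static if (i % 2 == 0) {
            // Even positions are keys, written as JSON strings.
            static assert(is(typeof(arg) : string), "object keys must be strings");
            if (i > 0) outRange.put(',');
            writeJSON(arg, outRange);
            outRange.put(':');
        } else {
            // Odd positions are values; dispatch to the matching overload.
            writeJSON(arg, outRange);
        }
    }
    outRange.put('}');
}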

You now have a decent framework for writing objects directly to OutputRanges.

struct Foo {
    AnotherType bar;
    string stringValue;
    int intValue;
}

void writeJSON(OutputRange)(Foo foo, ref OutputRange outRange) {
    // Writes {"bar":<bar_value>, ... }
    writeJSONObject(outRange,
         // writeJSONObject calls writeJSON for AnotherType, etc.
        "bar", foo.bar,
        "stringValue", foo.stringValue,
        "intValue", foo.intValue
    );
}
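
Putting it together, usage might look something like this (someFoo here is a hypothetical Foo value, and appender is just one possible output range):

import std.array : appender;

auto buffer = appender!string();
writeJSON(someFoo, buffer);
// buffer.data now holds {"bar":...,"stringValue":"...","intValue":...}

Because everything goes through an output range, the same writeJSON code could target a string appender, a socket, or a fixed-size stack buffer.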

There are more details, and something would need to be done to handle stack overflows from deeply nested values (inlining?), but that's the idea I had for improving JSON writing, at least. One advantage of this approach is that it wouldn't depend on the GC, and scoped buffers could be used. (A @nogc candidate, I think.) You can't get that ability out of something like toJSON, which produces the whole string at once.
