Re: Range interface for std.serialization

Tyler Jameson Little Thu, 22 Aug 2013 19:10:54 -0700

On Thursday, 22 August 2013 at 14:48:57 UTC, Dicebot wrote:

On Thursday, 22 August 2013 at 03:13:46 UTC, Tyler Jameson Little wrote:
On Wednesday, 21 August 2013 at 20:21:49 UTC, Dicebot wrote:
It should be range of strings - one call to popFront should serialize one object from input object range and provide matching string buffer.
I don't like this because it still caches the whole object into memory. In a memory-restricted application, this is unacceptable.
Well, in memory-restricted applications having large object at all is unacceptable. Rationale is that you hardly ever want half-deserialized object. If environment is very restrictive, smaller objects will be used anyway (list of smaller objects).

It seems you and I are trying to solve two very different problems. Perhaps if I explain my use case, it'll make things clearer.

I have a server that serializes data from a socket, processes that data, then updates internal state and sends notifications to clients (which involves serialization as well).

When new clients connect, they need all of this internal state, so the easiest way to do this is to create one large object out of all of the smaller objects:


    class Widget {
    }

    class InternalState {
        Widget[string] widgets;
        // ... other data here
    }

InternalState isn't very big by itself; it just has an associative array of Widget references plus some other rather small data. When serialized, however, this can get quite large. Since archive formats are orders of magnitude less efficient than in-memory stores, caching the archived version of the internal state can be prohibitively expensive.

Let's say the serialized form of the internal state is 5MB, and I have 128MB available, while 50MB or so is used by the application. This leaves about 70MB, so I can only support 14 connected clients.

With a streaming serializer (per object), I'll get that 5MB down to a few hundred KB and I can support many more clients.
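
To make the per-object idea concrete, here is a rough sketch, not a proposal for the actual std.serialization API. The `id` field, `serializeWidget`, and `serializedWidgets` are all made up for illustration. The point is only that a lazy range produces one widget's archive at a time, so peak memory is bounded by the largest widget rather than by the whole state:

```d
import std.algorithm : map;
import std.conv : to;

class Widget {
    int id; // hypothetical field, only here so there is something to serialize
    this(int id) { this.id = id; }
}

class InternalState {
    Widget[string] widgets;
}

// Hypothetical stand-in for a real archive format.
string serializeWidget(Widget w) {
    return "{\"id\":" ~ w.id.to!string ~ "}";
}

// Lazy range: a widget is serialized only when the consumer asks for it,
// so at most one serialized widget needs to be alive at a time.
auto serializedWidgets(InternalState state) {
    return state.widgets.byValue.map!(w => serializeWidget(w));
}

void main() {
    auto state = new InternalState;
    state.widgets["a"] = new Widget(1);
    state.widgets["b"] = new Widget(2);

    size_t total;
    foreach (chunk; serializedWidgets(state)) {
        // In the server this would be a socket write; here we only count bytes.
        total += chunk.length;
    }
    assert(total == 16); // two archives of 8 characters each
}
```

Each chunk can be written to the client and dropped before the next one is built, which is where the "few hundred KB" figure would come from.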

...
There's no reason why the serializer can't output this in chunks
Outputting on its own is not useful to discuss - in pipe model output matches input. What is the point in outputting partial chunks of serialized object if you still need to provide it as a whole to the input?

This only makes sense if you are deserializing right after serializing, which is *not* a common thing to do.

Also, it's much more likely that you need to serialize a single object (as in a REST API, a 3D model parser [think COLLADA], or a config parser). Providing a range seems to fit only a small niche: people who need to dump the state of the system. With single-object serialization and chunked output, you can define your own range to get the same effect, but with an API like the one you detailed, you can't avoid memory problems without going outside std.
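
Roughly what I have in mind for single-object serialization with chunked output, as a sketch only. The `serialize` signature, the sink delegate, and the `id` field are assumptions of mine, not anything that exists in std:

```d
import std.conv : to;

class Widget {
    int id; // hypothetical field, for illustration
    this(int id) { this.id = id; }
}

class InternalState {
    Widget[string] widgets;
}

// Hypothetical API: serialize ONE object, pushing the output to `sink`
// piece by piece instead of building the whole archive in memory.
void serialize(InternalState state, scope void delegate(const(char)[]) sink) {
    sink("{");
    bool first = true;
    foreach (name, w; state.widgets) {
        if (!first) sink(",");
        first = false;
        sink("\"" ~ name ~ "\":" ~ w.id.to!string);
    }
    sink("}");
}

void main() {
    auto state = new InternalState;
    state.widgets["a"] = new Widget(1);

    size_t calls;
    string collected;
    serialize(state, (const(char)[] chunk) {
        ++calls;
        // A real sink would write to the socket; collecting is just for the check.
        collected ~= chunk.idup;
    });
    assert(collected == `{"a":1}`);
    assert(calls == 3); // opening brace, one entry, closing brace
}
```

A range over many objects can then be layered on top by calling `serialize` once per element, while the single-object case never has to hold more than one chunk.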
