Re: std.jgrandson

Sönke Ludwig via Digitalmars-d Sun, 03 Aug 2014 02:42:07 -0700

A few thoughts based on my experience with vibe.data.json:

1. No decoding of strings appears to mean that "Value" also alwayscontains encoded strings. This seems the be a leaky and also error proneleaky abstraction. For the token stream, performance should be toppriority, so it's okay to not decode there, but "Value" is a high levelabstraction of a JSON value, so it should really hide all implementationdetails of the storage format.

2. Algebraic is a good choice for its generic handling of operations onthe contained types (which isn't exposed here, though). However, atagged union type in my experience has quite some advantages forusability. Since adding a type tag possibly affects the interface in anon-backwards compatible way, this should be evaluated early on.

2.b) I'm currently working on a generic tagged union type that alsoenables operations between values in a natural generic way. This has thebig advantage of not having to manually define operators like in"Value", which is error prone and often limited (I've had to make manyfixes and additions in this part of the code over time).

3. Use of "opDispatch" for an open set of members has been criticizedfor vibe.data.json before and I agree with that criticism. The onlyadvantage is saving a few keystrokes (json.key instead of json["key"]),but I came to the conclusion that the right approach to work with JSONvalues in D is to always directly deserialize when/if possible anyway,which mostly makes this is a moot point.

This approach has a lot of advantages, e.g. reduction of allocations,performance of field access and avoiding typos when accessing fields.Especially the last point is interesting, because opDispatch based fieldaccess gives the false impression that a static field is accessed.

The decision to minimize the number of static fields within "Value"reduces the chance of accidentally accessing a static field instead ofhitting opDispatch, but there are still *some* static fields/methods andany later addition of a symbol must now be considered a breaking change.

3.b) Bad interaction of UFCS and opDispatch: Functions like "remove" and"assume" certainly look like they could be used with UFCS, butopDispatch destroys that possibility.

4. I know the stance on this is often "The D module system has enoughfacilities to disambiguate" (which is not really a valid argument, butrather just the lack of a counter argument, IMO), but I highly dislikethe choice to leave off any mention of "JSON" or "Json" in the globalsymbol names. Using the module either requires to always use a renamedimport or a manual alias, or the resulting source code will always leavethe reader wondering what kind of data is actually handled. Handlingmultiple "value" types in a single piece of code, which is not uncommon(e.g. JSON + BSON/ini value/...) would always require explicitdisambiguation. I'd certainly include the "JSON" or "Json" part in thenames.

5. Whatever happens, *please* let's aim for a module name ofstd.data.json (similar to std.digest.*), so that any data formats addedlater are nicely organized. All existing data format support (XML + CSV)doesn't follow contemporary Phobos style, so they will need to bedeprecated at some point anyway, freeing the way for a clean annon-breaking transition to a more organized module hierarchy.

6. (Possibly compile time optional) support for keeping track ofline/column numbers is often important for better error messages, sothat would be good to have included as part of the parser and in the"Token" type.


Sönke

Re: std.jgrandson

Reply via email to