Am 27.11.2015 um 16:23 schrieb Walter Bright:
On 11/26/2015 11:08 PM, Sönke Ludwig wrote:
This looks like it's creeping towards inventing a new script programming
language. Adding loops, switch statements, functions, etc., can't be far
off. Before you get too far down this path, consider:

Actually, no! Conditionals and loops are the only constructs - switch
is a
possibility, but basically nothing else. There will also never be
variables,
just constants. There is a definitive limit, namely when it becomes
impossible
to reason about the code in a generic way, without "executing" it, so in
particular anything that would make it touring complete is a no-go - no
recursion, no loop flow control statements, no goto. In fact, there
are no
"statements" at all, these are all purely declarative "directives".

I would say to that: "famous last words". As Exhibit A, I submit 'static
if', which has been getting increasing pressure to augment with loops.

It's hard to make guarantees, true. But at least "static foreach" has always been a relatively obvious candidate, and at the same time there is a well defined limit in case of the package recipe format.

1. JSON has a superset programming language - Javascript - which has
conventional syntax rather than the DEP4 proposal for odd syntax like

     if dub-version="<0.9.24"

which I would strongly recommend against. And, we already have a
Javascript engine written in D:

     https://github.com/DigitalMars/DMDScript

2. D itself can be used as a scripting language (when # is the first
character in the source code). Having DUB use this might be quite
interesting.

On one hand that means that now you have to take care of security
issues (holes
in the scripting engine/compiler or DoS attacks of various sorts) when
you want
to use this on a server (code.dlang.org).

You have to deal with that even if just plain json or sdl. After all,
the implementation of those formats could be susceptible to buffer
overflow or DoS as well. But this is less likely with json, because
you'd be using a well-used json parser rather than your own sdl parser
that is only used for Dub.

The important difference is that a JSON/SDL parser has a vastly lower complexity than a scripting engine and, more importantly, the source file is just parsed in a linear fashion, without any arbitrary runtime execution. So when just parsing the format, making sure that the file is below a certain maximum size is enough to prevent typical DoS vectors.

For scripts, you'd at least have to be able to terminate after a certain time (but even with a relatively low timeout, say 5 seconds, it would be easy to bring the system down temporarily, by e.g. publishing a bunch of package versions at once). And if things like file system or network access are possible, the execution would realistically have to be moved to a sandbox (VM/chroot) environment to be safe.

(Yes I saw later that you use it in some
other projects, but does it see use outside of your own things?)

The current version of the sdlang-d package has been downloaded 83 times (DUB not counted) and there are GitHub issues opened by about 13 different people, so it's definitely used for other projects, even if not yet hugely popular.

Javascript can only interact with its environment using the DOM. If Dub
presented its own DOM to the js engine, there isn't much the js code can
do other than go into an infinite loop, recursive overflow, or exploit a
buffer overflow.

This is where I'd see a similar problem to the "static foreach" one above. I'm pretty sure that people would start to ask for functions to access the file system, or to run arbitrary commands (which is fine on a local developer machine). It will be hard to argue against adding features that are so straight forward to implement.

Once there are big numbers of
packages, this could also mean that the hardware eventually needs to
be upgraded
when it would have done fine for a long time with a tiny declarative
parser.

I would think these problems have all been solved with Javascript, since
it is used so extensively. Javascript is also a lightweight scripting
language.

If the script is just a linear setup of the same fields as the current JSON/SDLang recipe then yes. But it's hard to predict what people will do with it. They might well go crazy and generate source code or other things that could take quite some time. It's just speculation, but the risk is there that this might considerably increase the load in the long run.

On the other hand, it's not possible with a script to make general
predictions
of how a package would behave, for example the script could select a
dependency
based on some environment variable or a file that is only defined on
the target
system.

That goes back to restricting the DOM.

True, but the pressure to add more power to the DOM will most likely be high.

Finally, it's always possible to switch from declarative to script
without
loosing expressive power, but not necessarily the other way around.

True, but consider this. JSON is a subset of Javascript. That means you
could add a subset of Javascript to JSON, i.e. just the if statements.
You'll have a clear design for it, and a clear path for how to do
further enhancements.

The fundamental difference is that JSON just describes a single value, while a JS file describes a program. So while a subset of JS would be an option, it would still mean a completely different appearance for the package recipe files. And of course this really is inventing a new language ("why doesn't ... work if this is JS?").

"With a standard json parser in Phobos, zip zap boom you're done. You
don't have to design it, argue about it, build it, document it, debug
it, test it, optimize it, explain it, deal with bug requests, deal with
enhancement requests, deal with legacy compatibility, build a converter,
build a gui tool for it, etc."

Let's say this isn't really an argument anymore now that it has
already been
done,

The existence of the DEPs suggest otherwise,

The SDLang format is just affected as a side-effect of 2 of those DEPs - just like the JSON format is. So, of course all supported formats have to be maintained and extended over time, but those are really quite rare occasions and the big majority of work is agnostic to the file format.

the number of posts in this
thread suggest otherwise,

The number of posts in this thread has multiple reasons, I'd argue that it's questionable to draw conclusions from that. Also, you need to contrast this to the amount of posts that complained about JSON, or those that would have happened for a different format choice.

the calls for a gui editor suggest otherwise,

That has nothing to do with SDLang or not (at least as far as I understand it).

the customer "should I use json or sdl" makes for an ongoing support
problem,

I can't remember that that has happened. The current situation is that SDLang is endorsed as the recommended format and those who are used to the JSON one can just continue to use it if they want.

no current means to convert between the two, etc.

That's really trivial to add, though. There is a generic internal representation and the only thing missing is the conversion back to SDLang.

but it wouldn't have been a strong argument anyway, because the SDLang
parser is actually in use for other projects as well, so it has to be
maintained
anyway. There really is very little investment necessary
development-wise, I
think it took me maybe three to four hours total to implement it,
including the
support on code.dlang.org. Creating the sdlang-d library itself (by Nick
Sabalausky) was of course a bigger task, as were the discussions and
the design
process.

The time for JSON was zero. You're a key developer here, and your time
is very valuable. I can't tell you what to work on, but I can't be quiet
about spending time on things with such marginal utility (and yes, I
waste time, too). By using sdl, though, you're also spending other
peoples' time, even if it's just "which format should I use for my
project?" and then the D forum members have to advise them.

Again, I haven't seen that question so far if I remember right. But this also leaves out the reason why SDLang support was added in the first place: To improve the experience of working with package recipes. A lot more people are going to do that a lot more frequently, so that even a small amount of reduced friction is going to be likely to save overall time.

And sometimes small things can have a great impact. (The original) D is a good example, a lot of its appeal came from seemingly trivial syntax changes, but those actually often make a big difference in readability and developer focus.

Of course that computation may not hold if we just compare the time that it saves/costs the D contributors alone. But I wonder how many new features in general will actually save overall time if you just look at the core contributors.

But apart from that, finding a format that a) allows (real) comments
and b) has
less syntax noise was necessary in any case. Sure, JSON *works*, but
it becomes
really unpleasant with more complicated files, and the whole
{"comment": "..."}
approach is nothing but an ugly and highly inconvenient hack, both
when writing
and when reading it.

I'm not accepting the "ugly and highly inconvenient hack" argument in
the light of the DEP4 proposal for conditional syntax that I already
commented on. And, as mentioned before, I use $(COMMENT ...) in Ddoc and
it works out quite nicely, even though Ddoc has no syntax for comments.

True, DEP4 definitely pushes the boundary of what is naturally representable with SDLang. But JSON files generally already have such a convoluted appearance that it simply becomes painfully involved and error prone to maintain them starting from a certain size. Since comments are mainly useful for larger documents, in the form of "comment" fields, they would make it even harder to read and maintain those.

If we are talking about how DEP4 looks for SDLang, just imagine how it would look for JSON...


And if comments were the only reason to use sdl, and a solid case was
made for them vs my suggestion, I'd vastly prefer adding /**/ to the
json support rather than switching to an apparently dead format.

It just has to be clear that it's not JSON anymore what we use then (interoperability). Of course comments are not the only reason, but I think it's safe to say that they are one of the two most important ones. The other one is that the XML-like structure of SDLang lends itself much better for the task (unfortunately XML is even more involved to read/write than JSON).

And the fact is that no matter which other format we would
have chosen (JSON with comments is also another language) we'd have these
bikeshedding discussions.

Sticking with json would enable you to simply ignore it. But you've been
pretty much forced to engage in this one.

Maybe it would have, maybe certain actions would still require to react. I don't have absolute numbers, but the complaints against JSON so far can probably easily rival those against SDL.


The way I see it:
 - It's clear that no solution will make everybody happy
- The number of opponents for each format has shown to be in the same order of magnitude - The number of proponents is always hard to judge, because most of them usually stay quiet - Talking about purely declarative formats, popularity is hardly a strong argument anyway, because most people will still have to learn a new format (outside of JSON or XML) - SDLang is so simple and intuitive to C-family developers that there is almost nothing to learn

Based on this I'd rather concentrate at how well a format is suited for the particular task. Ideally that will result in a good format gaining some popularity (seems to be the case with TOML and Rust).

Reply via email to