Re: AST in JSON format

Maciej Stachowiak Mon, 07 Dec 2009 11:17:03 -0800


On Dec 7, 2009, at 10:11 AM, Brendan Eich wrote:

On Dec 7, 2009, at 8:56 AM, Maciej Stachowiak wrote:
Actually, this is potentially a factor for any natively supportedAST format. If execution is direct rather than via transoformationto JS source, the implementation would have to verify that the ASTis one that could be created by parsing JS source.
This reminds me of SafeTSA:

http://portal.acm.org/citation.cfm?id=378825
http://portal.acm.org/citation.cfm?doid=1377492.1377496
and more specifically of work by Christian Stork and Michael Franz,see:
http://www.ics.uci.edu/~cstork/
The idea as I first heard it from Chris and Michael was toarithmetically code ASTs such that no ill-formed tree could beencoded. You could take a JPEG of the Mona Lisa, run it through thedecoder, and if it succeeded, get a (almost-certainly) nonsensicalyet syntactically well-formed AST. The encoding is fairly efficient,not as good as optimized Huffman coding but close.
This work was motivated by the sometimes bad (O(n^4)) complexity inthe Java bytecode verifier (or at least in early versions of it).
My view is that there will never be a standardized bytecode(politics look insuperable to me), and more: that there should notbe. Besides the conflicts among target VM technical details, andignoring latent IPR issues, I believe view-source capability isessential. Even minification lets one pretty-print (http://jsbeautifier.org/) and learn or diagnose.
JS is still used in edit-shift-reload, crawl-walk-run developmentstyle and part of this culture involves sharing. Of course no onecould mandate binary syntax to the exclusion of source, but a binarysyntax that did not allow pretty-printing would shove us all downthe slippery slope toward the opaque, closed-box world of Javaapplets, Flash SWFs (modulo Flash+Flex's server-fetched view-sourcecapabilities), etc.
Compression at the transport (session, whatever, the model isclimbing the traditional layering) is a separate issue.

Given the above, do you think there is a valid case to be made for aserialization format other than JavaScript source itself? It seemslike anything binary is likely to have the same downsides as bytecode,and anything text-based enough to be truly readable and view-sourcecompatible would be rather inefficient as a wire format (I wouldconsider a JSON encoding with mysterious integers all over to be nottruly view-source compatible). Thus I would propose that we should notdefine an alternate serialization at all.

(This is as considered separately from the possibility ofprogramatically manipulating a parsed AST - the use cases for that areclear. Though there may still be verification issues depending on thenature of the manipulation API. It seems like the possibilities areeither specialized objects that enforce validity on every individualmanipulation, or something that accepts JSON-like objects and verifiesvalidity after the fact, or something that accepts JSON-like objectsand verifies validity by converting to JavaScript source code and thenparsing it).


Regards,
Maciej

_______________________________________________
es-discuss mailing list
[email protected]
https://mail.mozilla.org/listinfo/es-discuss

Re: AST in JSON format

Reply via email to