A good compromise between human readability, machine processability and
filesize would be using YAML.
Unlike JSON, YAML supports comments, anchors and references, multiple
documents in a file and several other features.
Regards,
Marius Spix
On Fri, 31 Aug 2018 06:58:37 +0200 (CEST) Marcel Schn
On Thu, Aug 30 2018 at 2:27 +0200, unicode@unicode.org writes:
[...]
> Given NamesList.txt / Code Charts comments are kept minimal by design,
> one couldn’t simply pop them into XML or whatever, as the result would be
> disappointing and call for completion in the aftermath. Yet another task
On 30/08/18 23:34 Philippe Verdy via Unicode wrote:
>
> Welel an alternative to XML is JSON which is more compact and faster/simpler
> to process;
Thanks for pointing the problem and the solution alike. Indeed the main
drawback of the XML
format of UCD is that it results in an “insane” filesize
Thank you for looking into this. First, I’m unable to retrieve the publication
you are citing,
but a February thread had nearly the same subject, referring to Vol. 50. How
did you
compute these figures? Is that a code phrase to say: “The same questions over
and
over again; let’s settle this o
>
> On 29 August 2018 at 06:47 "Janusz S. Bień via Unicode"
> wrote:
>
> > >
> > Storing this information in a font, by hook or crook, would lock
> > users
> > of those PUA characters into that font. At that rate, you might as
> > well
> > use ASCII-hacke
Welel an alternative to XML is JSON which is more compact and
faster/simpler to process; however JSON has no explicit schema, unless the
schema is being made part of the data itself, complicating its structure
(with many levels of arrays of arrays, in which case it becomes less easy
to read by huma
UnicodeData.txt was devised long before any of the other UCD data files. Though
it might seem like a simple enhancement to us, adding a header block, or even a
single line, would break a lot of existing processes that were built long ago
to parse this file.
So Unicode can't add a header to this
7 matches
Mail list logo