On 8/27/2018 2:20 PM, Rebecca Bettencourt via Unicode wrote:
Not correct. If that was literally true, then all HTML, XML, CSS, C, C#, Java, and Python source code files and their compilers would be non-conformant. It's more like, "if a process treats a sequence of bytes as Unicode plain text, then the bytes corresponding to the codes assigned to ⓅⓊⒶⒹⒶⓉⒶ just stand for ⓅⓊⒶⒹⒶⓉⒶ. Any meaning is imparted by the (human) reader." However, if the process treats the file as a source file in a markup language, there's nothing that prevents it from assigning particular interpretations to ⓅⓊⒶⒹⒶⓉⒶ, including, but not limited to, not displaying these code points as characters. The interpretation of the remainder of the file may well be conformant to the Unicode Standard, just as the display of the contents of many HTML elements is usually conformant to the Unicode Standard.
Correct, the rub here is that all these schemes that treat
characters as both syntax and text depending on context amount to
markup languages and are therefore ipso facto no longer plain
text (except when displayed as source code, though even applying
syntax coloring would no longer be treating the data purely as
plain text). In-band markup thus has a dual nature as plain text and rich
text, depending on how it is processed.
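To make the dual nature concrete, here is a minimal Python sketch of one string processed both ways. The function names and the rule "private-use code points are syntax and are not displayed" are assumptions made up for illustration; they are not a convention from this thread or from the Unicode Standard.

```python
import unicodedata


def is_private_use(ch: str) -> bool:
    # General category "Co" marks private-use code points.
    return unicodedata.category(ch) == "Co"


def as_plain_text(data: str) -> str:
    # Plain-text view: every code point, private-use or not, passes
    # through unchanged; any meaning is left to the reader.
    return data


def as_markup(data: str) -> str:
    # Markup view under a hypothetical private agreement: private-use
    # code points are in-band syntax and are not displayed as characters.
    return "".join(ch for ch in data if not is_private_use(ch))


# U+E000 is the first code point of the BMP Private Use Area.
sample = "bold\uE000 text"
plain = as_plain_text(sample)
rendered = as_markup(sample)
```

The same data is plain text in the first call and source text of a (tiny) markup language in the second; only the processing differs.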
That could probably be remedied by the usual techniques. :)
There are situations where an ad-hoc markup language seems to fulfill a need that is not well served by the existing full-fledged markup languages. You find them on internet "bulletin boards" or services like GitHub, where pure plain text is too restrictive but the required text styles are purposefully limited, which makes the syntactic overhead of a full-featured markup language burdensome. Too bad that there's been no "winner" among these, and therefore no universally accepted one. Had there been one, it might have presented an obvious target for a PUA extension.
A./

