Re: The Case Against Autodecode

Timon Gehr via Digitalmars-d Thu, 02 Jun 2016 15:16:27 -0700

On 02.06.2016 23:56, Walter Bright wrote:

On 6/2/2016 1:12 PM, Timon Gehr wrote:

...
It is not
meaningful to compare utf-8 and utf-16 code units directly.


Yes, you have a good point. But we do allow things like:

    byte b;
    if (b == 10000) ...

Well, this is a somewhat different case, because 10000 is just notrepresentable as a byte. Every value that fits in a byte fits in an intthough.

It's different for code units. They are incompatible both ways. E.g.dchar obviously does not fit in a char, and while the lower half of charis compatible with dchar, the upper half is specific to the encoding.dchar cannot represent upper half char code units. You get the codepoints with the corresponding values instead.


E.g.:

void main(){
    import std.stdio,std.utf;
    foreach(dchar d;"ö".byCodeUnit)
        writeln(d); // "Ã", "¶"
}

Re: The Case Against Autodecode

Reply via email to